parse a text file

Posted on 2007-11-24
Last Modified: 2010-03-05
Each alarm starts with UNIT6, I'm trying to get all the rows of each alarm into one row and from there I can sort and do whatever I need. I've had some success using while loops but the problem is some alarms have 4 rows,  some have 5,6, or 7. Once it gets beyond 4 rows I get duplicates or I only get alarms with 5 rows depending on how I set up the while loop. I attached the code I have been trying so far it is somewhat messy from all the retries.
Here are is a partial alarm log:
           UNIT6        LOT-0086  BLOCK-0086    ENVIR     2007-11-14  00:09:19.37
*   ALARM                         D142B      
   (10239) 7312 RATIO THRESHOLD                    

           UNIT6        LOT-0086             ENVIR     2007-11-14  00:09:20.05
*   ALARM                                        
   (10241) 7000 CABINET OPEN                                                    
                Cabinet open.                                        
                02 01 00

           UNIT6        LOT-0086             COMM      2007-11-14  00:09:20.46
**  ALARM                                        
   (10242) 8999 BLOCKED FROM USE                                                
                PCBLOCKHub      PCBLOCKHub      CC bank            
                02 00 1d
#!/usr/bin/perl -w
use strict;
open DDDLOG,  "expx.txt" or die "Cannot open file $!";
open DDDRPT1, ">all.txt" or die "Cannot open file $!";
my @FFF; my $fff;
my @DDD; my $ddd;
my @NUM; my $num;
my @GGG; my $ggg;
my @DESC; my $desc;
my @DESC2; my $desc2;
my @DESC3; my $desc3;
########  PARSE RAW FILE ################
MAIN: while (<DDDLOG>){
            if (/\s+UNIT6\s+LOT-\d\d\d\d/){
            $fff = $_;
            @FFF = split /\s+/, $fff;
            #print "@FFF\n";
         LOOP2: while (<DDDLOG>){
                     redo MAIN if (/\s+UNIT6/);
                     if (/\*|\*\*|\*\*\*/){
                     $ddd = $_;
                     @DDD = split /\s+/, $ddd;
                     #print "@FFF @DDD\n";
             LOOP3: while (<DDDLOG>){
                         redo MAIN if (/\s+UNIT6/);
                         if (/7\d\d\d|8\d\d\d/){
                         $num = $_;
                         @NUM = split /\s+/, $num;
                         #print DDDRPT1 "@FFF @DDD @NUM\n";
                   LOOP4: while (<DDDLOG>){
                               redo MAIN if (/\s+UNIT6/);
                               if (/[a-zA-Z0-9]/){
                              $desc = $_;
                               @DESC = split /\s+/, $desc;
                              #print DDDRPT1 "@FFF @DDD @NUM @DESC\n";
                               # print DDDRPT1 "$DESC[1] $DESC[2] $DESC[3]\n";
     LOOP5: while (<DDDLOG>){
                 redo MAIN if (/\s+UNIT6/);
                 if (/[a-zA-Z0-9]/){
                 $desc2 = $_;
                 @DESC2 = split /\s+/, $desc2;
               print DDDRPT1 "@FFF @DDD @NUM @DESC @DESC2\n";
                  # print DDDRPT1 "$DESC[1] $DESC[2] $DESC[3]\n";
                 #elsif(/ /){
                 #print DDDRPT1 "@FFF @DDD @NUM @DESC\n";
     LOOP6: while (<DDDLOG>){
                 redo MAIN if (/\s+UNIT6/);
                 if (/[a-zA-Z0-9]/){
                 $desc3 = $_;
                 @DESC3 = split /\s+/, $desc3;
               #print DDDRPT1 "@FFF @DDD @NUM @DESC @DESC2 @DESC3\n";
                  # print DDDRPT1 "$DESC[1] $DESC[2] $DESC[3]\n";
                 elsif(/ /){
                 #print DDDRPT1 "@FFF @DDD @NUM @DESC\n";
close DDDRPT1;
close DDDLOG;

Open in new window

Question by:omcr
LVL 84

Accepted Solution

ozo earned 125 total points
ID: 20343480
#!/usr/bin/perl -w
use strict;

open DDDLOG,  "expx.txt" or die "Cannot open file $!";
open DDDRPT1, ">all.txt" or die "Cannot open file $!";

my $e="";
while( <DDDLOG> ){
    print DDDRPT1 $e and $e="\n" if /^\s+UNIT6\s/;
    print DDDRPT1;
print DDDRPT1 $e;
close DDDRPT1;


Author Comment

ID: 20343665
Amazing. I'm not going say how much time (days) I've spent on this.
If you can take a second to explain it that would help. About the only thing I think I understand is $e adds a "\n" to the end everytime it sees UNIT6. Are you chomping every line in the file, I think so, and then newline in $e signals where to end. But I don't understand how you get each line into $e and make it one line without the  use of arrays?. Anyway there is one problem, the actual log file contains several lines of  trash at the beginning and end. I'm going to try to enclose your code in a while loop and see what happens

Featured Post

Free Tool: Subnet Calculator

The subnet calculator helps you design networks by taking an IP address and network mask and returning information such as network, broadcast address, and host range.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

I have been pestered over the years to produce and distribute regular data extracts, and often the request have explicitly requested the data be emailed as an Excel attachement; specifically Excel, as it appears: CSV files confuse (no Red or Green h…
There are many situations when we need to display the data in sorted order. For example: Student details by name or by rank or by total marks etc. If you are working on data driven based projects then you will use sorting techniques very frequently.…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…

856 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question