Still celebrating National IT Professionals Day with 3 months of free Premium Membership. Use Code ITDAY17


extract data from text file

Posted on 2011-02-19
Medium Priority
Last Modified: 2012-05-11

I have a large text file from which I need to extract lines as below:
time-range 02293404
 absolute end 23:59 19 September 2009
time-range 10May2010_SR405807
 absolute end 23:59 10 May 2010
time-range 11july2010
 absolute end 23:59 11 July 2010

The final output would be:
scheduler <name> start-date YYYY-MM-DD.HH:MM stop-date YYYY-MM-DD.HH:MM

For start-data we can use static date as: 2009-01-01.00:00
Stop-date would be the end date above.

scheduler A start-date 2009-01-01.00:00 stop-date 2009-09-10.23:59
scheduler B start-date 2009-01-01.00:00 stop-date 2010-10-19.23:59
scheduler C start-date 2009-01-01.00:00 stop-date 2010-11-11.23:59

Thanks for all the help in advance.
Question by:dpk_wal
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
LVL 26

Expert Comment

ID: 34935793
Where do you get scheduler names from?  Your sample text file lines don't appear to include the info.
LVL 32

Author Comment

ID: 34937252
Sorry for the typo:
>> time-range 02293404
the word after time-range is the scheduler name; in original post the names are 02293404, 10May2010_SR405807, 11july2010.

LVL 16

Accepted Solution

sjklein42 earned 2000 total points
ID: 34938314
%mm = ('Jan','01', 'Feb','02', 'Mar','03', 'Apr','04', 'May','05', 'Jun','06', 'Jul','07', 'Aug','08', 'Sep','09', 'Oct','10', 'Nov','11', 'Dec','12');

while ( <> )

	if ( $_ ne '' )
		# time-range 10May2010_SR405807

		if ( ! ( /^time-range / ) ) { die"*** expected time-range not found\n"; }
		$schedName = $';

		#  absolute end 23:59 10 May 2010

		$_ = <>;
		if ( ! ( /^ absolute end ([0-9]+)\:([0-9]+) ([0-9]+) ([a-z]+) ([0-9]+)/i ) )
			{ die"*** expected absolute end not found\n"; }
		($hh, $min, $dd, $mmm, $yyyy) = ($1, $2, $3, $4, $5);

		# scheduler A start-date 2009-01-01.00:00 stop-date 2009-09-10.23:59

		$mm = $mm{substr($mmm,0,3)};
		print "scheduler $schedName start-date 2009-01-01.00:00 stop-date "
			. $yyyy . "-" . $mm . "-" . $dd . "." . $hh . ":" . $min . "\n";

Open in new window

C:\temp>perl foo.txt
scheduler 02293404 start-date 2009-01-01.00:00 stop-date 2009-09-19.23:59
scheduler 10May2010_SR405807 start-date 2009-01-01.00:00 stop-date 2010-05-10.23:59
scheduler 11july2010 start-date 2009-01-01.00:00 stop-date 2010-07-11.23:59

Open in new window

LVL 32

Author Closing Comment

ID: 34940262
Thank you!

Featured Post

On Demand Webinar - Networking for the Cloud Era

This webinar discusses:
-Common barriers companies experience when moving to the cloud
-How SD-WAN changes the way we look at networks
-Best practices customers should employ moving forward with cloud migration
-What happens behind the scenes of SteelConnect’s one-click button

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

I've just discovered very important differences between Windows an Unix formats in Perl,at least 5.xx.. MOST IMPORTANT: Use Unix file format while saving Your script. otherwise it will have ^M s or smth likely weird in the EOL, Then DO NOT use m…
I have been pestered over the years to produce and distribute regular data extracts, and often the request have explicitly requested the data be emailed as an Excel attachement; specifically Excel, as it appears: CSV files confuse (no Red or Green h…
Learn several ways to interact with files and get file information from the bash shell. ls lists the contents of a directory: Using the -a flag displays hidden files: Using the -l flag formats the output in a long list: The file command gives us mor…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…

721 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question