Perl - sort and count

Posted on 2012-03-20
Last Modified: 2012-03-20
I have a file containing user records. Records are separated by a blank line, for example:

dn: John Doe
alias: JDoe
contacts: 20101118
contacts: Visable

dn: Mary Doe
alias: MDoe
contacts: 20101118
contacts: Visable
contacts: 20101119
contacts: Hidden

dn: Bill Doe
alias: BDoe
contacts: 20101119
contacts: Hidden

I can pull out the records I want using perl -n00e ......

What I need to do is pull out the records that have more than two contacts: lines in them.

How can I pull each record out and count how many of its lines contain "contacts:"?

In the example above I need to be able to pull out the record for:

dn: Mary Doe
alias: MDoe
contacts: 20101118
contacts: Visable
contacts: 20101119
contacts: Hidden

I would also like a count of the contacts: lines in every record.

Question by: bt707
Accepted Solution

farzanj earned 500 total points
ID: 37744323
Try this:

perl -ne 'BEGIN{$/="\n\n";} (@c) = /(contacts[:])/g; print if (@c>2)' filename


If you also want to print the count of contacts, do this:
perl -ne 'BEGIN{$/="\n\n";} (@c) = /(contacts[:])/g; if (@c>2){print;print @c."\n"}' filename



Author Closing Comment

ID: 37744366
Yes, that worked well.

