Solved

counting values in a list 3

Posted on 2013-11-17
261 Views
I know very little about PERL :(

I have list of millions of rows of data. All data values are separated by a comma

I have to count how many rows contain a specific data value

for example

101,102,143,145,146,149
101,102,143,145,147,148
101,102,143,145,147,149
101,102,143,145,148,149
101,102,143,146,147,148
101,102,143,146,147,149
101,103,143,146,148,149
102,103,143,147,148,149
102,103,144,145,146,147
102,104,144,145,146,148
102,104,144,145,146,149
102,104,144,145,147,148

from that sample list, I have to count how many rows have the value 103 for the second data entry, but only if the value in the first column is 101, so for this list the number of rows with 103 in the second data entry where the first value is 101, would be 1

I have a previous script (attached) from "ozo" which counts the rows with particular values in the first column.

Could you alter this script (as it works very fast) to count the number of rows with 103 in the second column only where the value 101 is in the first column ?

Thankyou very much in advance count1.txt
0
Question by:MichaelGlancy
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

• Help others & share knowledge
• Earn cash & points
• 5
• 4

LVL 14

Expert Comment

ID: 39655001
``````#!/usr/bin/perl
use strict;
use warnings;
open M,"<master.vim" or die "master.vim \$!";
my \$c=0;
\$c+=/^101,103/ while <M>;
close M;
open C,">count.txt" or die "count.txt \$!";
print C "\$c\n";
close C;
``````
0

Author Comment

ID: 39655009
Doesnt work, its posting too high a result in a test list that i have
0

LVL 14

Expert Comment

ID: 39655012
Can you post the test list that you have?
0

LVL 14

Accepted Solution

jb1dev earned 500 total points
ID: 39655015
Oh wait, I guess the numbers can be higher than 3 digits? Need to add a comma after 103.

Try

``````#!/usr/bin/perl
use strict;
use warnings;
open M,"<master.vim" or die "master.vim \$!";
my \$c=0;
\$c+=/^101,103,/ while <M>;
close M;
open C,">count.txt" or die "count.txt \$!";
print C "\$c\n";
close C;
``````
0

Author Comment

ID: 39655016
Sorry, it works on the little list I supplied. The test list I have is 5 million entries. I can accept this answer just now and we can work on the next
0

Author Closing Comment

ID: 39655017
great
0

Author Comment

ID: 39655035
It works fine on the large list. unfortunately it is picking up

all values with 101,103

and it is including values further down the list with

101,1034 etc so giving me a false result

thankyou
0

LVL 14

Expert Comment

ID: 39655037
0

Author Comment

ID: 39655044
Ive posted a new question
0

Featured Post

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question