Solved

average data in third column per month and round to the nearest hundredth

Posted on 2012-03-15
Last Modified: 2012-03-16
I have a file called output.txt and I want to average its data per month. The output built from the attached file should have the month in the first column and the average value in the second column. Each row should look like the sample below; the averages are bogus, just for example. See the attached file.

Sample rows for output2.txt, with bogus data:
198001 120.22
198002 133.42
output.txt
Question by:libertyforall2
2 Comments

Expert Comment

by:bazika
ID: 37729176
Could you please clarify the structure of your input file (the one called "output.txt")?

Some rows have 4 columns, and later rows have 5. Exactly which fields should be analyzed?
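
One quick way to see how the column count varies across the file is to tally rows by number of fields. This is only a sketch: it assumes whitespace-separated fields (awk's default), and field_count_summary is a hypothetical helper name, not something from the thread.

```shell
#!/bin/sh
# Sketch: report how many rows have each field count.
# field_count_summary is a hypothetical name used for illustration only.
# Output format: "<number of fields> <number of rows>", one line per count.
field_count_summary() {
    awk '
    { n[NF]++ }                      # tally rows by their field count
    END {
        for (k in n)
            printf "%d %d\n", k, n[k]
    }
    ' "$1" | sort -n
}
```

Running it as "field_count_summary output.txt" lists each field count next to the number of rows that have it, which shows whether the 5-column rows are the exception or the rule.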

Accepted Solution

by:
bazika earned 500 total points
ID: 37729248
Here is an example shell script.
I assume that the first field in output.txt is "YYYYMMDD", so "substr($1, 1, 6)" gives us "YYYYMM" (year + month).

Also, every field from the third onward is included in the sum (i.e. field3 + field4 + field5 ...), and each of those values counts toward the average.

I do not know which awk version is used, so I initialize the arrays explicitly.

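#!/bin/ksh

# Average every value from field 3 onward, grouped by month (YYYYMM).
awk '
# Only process rows that have a third field.
$3 != "" {
        cur_mon = substr($1, 1, 6)      # YYYYMMDD -> YYYYMM
        # Explicit initialization, in case this awk does not default to 0.
        if (a_sum[cur_mon] == "") { a_sum[cur_mon] = 0; a_cou[cur_mon] = 0 }
        for (i = 3; i <= NF; i++) {
                a_sum[cur_mon] += $i    # running total for the month
                a_cou[cur_mon]++        # number of values added
        }
}
END {
        # Print one "YYYYMM average" line per month, rounded to 2 decimals.
        for (cur_mon in a_sum)
                printf "%s %.2f\n", cur_mon, a_sum[cur_mon] / a_cou[cur_mon]
}
' output.txt | sort -k 1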
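
If only the third column should be averaged, as the question title suggests, a variant could look like the sketch below. It keeps the same assumption that field 1 is "YYYYMMDD"; monthly_avg_col3 is a hypothetical helper name, and ignoring fields 4 and up is my reading of the title, not something the asker confirmed.

```shell
#!/bin/sh
# Sketch: average ONLY field 3 per month (YYYYMM).
# Assumes field 1 is YYYYMMDD; monthly_avg_col3 is a hypothetical name.
monthly_avg_col3() {
    awk '
    $3 != "" {
        mon = substr($1, 1, 6)   # YYYYMMDD -> YYYYMM
        sum[mon] += $3           # running total of field 3 only
        cnt[mon]++               # number of rows seen for this month
    }
    END {
        for (mon in sum)
            printf "%s %.2f\n", mon, sum[mon] / cnt[mon]
    }
    ' "$1" | sort -k 1,1
}
```

It could be used as "monthly_avg_col3 output.txt > output2.txt". Rows with extra columns contribute only their third field, so a month's average is not skewed by rows that happen to carry more values.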
