awk- looping through groups of lines

05-06-2011

Registered User

28, 0

Join Date: Apr 2011

Last Activity: 17 August 2011, 4:10 AM EDT

Location: Helsinki, Finland

Posts: 28

Thanks Given: 14

Thanked 0 Times in 0 Posts

awk- looping through groups of lines

Hello,

I'm working with a file that has three columns. The first one represents a certain channel and the third one a timestamp (second one is not important). Example input is as follows:

Code:

2513   12   10.771
 2513   13   10.771
 2513   14   10.771
 2513   15   10.771
 2644    8    10.771
 2645   14    10.771
 2647     7    10.771
----------------------
 2513     0    10.772
 2513     1    10.772
 2513     2    10.772
 2513     3    10.772
 2513     4    10.772
 2513     5    10.772
 2513     6    10.772
----------------------
 2513     7    10.772
 2513     8    10.772
 2513     9    10.772
 2513     10    10.772
 2513     11  10.772
 2513     12   10.772
 2513     13   10.772

The input doesn't have the "----------------------" part, I just put it there so the groups of lines that I want to analyze become a bit clearer.

I want to analyze the lines by groups of 7 (since 7 same timestamps represent 1 packet). The problem is that the timestamps repeat themselves from time to time, so for example sometimes you might find 14 or 21 consecutive timestamps with the same value (even though values in the other two columns do vary). What I want to get is a count of the times that the first column values (channels) appear (only counted once per packet, so, every group of 7 lines).

Desired output:

Code:

The code I've tried so far doesn't consider the repeated fields (the groups of 7), so it only counts one time per timestamp (which means I get a value of 2 instead of 3 for channel 2513):

Code:

 awk '{ 
                          while (getline > 0 && NF > 0){
                           timec= $3;
                           pidc= $1;
                           if(timec == $3 && pidc != pidp){
                               pid[$1]++;
                             }
                           pidp=$1}
                           } 
                           END {for (i in pid){ print i, pid[i]}}'

Any help is much appreciated.
Thanks!

Last edited by acsg; 05-06-2011 at 08:41 AM.. Reason: clearer input v1.2

acsg

View Public Profile for acsg

Find all posts by acsg

05-06-2011

Registered User

3,733, 1,154

Join Date: Apr 2009

Last Activity: 3 August 2016, 11:03 AM EDT

Posts: 3,733

Thanks Given: 7

Thanked 1,154 Times in 1,124 Posts

I think you want the first line to say:

Code:

2513 4

Also post desired output for the rest of that sample data (10.772 timestamp).

bartus11

View Public Profile for bartus11

Find all posts by bartus11

05-06-2011

Registered User

28, 0

Join Date: Apr 2011

Last Activity: 17 August 2011, 4:10 AM EDT

Location: Helsinki, Finland

Posts: 28

Thanks Given: 14

Thanked 0 Times in 0 Posts

Quote:

Originally Posted by bartus11

I think you want the first line to say:

Code:

2513 4

Also post desired output for the rest of that sample data (10.772 timestamp).

Hello,

The desired output is for the whole input... meaning that I want to count the fact that, for example, channel 2513, appears in all 3 'packets' (groups of 7 lines).

acsg

View Public Profile for acsg

Find all posts by acsg

05-06-2011

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

Like this?

Code:

awk '{B[$1]} !(NR%7){for(i in B){delete B[i];A[i]++}} END{for(i in A)print i,A[i]}' infile

This User Gave Thanks to Scrutinizer For This Post:

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

05-06-2011

Registered User

3,733, 1,154

Join Date: Apr 2009

Last Activity: 3 August 2016, 11:03 AM EDT

Posts: 3,733

Thanks Given: 7

Thanked 1,154 Times in 1,124 Posts

Try:

Code:

perl -lane '$x=int(($.-1)/7);$a{$x}{$F[0]}=1;END{for $i (keys %a){for $j (keys %{$a{$i}}){$b{$j}++}};for $i (keys %b){print "$i $b{$i}"}}' file

This User Gave Thanks to bartus11 For This Post:

bartus11

View Public Profile for bartus11

Find all posts by bartus11

05-09-2011

Registered User

28, 0

Join Date: Apr 2011

Last Activity: 17 August 2011, 4:10 AM EDT

Location: Helsinki, Finland

Posts: 28

Thanks Given: 14

Thanked 0 Times in 0 Posts

Quote:

Originally Posted by Scrutinizer

Like this?

Code:

awk '{B[$1]} !(NR%7){for(i in B){delete B[i];A[i]++}} END{for(i in A)print i,A[i]}' infile

Thank you!! This seems to do the trick but I don't quite understand how it does it... could you please explain what the !(NR%7) is for? and why did you use the 'delete' ?

acsg

View Public Profile for acsg

Find all posts by acsg

05-09-2011

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

Hi, here is a clarification:

{B[$1]}	Create an array element B[$1] . If such an element already exists then this will not create a new element, hence an element will only be created once for the value $1, irrespective of the number of occurrences of $1 (in a group of 7, see below)
!(NR%7)	If the remainder of the line number divide by 7 equals 0 (if it is not greater than 1) then we are at a multiple of 7, so seven lines will have been read)
for(i in B){delete B[i];A[i]++	then for each element in B increase the count in array A and then discard the array element B[i]. Afterwards all elements in array B will have been discarded. This sequence gets repeated after every 7 lines.
END{for(i in A)print i,A[i]}	print all the array element in array A and their values

Hope this helps...

S.

This User Gave Thanks to Scrutinizer For This Post:

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

Shell Programming and Scripting

awk- looping through groups of lines

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Rearrange groups of lines from several files

Discussion started by: migurus

2. Shell Programming and Scripting

Best way to sort file with groups of text of 4-5 lines by the first one

Discussion started by: devmsv

3. Shell Programming and Scripting

Print values within groups of lines with awk

Discussion started by: Ophiuchus

4. Shell Programming and Scripting

Match single line in file1 to groups of lines in file2

Discussion started by: pathunkathunk

5. Shell Programming and Scripting

Help on looping using awk

Discussion started by: jeffreybsu

6. Shell Programming and Scripting

Looping through only blank lines of a file.

Discussion started by: suraj.sheikh

7. UNIX for Dummies Questions & Answers

Remove groups of repeating lines

Discussion started by: glev2005

8. UNIX for Dummies Questions & Answers

Help with AWK looping

Discussion started by: new2awk

9. UNIX for Dummies Questions & Answers

Help in Array looping and creating multiple lines

Discussion started by: sexyTrojan

10. Shell Programming and Scripting

Breaking long lines into (characters, newline, space) groups

Discussion started by: rowie718