keeping last record among group of records with common fields (awk) Post: 302711565

Sponsored Content

Top Forums UNIX for Dummies Questions & Answers keeping last record among group of records with common fields (awk) Post 302711565 by Don Cragun on Sunday 7th of October 2012 03:12:51 AM

10-07-2012

Registered User

Here are some options for you. This script provides several ways of processing the input file giving different results depending on whether you want the highest value for $3 or the last value for $3, all entries with matching field 1 and field 2 values adjacent or spread throughout the input file, and whether or not you care if the output order is the same as the input file order:

Code:

#!/bin/ksh
printf "Following assumes all entries with matching field 1 & field 2 are
adjacent and prints the last entry found.\n"
awk 'BEGIN {FS = OFS = ";"}
last != $1 FS $2 {
        if(last != "") print last, hi3
        last = $1 FS $2
        hi3 = $3
        next
}
        {hi3 = $3}
END {   if(last != "") print last, hi3}' input

printf "\nFollowing assumes all entries with matching field 1 & field 2 are
adjacent and prints the entry with highest value in field 3.\n"
awk 'BEGIN {FS = OFS = ";"}
last != $1 FS $2 {
        if(last != "") print last, hi3
        last = $1 FS $2
        hi3 = $3
        next
}
        {if($3 > hi3) hi3 = $3}
END {   if(last != "") print last, hi3}' input

printf "\nFollowing assumes entries with matching field 1 & field 2 might not be
adjacent and prints the last entry found.  Output order is not guaranteed to
match the order of first appearance in the input file.\n"
awk 'BEGIN {FS = OFS = ";"}
{       k3[$1 FS $2] = $3}
END {   for (k in k3) print k, k3[k]}' input

printf "\nFollowing assumes entries with matching field 1 & field 2 might not be
adjacent and prints the highest entry found.  Output order is not guaranteed to
match the order of first appearance in the input file.\n"
awk 'BEGIN {FS = OFS = ";"}
{       if(kc[$1 FS $2]++ == 0) k3[$1 FS $2] = $3
        else if($3 > k3[$1 FS $2]) k3[$1 FS $2] = $3
}
END {   for (k in kc) print k, k3[k]}' input

printf "\nFollowing assumes entries with matching field 1 & field 2 might not be
adjacent and prints the highest entry found.  Output order is guaranteed to
match the order of first appearance in the input file.\n"
awk 'BEGIN {FS = OFS = ";"}
{       if(kc[$1 FS $2]++ == 0) {
                k3[$1 FS $2] = $3
                order[++cnt] = $1 FS $2
        } else if($3 > k3[$1 FS $2]) k3[$1 FS $2] = $3
}
END {   for(i = 1; i <= cnt; i++) print order[i], k3[order[i]]}' input

When the file input contains:

Code:

split;test;2
ref.1;rack.1;2
ref.1;rack.1;1
ref.1;rack.2;1
split;test;3
ref.2;rack.3;1
ref.2;rack.3;2
ref.2;rack.3;3
split;test;1

the output from the above script is:

Code:

Following assumes all entries with matching field 1 & field 2 are
adjacent and prints the last entry found.
split;test;2
ref.1;rack.1;1
ref.1;rack.2;1
split;test;3
ref.2;rack.3;3
split;test;1

Following assumes all entries with matching field 1 & field 2 are
adjacent and prints the entry with highest value in field 3.
split;test;2
ref.1;rack.1;2
ref.1;rack.2;1
split;test;3
ref.2;rack.3;3
split;test;1

Following assumes entries with matching field 1 & field 2 might not be
adjacent and prints the last entry found.  Output order is not guaranteed to
match the order of first appearance in the input file.
split;test;1
ref.2;rack.3;3
ref.1;rack.1;1
ref.1;rack.2;1

Following assumes entries with matching field 1 & field 2 might not be
adjacent and prints the highest entry found.  Output order is not guaranteed to
match the order of first appearance in the input file.
split;test;3
ref.2;rack.3;3
ref.1;rack.1;2
ref.1;rack.2;1

Following assumes entries with matching field 1 & field 2 might not be
adjacent and prints the highest entry found.  Output order is guaranteed to
match the order of first appearance in the input file.
split;test;3
ref.1;rack.1;2
ref.1;rack.2;1
ref.2;rack.3;3

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk: record has too many fields

Hi, I'm trying this command - but get this error. Do you guys have any workaround for this? cat tf|sed 's/{//g'|sed 's/,//g'|awk '{for (i=1;i<=NF;i++) {if ($i == "OPTIME") {k = i + 2; print $i,$k}}}' awk: record `2005 Jul 28 17:35:29...' has too many fields record number 15 This is how...

2. Infrastructure Monitoring

Processing records as group - awk

I have a file has following records policy glb id 1233 name Permit ping from "One" to "Second" "Address1" "Any" "ICMP-ANY" permit policy id 999251 service "snmp-udp" exit policy glb id 1234 name Permit telnet from "One" to "Second" "Address2" "Any" "TCP-ANY" permit policy id 1234...

3. Shell Programming and Scripting

Print all the fields of record using awk

Hi, i want to generate print statement using awk. i have 20+ and 30+ fields in each line Now its priting only first eight fields print statement as output not all. my record is as shown below filename ...

4. Shell Programming and Scripting

Common records using AWK

Hi, To be honest, I am really impressed and amazed at the pace I find solutions for un-solved coding mysteries in this forum. I have a file like this input1.txt x y z 1 2 3 a b c 4 -3 7 k l m n 0 p 1 2 a b c 4 input2 x y z 9 0 -1 a b c 0 6 9 k l m 8 o p 1 2 a f x 9 Output...

5. Shell Programming and Scripting

Finding out the common lines in two files using 4 fields with the help of awk and UNIX

Dear All, I have 2 files. If field 1, 2, 4 and 5 matches in both file1 and file2, I want to print the whole line of file1 and file2 one after another in my output file. File1: sc2/80 20 . A T 86 F=5;U=4 sc2/60 55 . G T ...

6. Shell Programming and Scripting

awk - compare 1st 15 fields of record with 20 fields

I'm trying to compare 2 files for differences in a selct number of fields. When differnces are found it will write the whole record of the second file including appending '|C' out to a delta file. Each record will have 20 fields, but only want to do comparison of 1st 15 fields. The 1st field of...

7. Shell Programming and Scripting

awk print matching records and occurences of each record

Hi all , I have two files : dblp.xml with dblp records and itu1.txt with faculty members records. I need to find out how many dblp records are related to the faculty members. More specific: I need to find out which names from itu1.txt are a match in dblp. xml file , print them and show how many...

8. Shell Programming and Scripting

Sorting group of records and loading last record

Hi Everyone, I have below record set. File is fixed widht file 101newjersyus 20150110 101nboston us 20150103 102boston us 20140106 102boston us 20140103 I need to group record based on first 3 letters in our case(101 and 102) and sort last 8 digit in ascending order and print only...

9. Shell Programming and Scripting

Keeping record of file 2 based on a reference file 1 in awk

I have 2 input files (tab separated): file1: make_A 1990 foo bar make_B 2010 this that make_C 2004 these those file2: make_X 1970 1995 ref_1:43 ref_2:65 make_A 1970 1995 ref_1:4 ref_2:21 ref_3:18 make_A 1980 2002 ref_1:7 ref_2:7 ref_3:0 ...

10. UNIX for Beginners Questions & Answers

Awk: group multiple fields from different records

Hi, My input looks like that: A|123|qwer A|456|tyui A|456|wsxe B|789|dfgh Using awk, I am trying to get: A|123;456|qwer;tyui;wsxe B|789|dfgh For records with same $1, group all the $2 in a field (without replicates), and all the $3 in a field (without replicates). What I have tried:...

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk: record has too many fields

Discussion started by: chaandana

2. Infrastructure Monitoring

Processing records as group - awk

Discussion started by: baskar

3. Shell Programming and Scripting

Print all the fields of record using awk

Discussion started by: raghavendra.nsn

4. Shell Programming and Scripting

Common records using AWK

Discussion started by: jacobs.smith

5. Shell Programming and Scripting

Finding out the common lines in two files using 4 fields with the help of awk and UNIX

Discussion started by: NamS

6. Shell Programming and Scripting

awk - compare 1st 15 fields of record with 20 fields

Discussion started by: sljnk

7. Shell Programming and Scripting

awk print matching records and occurences of each record

Discussion started by: iori

8. Shell Programming and Scripting

Sorting group of records and loading last record

Discussion started by: patricjemmy6

9. Shell Programming and Scripting

Keeping record of file 2 based on a reference file 1 in awk

Discussion started by: beca123456

10. UNIX for Beginners Questions & Answers

Awk: group multiple fields from different records

Discussion started by: beca123456