compare fields in a file with duplicate records

01-28-2009

Registered User

9, 0

Join Date: Jul 2008

Last Activity: 17 July 2009, 5:47 AM EDT

Posts: 9

Thanks Given: 0

Thanked 0 Times in 0 Posts

compare fields in a file with duplicate records

Hi:

I've been searching the net but didnt find a clue. I have a file in which, for some records, some fields coincide. I want to compare one (or more) of the dissimilar fields and retain the one record that fulfills a certain condition. For example, on this file:

Code:

99  TR   1991  5  06   60.000  -19.995  277
64  TR   1991  5  06   60.000  -19.995  275
60  TA   1990  7  17   60.000  -19.998  300
89  TA   1990  7  17   60.000  -19.998  277
46  CU   1992  8  29   61.020  -16.880  009
35  CU   1992  8  29   61.020  -16.880  003

retain the record with greater value on the last field for each duplicate:

Code:

99  TR   1991  5  06   60.000  -19.995  277
60  TA   1990  7  17   60.000  -19.998  300
46  CU   1992  8  29   61.020  -16.880  009

while sending the unwanted data to another file:

Code:

64  TR   1991  5  06   60.000  -19.995  275
89  TA   1990  7  17   60.000  -19.998  277
35  CU   1992  8  29   61.020  -16.880  003

Thanks, in advance.

r.-

rleal

View Public Profile for rleal

Find all posts by rleal

01-29-2009

Registered User

20, 1

Join Date: Jan 2009

Last Activity: 17 March 2009, 3:42 AM EDT

Posts: 20

Thanks Given: 0

Thanked 1 Time in 1 Post

You can use awk for this.

Code:

$ awk '{if(arr[$2]=="")arr[$2]=$0; else{split(arr[$2],tmp);if($8>tmp[8]) sub(/tmp[8]$/,arr[$2],$8);}}END{for(item in arr)  print arr[item]}' infile
46  CU   1992  8  29   61.020  -16.880  009
60  TA   1990  7  17   60.000  -19.998  300
99  TR   1991  5  06   60.000  -19.995  277

skar_a

View Public Profile for skar_a

Find all posts by skar_a

Shell Programming and Scripting

compare fields in a file with duplicate records

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Discarding records with duplicate fields

Discussion started by: beca123456

2. Shell Programming and Scripting

Deleting duplicate records from file 1 if records from file 2 match

Discussion started by: vestport

3. Shell Programming and Scripting

Remove somewhat Duplicate records from a flat file

Discussion started by: jolney

4. UNIX for Dummies Questions & Answers

CSV file:Find duplicates, save original and duplicate records in a new file

Discussion started by: arvindosu

5. Shell Programming and Scripting

Find Duplicate records in first Column in File

Discussion started by: Murugesh

6. Shell Programming and Scripting

find out duplicate records in file?

Discussion started by: tiger2000

7. Shell Programming and Scripting

How to find Duplicate Records in a text file

Discussion started by: G.Aavudai

8. UNIX for Advanced & Expert Users

Duplicate records from oracle to text file.

Discussion started by: shilendrajadon

9. Shell Programming and Scripting

Remove all instances of duplicate records from the file

Discussion started by: vukkusila

10. Shell Programming and Scripting

Delete Duplicate records from a tilde delimited file

Discussion started by: irshadm