find uniq lines in file, using the first field of line

10-20-2009

Registered User

8, 0

Join Date: Oct 2009

Last Activity: 14 June 2012, 10:29 AM EDT

Location: UK

Posts: 8

Thanks Given: 0

Thanked 0 Times in 0 Posts

Hello all, new to Unix and have just found the forum.
I think I will be here quite often, and hope that in time I will be able to provide some help. Roll on not being a newbie anymore.

I have a question which I am hoping someone could help me with.

If I have a file with lines in it like this ...

purple 2 color
purple 3 color
purple 4 color
purple 5 color
blue 1 color
blue 2 color
blue 3 color

How do I sort the list so that only unique instances of field 1 show, each with the highest value of field 2?

So in this case I would want to see ...

purple 5 color
blue 3 color

any help would be most welcome

cheers
grom

10-20-2009

Administrator Emeritus

9,179, 1,331

Join Date: Jun 2009

Last Activity: 26 February 2019, 5:57 PM EST

Posts: 9,179

Thanks Given: 430

Thanked 1,331 Times in 1,120 Posts

Hi.

You can use awk alone for this, or something like:

Code:

sort -nrk2,1 file1 | awk 'C != $1 { C = $1; print }'
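
For the awk-only route mentioned above, here is a minimal sketch. It is an illustration rather than the poster's own code: the function name max_per_key and the array names max and line are made up for this example, and it assumes whitespace-separated fields with a numeric second field.

```shell
# max_per_key (hypothetical name): read lines on stdin and print, for each
# distinct first field, the line whose second field is numerically largest.
max_per_key() {
    awk '
        # Remember the line if this key is new or its value beats the stored one.
        !($1 in max) || $2 + 0 > max[$1] { max[$1] = $2 + 0; line[$1] = $0 }
        # Output order is unspecified in awk, so sort afterwards if it matters.
        END { for (k in line) print line[k] }
    '
}

# Example run on part of the sample data from the question.
printf '%s\n' 'purple 2 color' 'purple 5 color' 'blue 1 color' 'blue 3 color' |
    max_per_key | sort
```

Because no sorting is needed before the awk step, this reads the file once. The trailing sort only makes the output order predictable.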

Scott

10-20-2009

Many thanks!

Works like a charm, scottn, very much appreciated.
I have a lot to learn.

Cheers

grom

10-20-2009

So do I, because actually, it doesn't!!

Add:

Code:

purple 27 color

and you'll see why!

Code:

sort -nrk2 file1 | awk '!C[$1] { C[$1]=1; print }'

Scott

10-20-2009

See what you mean. I am glad you're on the ball. I had only tried it on the example I posted. I just tried again with your revised solution (with extra numbers added too) and all is OK.

Could I trouble you and ask for a brief breakdown of how it works?

I have ordered the sed & awk book from Amazon, hope I understand it when it comes LOL

cheers

grom

10-20-2009

sort is not a command I use often (evidently!).

My understanding was that by using -nrk2,1 the lines would be sorted (reverse) numerically by field 2, then by field 1 (presumably alphabetically). That understanding was wrong.

If you try

Code:

sort -nrk2 file1

Then that's OK (numerically), but the rest (-nrk2,1) doesn't sort the first field after that as I thought.
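
As far as I know, in sort's -k syntax the number after the comma is the field where that key ends, not a second key, so -k2,1 does not mean "field 2, then field 1". A second key is given with a second -k option. A sketch under that reading (the function name by_color_then_value is made up for this example):

```shell
# by_color_then_value (hypothetical name): sort stdin by field 1
# alphabetically, then by field 2 numerically, highest first.
by_color_then_value() {
    sort -k1,1 -k2,2nr
}

# Each color is now contiguous with its highest value first, so a
# "print when field 1 changes" filter keeps one line per color.
printf '%s\n' 'purple 5 color' 'blue 1 color' 'purple 27 color' 'blue 3 color' |
    by_color_then_value | awk 'C != $1 { C = $1; print }'
```

With the lines grouped by color like this, the simpler awk filter from the first reply is enough, since duplicates of field 1 are always adjacent.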

Sort is a powerful command if you can master it - something I have neither the time nor inclination to do!

Given that the records are sorted reverse numerically in field 2 with "sort -nrk2"...

Code:


sort -nrk2 file1 | awk '!C[$1] { C[$1]=1; print }'

The awk says:
If I don't yet have an entry for this color (where $1 = purple, or whatever) in my array (C[purple]), then define something (anything) for it (C[$1]=1) so that I do, and then print the line. If I do have something already defined, then do nothing. Because the input is already sorted with the highest values first, this prints only the first, and therefore highest, line for each color.
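
The explanation above can be tried on a small input. This is only a sketch: the function name highest_per_key is made up for the example, the sort key is written as -k2,2nr so that it covers field 2 only, and seen[$1]++ is a common shorthand that does the same test-and-set as C[$1]=1 in the command above.

```shell
# highest_per_key (hypothetical name): sort stdin by field 2, highest
# first, then keep only the first line seen for each value of field 1.
highest_per_key() {
    sort -k2,2nr | awk '!seen[$1]++'
}

# Sample data from the thread, including the "purple 27" line that broke
# the first attempt.
printf '%s\n' 'purple 2 color' 'purple 27 color' 'purple 5 color' \
    'blue 1 color' 'blue 3 color' | highest_per_key
```

On this input the result is purple 27 color followed by blue 3 color, because the output follows the numeric order of field 2 rather than the color names.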

Last edited by Scott; 10-20-2009 at 04:40 PM.. Reason: added the word "reverse" in a couple of places, for clarity

Scott

10-20-2009

Thanks, that has helped me make a bit of sense out of what, to me at the moment, seems like voodoo magic.

I look forward to the point where I may be able to help someone on these forums, although I had better not hold my breath, I think I have a lot to learn.

Once again, many thanks for your kind help, it's appreciated.

Cheers

grom
