Duplicate columns and lines

11-23-2009

Registered User

32, 0

Join Date: Aug 2008

Last Activity: 9 February 2010, 8:35 AM EST

Posts: 32

Thanks Given: 0

Thanked 0 Times in 0 Posts

Duplicate columns and lines

Hi all,

I have a tab-delimited file and want to remove identical lines, i.e. all of line 1,2,4 because the columns are the same as the columns in other lines. Any input is appreciated.

abc gi4597 9997 cgcgtgcg $%^&*()()*
abc gi4597 9997 cgcgtgcg $%^&*()()*
ttt gi9865 8879 tgcgtgtt *(())^#@!!
abc gi4597 9997 cgcgtgcg $%^&*()()*
fgy gi9876 0975 cgaggcgc @#$%^*&*((
abc gi4597 9997 ttgttgttc $%^&*()()*

---------- Post updated at 09:29 AM ---------- Previous update was at 09:19 AM ----------

It just clicked:
awk 'x[$1,$2,$3,$4,$5,$6]++' filename

Any other methods would be helpful

dr_sabz

View Public Profile for dr_sabz

Find all posts by dr_sabz

11-24-2009

Registered User

12, 0

Join Date: Nov 2009

Last Activity: 1 December 2009, 3:50 AM EST

Posts: 12

Thanks Given: 0

Thanked 0 Times in 0 Posts

If you want a single line among several identical lines you make a | sort -u. In fact I am not sure I understood your request.

popescu1954

View Public Profile for popescu1954

Find all posts by popescu1954

UNIX for Dummies Questions & Answers

Duplicate columns and lines

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Print only the duplicate line only with matching columns

Discussion started by: Indra2011

2. Shell Programming and Scripting

Remove columns with duplicate entries

Discussion started by: Sanchari

3. Shell Programming and Scripting

Count duplicate lines ignoring certain columns

Discussion started by: coppuca

4. Shell Programming and Scripting

Replace duplicate columns with values from first occurrence

Discussion started by: asyed

5. Shell Programming and Scripting

Remove Duplicate by considering multiple columns

Discussion started by: jacobs.smith

6. UNIX for Dummies Questions & Answers

remove duplicate lines based on two columns and judging from a third one

Discussion started by: TheTransporter

7. UNIX for Advanced & Expert Users

In a huge file, Delete duplicate lines leaving unique lines

Discussion started by: krishnix

8. UNIX for Dummies Questions & Answers

help to identify duplicate columns adjacent value

Discussion started by: umapearl

9. Shell Programming and Scripting

Remove duplicate columns in input file

Discussion started by: linux_usr

10. Shell Programming and Scripting

how to identify duplicate columns in a row

Discussion started by: suresh3566