Removing lines that are (same in content) based on columns


 
# 1  
Old 11-08-2007

I have a file that looks like this:

AA BB CC DD EE FF GG HH KK
AA BB GG HH KK FF CC DD EE
AA BB CC DD EE UU VV XX ZZ
AA BB VV XX ZZ UU CC DD EE

....
I want a script that keeps only one line from each set of lines with the same content:

AA BB CC DD EE FF GG HH KK
AA BB CC DD EE UU VV XX ZZ
# 2  
Old 11-08-2007
Call me stupid, but I don't follow what you are after here. The output example you've given has two lines, but you seem to be asking for one line? Please elaborate on what you are after.
# 3  
Old 11-09-2007
I suppose he wants something like this,
but for a more general solution it would be
better if we had more sample data:

Code:
awk '{x[$3v$4v$5v$7v$8v$9]}
!($7v$8v$9v$3v$4v$5 in x)' v=" " filename

Use nawk or /usr/xpg4/bin/awk on Solaris.
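
For the more general case, here is a minimal sketch. It assumes that two lines count as duplicates when they contain the same fields in any order, which the sample data suggests but does not confirm. The function name dedup_any_order and the array names f and seen are made up for illustration. It sorts the fields of each line to build an order-independent key and prints a line only the first time its key appears.

```shell
# Hypothetical helper: keep the first line of every group of lines that
# contain the same fields, regardless of field order.
dedup_any_order() {
    awk '{
        # Copy the fields into an array, forcing string values so that
        # all comparisons below are string comparisons.
        n = NF
        for (i = 1; i <= n; i++) f[i] = $i ""
        # Insertion sort of the fields (portable, no gawk-only asort).
        for (i = 2; i <= n; i++) {
            t = f[i]
            for (j = i - 1; j >= 1 && f[j] > t; j--) f[j + 1] = f[j]
            f[j + 1] = t
        }
        # Join the sorted fields into an order-independent key.
        key = ""
        for (i = 1; i <= n; i++) key = key SUBSEP f[i]
        # Print the line only the first time this key is seen.
        if (!(key in seen)) { seen[key] = 1; print }
    }' "$@"
}
```

Usage would be: dedup_any_order filename. Unlike the one-liner above, this does not depend on which columns hold the swapped blocks, but it also treats any reordering of fields as a duplicate, so it is only appropriate if that is really what is wanted.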
# 4  
Old 11-09-2007
As a clarification:
I am looking for one line per set of duplicate data. The 4-line input file (given as an example) consists of two blocks of 2 lines each, and the lines within a block contain duplicate data. Thus the result is two lines.

AA BB CC DD EE FF GG HH KK ----- First Block
AA BB GG HH KK FF CC DD EE ----- First Block

AA BB CC DD EE UU VV XX ZZ ----- Second Block
AA BB VV XX ZZ UU CC DD EE ----- Second Block


The results should be like the following:
AA BB CC DD EE FF GG HH KK ----- Result for First Block

AA BB CC DD EE UU VV XX ZZ ----- Result for Second Block

I hope it is clear now!
# 5  
Old 11-09-2007
Quote:
Originally Posted by radoulov
I suppose he wants something like this,
but for a more general solution it would be
better if we had more sample data:

Code:
awk '{x[$3v$4v$5v$7v$8v$9]}
!($7v$8v$9v$3v$4v$5 in x)' v=" " filename

Use nawk or /usr/xpg4/bin/awk on Solaris.
Radoulov,
Your script did not work for me! (awk error)
# 6  
Old 11-09-2007
Quote:
Originally Posted by adsforall
Radoulov,
Your script did not work for me! (awk error)
Hm,
and you're not on Solaris?
# 7  
Old 11-09-2007
Quote:
Originally Posted by radoulov
Hm,
and you're not on Solaris?
I am using Solaris 9
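
On Solaris the plain awk command is normally the old /usr/bin/awk, so the error probably comes from running the script with that binary instead of nawk or /usr/xpg4/bin/awk, as noted in the original reply. The following is only a sketch of one way to pick a suitable awk; the helper name pick_awk and the variable AWK are made up for illustration.

```shell
# Hypothetical helper: print the path or name of an awk that is expected
# to understand the "(key in array)" test used in the one-liner.
pick_awk() {
    if [ -x /usr/xpg4/bin/awk ]; then
        echo /usr/xpg4/bin/awk      # Solaris XPG4 awk
    elif type nawk >/dev/null 2>&1; then
        echo nawk                   # "new awk", also available on Solaris
    else
        echo awk                    # assume plain awk is adequate elsewhere
    fi
}

AWK=`pick_awk`
echo "using: $AWK"
```

The one-liner would then be run as "$AWK" followed by the same program, v=" " and the filename as before. If it still fails, posting the exact error message would help.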
 