Removing Lines based on matching first column Post: 302539738

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Removing lines that are (same in content) based on columns

I have a file which looks like AA BB CC DD EE FF GG HH KK AA BB GG HH KK FF CC DD EE AA BB CC DD EE UU VV XX ZZ AA BB VV XX ZZ UU CC DD EE .... I want the script to give me only one line based on duplicate contents: AA BB CC DD EE FF GG HH KK AA BB CC DD EE UU VV XX ZZ

2. Shell Programming and Scripting

Matching words based on column headers

Hi , Pls help on this. Input file: NAME1 BSC1 TEXT ID 1 MAINSFAIL TEXT ID 2 DGON TEXT ID 3 lOADONDG NAME2 BSC2 TEXT ID 1 DGON TEXT ID 3 lOADONG

3. Shell Programming and Scripting

Matching 2 files based on one column

Hi, On a similar subject, the following. I have two files: file1.txt dbSNP_rsID,Chromosome,Position,Gene rs10399749,chr. 01,45162,? rs4030303,chr. 01,72434,? rs4030300,chr. 01,72515,? rs940550,chr. 01,78032,? rs13328714,chr. 01,81468,? rs11490937,chr. 01,222077,? rs6683466,chr....

4. Shell Programming and Scripting

awk print non matching lines based on column

My item was not answered on previous thread as code given did not work I wanted to print records from file2 where comparing column 1 and 16 for both files find rows where column 16 in file 1 does not match column 16 in file 2 Here was CODE give to issue ~/unix.com$ cat f1...

5. Shell Programming and Scripting

Removing duplicate records in a file based on single column

Hi, I want to remove duplicate records including the first line based on column1. For example inputfile(filer.txt): ------------- 1,3000,5000 1,4000,6000 2,4000,600 2,5000,700 3,60000,4000 4,7000,7777 5,999,8888 expected output: ---------------- 3,60000,4000 4,7000,7777...

6. Shell Programming and Scripting

Filtering lines for column elements based on corresponding counts in another column

Hi, I have a file like this ACC 2 2 21 aaa AC 443 3 22 aaa GCT 76 1 33 xxx TCG 34 2 33 aaa ACGT 33 1 22 ggg TTC 99 3 44 wee CCA 33 2 33 ggg AAC 1 3 55 ddd TTG 10 1 22 ddd TTGC 98 3 22 ddd GCT 23 1 21 sds GTC 23 4 32 sds ACGT 32 2 33 vvv CGT 11 2 33 eee CCC 87 2 44...

7. Shell Programming and Scripting

Find lines with matching column 1 value, retain only the one with highest value in column 2

I have a file like: I would like to find lines lines with duplicate values in column 1, and retain only one based on two conditions: 1) keep line with highest value in column 3, 2) if column 3 values are equal, retain the line with the highest value in column 4. Desired output: I was able to...

8. Shell Programming and Scripting

Based on column in file1, find match in file2 and print matching lines

file1: file2: I need to find matches for any lines in file1 that appear in file2. Desired output is '>' plus the file1 term, followed by the line after the match in file2 (so the title is a little misleading): This is honestly beyond what I can do without spending the whole night on it, so I'm...

9. Shell Programming and Scripting

Insert value of column based on file name matching

At the top of the XYZ file, I need to insert the ABC data value of column 2 only when ABC column 1 matches the prefix XYZ file name (not the ".txt"). Is there an awk solution for this? ABC Data 0101 0.54 0102 0.48 0103 1.63 XYZ File Name 0101.txt 0102.txt 0103.txt ...

10. Shell Programming and Scripting

Removing duplicate lines on first column based with pipe delimiter

Hi, I have tried to remove dublicate lines based on first column with pipe delimiter . but i ma not able to get some uniqu lines Command : sort -t'|' -nuk1 file.txt Input : 38376KZ|09/25/15|1.057 38376KZ|09/25/15|1.057 02006YB|09/25/15|0.859 12593PS|09/25/15|2.803...

LEARN ABOUT OSF1

comm

comm(1) 						      General Commands Manual							   comm(1)

NAME

       comm - Compares two sorted files.

SYNOPSIS

       comm [-123] file1 file2

STANDARDS

       Interfaces documented on this reference page conform to industry standards as follows:

       command: XCU5.0

       Refer to the standards(5) reference page for more information about industry standards and associated tags.

OPTIONS

       Suppresses  output  of  the  first column (lines in file1 only).  Suppresses output of the second column (lines in file2 only).	Suppresses
       output of the third column (lines common to file1 and file2).

       The command comm -123 produces no output.

OPERANDS

       A pathname of the first file to be compared. If file1 is a hyphen (-), the standard input is used.  A pathname of the  second  file  to	be
       compared. If file2 is a hyphen (-), the standard input is used.

       If both file1 and file2 refer to standard input or to the same FIFO special, block special or character special file, the results are unde-
       fined.

DESCRIPTION

       The comm command reads file1 and file2 and writes three columns to standard output, showing which lines are common to the files	and  which
       are unique to each.

       The  leftmost  column  of  standard output includes lines that are in file1 only.  The middle column includes lines that are in file2 only.
       The rightmost column includes lines that are in both file1 and file2.

       If you specify a hyphen (-) in place of one of the file names, comm reads standard input.

       Generally, file1 and file2 should be sorted according to the collating sequence specified by  the  LC_COLLATE  environment  variable.  (See
       sort(1).)  If the input files are not sorted properly, the output of comm might not be useful.

EXIT STATUS

       Successful completion.  Error occurred.

EXAMPLES

       In the following examples, file1 contains the following sorted list of North American cities:

	      Anaheim Baltimore Boston Chicago Cleveland Dallas Detroit Kansas City Milwaukee Minneapolis New York Oakland Seattle Toronto

	      The second file, file2, contains this sorted list:

	      Atlanta Chicago Cincinnati Houston Los Angeles Montreal New York Philadelphia Pittsburgh San Diego San Francisco St. Louis

	      To display the lines unique to each file and common to the two files, enter: comm file1 file2

	      This command results in the following output: Anaheim	 Atlanta Baltimore Boston	    Chicago	 Cincinnati Cleveland Dal-
	      las Detroit      Houston Kansas City	Los Angeles Milwaukee Minneapolis      Montreal 	  New York Oakland	 Philadel-
	      phia	Pittsburgh	San Diego      San Francisco Seattle	  St. Louis Toronto

	      The  leftmost column contains lines in file1 only, the middle column contains lines in file2 only, and the rightmost column contains
	      lines common to both files.  To display any one or two of the three output columns, include the appropriate flags  to  suppress  the
	      columns you do not want.	For example, the following command displays columns 1 and 2 only: comm -3 file1 file2

	      Anaheim
		     Atlanta Baltimore Boston
		     Cincinnati Cleveland Dallas Detroit
		     Houston Kansas City
		     Los Angeles Milwaukee Minneapolis
		     Montreal Oakland
		     Philadelphia
		     Pittsburgh
		     San Diego
		     San Francisco Seattle
		     St. Louis Toronto

	      The following command displays output from only the second column: comm -13 file1 file2

	      Atlanta Cincinnati Houston Los Angeles Montreal Philadelphia Pittsburgh San Diego San Francisco St. Louis

	      The following command displays output from only the third column: comm -12 file1 file2

	      Chicago New York

SEE ALSO

       Commands:  cmp(1), diff(1), sdiff(1), sort(1), uniq(1)

																	   comm(1)