finding duplicates in columns and removing lines Post: 302189010

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Removing lines that are (same in content) based on columns

I have a file which looks like AA BB CC DD EE FF GG HH KK AA BB GG HH KK FF CC DD EE AA BB CC DD EE UU VV XX ZZ AA BB VV XX ZZ UU CC DD EE .... I want the script to give me only one line based on duplicate contents: AA BB CC DD EE FF GG HH KK AA BB CC DD EE UU VV XX ZZ

2. Shell Programming and Scripting

Help removing lines with duplicated columns

Hi Guys... Please Could you help me with the following ? aaaa bbbb cccc sdsd aaaa bbbb cccc qwer as you can see, the 2 lines are matched in three fields... how can I delete this pupicate ? I mean to delete the second one if 3 fields were duplicated ? Thanks

3. Shell Programming and Scripting

Finding duplicates from positioned substring across lines

I have million's of records each containing exactly 50 characters and have to check the uniqueness of 4 character substring of 50 character (postion known prior) and report if any duplicates are found. Eg. data... AAAA00000000000000XXXX0000 0000000000... upto50 chars...

4. Shell Programming and Scripting

Removing duplicates from string (not duplicate lines)

please help me in getting following: Input Desired output x="foo" foo x="foo foo" foo x="foo foo" foo x="foo abc foo" foo abc x="foo foo1 foo2" foo foo1 foo2 I need to remove duplicated from string..

5. Shell Programming and Scripting

finding duplicates in csv based on key columns

Hi team, I have 20 columns csv files. i want to find the duplicates in that file based on the column1 column10 column4 column6 coulnn8 coulunm2 . if those columns have same values . then it should be a duplicate record. can one help me on finding the duplicates, Thanks in advance. ...

6. Shell Programming and Scripting

Help in removing duplicates

I have an input file abc.txt with info like: abcd rateuse inklite robet rateuse abcd I need to remove duplicates from the file (eg: abcd,rateuse) from the file and need to place the contents in same file abc.txt if needed can be placed in another file. can anyone help me in this :(

7. Shell Programming and Scripting

Removing duplicates in fixed width file which has multiple key columns

Hi All , I have a requirement where I need to remove duplicates from a fixed width file which has multiple key columns .Also , need to capture the duplicate records into another file . File has 8 columns. Key columns are col1 and col2. Col1 has the length of 8 col 2 has the length of 3. ...

8. Shell Programming and Scripting

UNIX scripting for finding duplicates and null records in pk columns

Hi, I have a requirement.for eg: i have a text file with pipe symbol as delimiter(|) with 4 columns a,b,c,d. Here a and b are primary key columns.. i want to process that file to find the duplicates and null values are in primary key columns(a,b) . I want to write the unique records in which...

9. Shell Programming and Scripting

Removing duplicates from delimited file based on 2 columns

Hi guys,Got a bit of a bind I'm in. I'm looking to remove duplicates from a pipe delimited file, but do so based on 2 columns. Sounds easy enough, but here's the kicker... Column #1 is a simple ID, which is used to identify the duplicate. Once dups are identified, I need to only keep the one...

10. Shell Programming and Scripting

Removing carriage returns from multiple lines in multiple files of different number of columns

Hello Gurus, I have a multiple pipe separated files which have records going over multiple Lines. End of line separator is \n and records going over multiple lines have <CR> as separator. below is example from one file. 1|ABC DEF|100|10 2|PQ RS T|200|20 3| UVWXYZ|300|30 4| GHIJKL|400|40...

LEARN ABOUT OSF1

uniq

uniq(1) 						      General Commands Manual							   uniq(1)

NAME

       uniq - Removes or lists repeated lines in a file

SYNOPSIS

   Current Syntax
       uniq [-cdu] [-f fields] [-s chars] [input-file [output-file]]

   Obsolescent Syntax
       uniq [-cdu] [-fields] [+chars] [input-file [output-file]]

       The uniq command reads from the specified input_file, compares adjacent lines, removes the second and succeeding occurrences of a line, and
       writes to standard output.

STANDARDS

       Interfaces documented on this reference page conform to industry standards as follows:

       uniq:  XCU5.0

       Refer to the standards(5) reference page for more information about industry standards and associated tags.

OPTIONS

       Precedes each output line with a count of the number of times each line appears in the file.  This option supersedes the -d and -u options.
       Displays  repeated lines only.  Ignores the first fields fields on each input line when doing comparisons, where fields is a positive deci-
       mal integer.  A field is the maximal string matched by the basic regular expression:

	      [[:blank:]]*[^[:blank:]]*

	      If the fields argument specifies more fields than appear on an input line, a null string is used for comparisons.  Ignores the spec-
	      ified number of characters when doing comparisons.  The chars argument is a positive decimal integer.

	      If specified with the -f option, the first chars characters after the first fields fields are ignored.  If the chars argument speci-
	      fies more characters than remain on an input line, uniq uses a null string for comparison.  Displays unique lines only.	Equivalent
	      to -f fields.  (Obsolescent) Equivalent to -s chars.  (Obsolescent)

OPERANDS

       A pathname for the input file.

	      If this operand is omitted or specified as -, then standard input is read.  A pathname for the output file.

	      If this operand is omitted, then standard output is written.

DESCRIPTION

       The  input_file	and  output_file  arguments must be different files.  If the input_file operand is not specified, or if it is -, uniq uses
       standard input.

       Repeated lines must be on consecutive lines to be found.  You can arrange them with the sort command before processing.

EXAMPLES

       To delete repeated lines in the following file called fruit and save it to a file named newfruit, enter: uniq fruit newfruit

       The file fruit contains the following lines:

       apples apples bananas cherries cherries peaches pears

       The file newfruit contains the following lines:

       apples bananas cherries peaches pears

EXIT STATUS

       The following exit values are returned: Successful completion.  An error occurred.

ENVIRONMENT VARIABLES

       The following environment variables affect the execution of uniq: Provides a default value for the internationalization variables that  are
       unset or null. If LANG is unset or null, the corresponding value from the default locale is used.  If any of the internationalization vari-
       ables contain an invalid setting, the utility behaves as if none of the variables had been defined.  If set to a  non-empty  string  value,
       overrides  the  values of all the other internationalization variables.	Determines the locale for the interpretation of sequences of bytes
       of text data as characters (for example, single-byte as opposed to multibyte characters in arguments).  Determines the locale for the  for-
       mat  and  contents  of  diagnostic messages written to standard error.  Determines the location of message catalogues for the processing of
       LC_MESSAGES.

SEE ALSO

       Commands:  comm(1), sort(1)

       Standards:  standards(5)

																	   uniq(1)

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Removing lines that are (same in content) based on columns

Discussion started by: adsforall

2. Shell Programming and Scripting

Help removing lines with duplicated columns

Discussion started by: yahyaaa

3. Shell Programming and Scripting

Finding duplicates from positioned substring across lines

Discussion started by: gapprasath

4. Shell Programming and Scripting

Removing duplicates from string (not duplicate lines)

Discussion started by: vickylife