Extract duplicate fields in rows Post: 302147920

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

duplicate rows in a file

hi all can anyone please let me know if there is a way to find out duplicate rows in a file. i have a file that has hundreds of numbers(all in next row). i want to find out the numbers that are repeted in the file. eg. 123434 534 5575 4746767 347624 5575 i want 5575 please help

2. Shell Programming and Scripting

How to extract duplicate rows

I have searched the internet for duplicate row extracting. All I have seen is extracting good rows or eliminating duplicate rows. How do I extract duplicate rows from a flat file in unix. I'm using Korn shell on HP Unix. For.eg. FlatFile.txt ======== 123:456:678 123:456:678 123:456:876...

3. HP-UX

How to get Duplicate rows in a file

Hi all, I have written one shell script. The output file of this script is having sql output. In that file, I want to extract the rows which are having multiple entries(duplicate rows). For example, the output file will be like the following way. ...

4. Shell Programming and Scripting

How to extract duplicate rows

Hi! I have a file as below: line1 line2 line2 line3 line3 line3 line4 line4 line4 line4 I would like to extract duplicate lines (not unique, triplicate or quadruplicate lines). Output will be as below: line2 line2 I would appreciate if anyone can help. Thanks.

5. Shell Programming and Scripting

Find duplicate based on 'n' fields and mark the duplicate as 'D'

Hi, In a file, I have to mark duplicate records as 'D' and the latest record alone as 'C'. In the below file, I have to identify if duplicate records are there or not based on Man_ID, Man_DT, Ship_ID and I have to mark the record with latest Ship_DT as "C" and other as "D" (I have to create...

6. Shell Programming and Scripting

Extract fields from different rows.

Hi, I have data like below. SID=D6EB96CC0 HID=9C246D6 CSource=xya Cappe=1 Versionc=3670 MAR1=STL MARS2=STL REQ_BUFFER_ENCODING=UTF-8 REQ_BUFFER_ORIG_ENCODING=UTF-8 RESP_BODY_ENCODING=UTF-8 CON_ID=2713 I want to select CSource=xya

7. Shell Programming and Scripting

Delete duplicate rows

Hi, This is a followup to my earlier post him mno klm 20 76 . + . klm_mango unix_00000001; alp fdc klm 123 456 . + . klm_mango unix_0000103; her tkr klm 415 439 . + . klm_mango unix_00001043; abc tvr klm 20 76 . + . klm_mango unix_00000001; abc def klm 83 84 . + . klm_mango...

8. Shell Programming and Scripting

Extract and count number of Duplicate rows

Hi All, I need to extract duplicate rows from a file and write these bad records into another file. And need to have a count of these bad records. i have a command awk ' {s++} END { for(i in s) { if(s>1) { print i } } }' ${TMP_DUPE_RECS}>>${TMP_BAD_DATA_DUPE_RECS}...

9. Shell Programming and Scripting

Extract duplicate rows with conditions

Gents Can you help please. Input file 5490921425 1 7 1310342 54909214251 5490921425 2 1 1 54909214252 5491120937 1 1 3 54911209371 5491120937 3 1 1 54911209373 5491320785 1 ...

10. Shell Programming and Scripting

Extract and exclude rows based on duplicate values

Hello I have a file like this: > cat examplefile ghi|NN603762|eee mno|NN607265|ttt pqr|NN613879|yyy stu|NN615002|uuu jkl|NN607265|rrr vwx|NN615002|iii yzA|NN618555|ooo def|NN190486|www BCD|NN628717|ppp abc|NN190486|qqq EFG|NN628717|aaa HIJ|NN628717|sss > I can sort the file by...

LEARN ABOUT OSX

cut

CUT(1)							    BSD General Commands Manual 						    CUT(1)

NAME

     cut -- cut out selected portions of each line of a file

SYNOPSIS

     cut -b list [-n] [file ...]
     cut -c list [file ...]
     cut -f list [-d delim] [-s] [file ...]

DESCRIPTION

     The cut utility cuts out selected portions of each line (as specified by list) from each file and writes them to the standard output.  If no
     file arguments are specified, or a file argument is a single dash ('-'), cut reads from the standard input.  The items specified by list can
     be in terms of column position or in terms of fields delimited by a special character.  Column numbering starts from 1.

     The list option argument is a comma or whitespace separated set of numbers and/or number ranges.  Number ranges consist of a number, a dash
     ('-'), and a second number and select the fields or columns from the first number to the second, inclusive.  Numbers or number ranges may be
     preceded by a dash, which selects all fields or columns from 1 to the last number.  Numbers or number ranges may be followed by a dash, which
     selects all fields or columns from the last number to the end of the line.  Numbers and number ranges may be repeated, overlapping, and in
     any order.  If a field or column is specified multiple times, it will appear only once in the output.  It is not an error to select fields or
     columns not present in the input line.

     The options are as follows:

     -b list
	     The list specifies byte positions.

     -c list
	     The list specifies character positions.

     -d delim
	     Use delim as the field delimiter character instead of the tab character.

     -f list
	     The list specifies fields, separated in the input by the field delimiter character (see the -d option.)  Output fields are separated
	     by a single occurrence of the field delimiter character.

     -n      Do not split multi-byte characters.  Characters will only be output if at least one byte is selected, and, after a prefix of zero or
	     more unselected bytes, the rest of the bytes that form the character are selected.

     -s      Suppress lines with no field delimiter characters.  Unless specified, lines with no delimiters are passed through unmodified.

ENVIRONMENT

     The LANG, LC_ALL and LC_CTYPE environment variables affect the execution of cut as described in environ(7).

EXIT STATUS

     The cut utility exits 0 on success, and >0 if an error occurs.

EXAMPLES

     Extract users' login names and shells from the system passwd(5) file as ``name:shell'' pairs:

	   cut -d : -f 1,7 /etc/passwd

     Show the names and login times of the currently logged in users:

	   who | cut -c 1-16,26-38

SEE ALSO

     colrm(1), paste(1)

STANDARDS

     The cut utility conforms to IEEE Std 1003.2-1992 (``POSIX.2'').

HISTORY

     A cut command appeared in AT&T System III UNIX.

BSD
								 December 21, 2006							       BSD

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

duplicate rows in a file

Discussion started by: infyanurag

2. Shell Programming and Scripting

How to extract duplicate rows

Discussion started by: bobbygsk

3. HP-UX

How to get Duplicate rows in a file

Discussion started by: raghu.iv85

4. Shell Programming and Scripting

How to extract duplicate rows

Discussion started by: chromatin

5. Shell Programming and Scripting

Find duplicate based on 'n' fields and mark the duplicate as 'D'

Discussion started by: machomaddy

6. Shell Programming and Scripting

Extract fields from different rows.

Discussion started by: chetan.c

7. Shell Programming and Scripting

Delete duplicate rows

Discussion started by: jacobs.smith

8. Shell Programming and Scripting

Extract and count number of Duplicate rows

Discussion started by: Arun Mishra

9. Shell Programming and Scripting

Extract duplicate rows with conditions

Discussion started by: jiam912

10. Shell Programming and Scripting

Extract and exclude rows based on duplicate values

Discussion started by: CHoggarth

LEARN ABOUT OSX

cut