04-24-2008
Hi Totus,
from aigles solution.... delimitter is ,
so, if you have tabs/spaces...i think you can use it as
awk -F " " '!mail[$4]++' inputfile
(logic is you have to specify the column correctly; i hope you noticed that i am using $4)
-ilan
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
I have a file which looks like
AA BB CC DD EE FF GG HH KK
AA BB GG HH KK FF CC DD EE
AA BB CC DD EE UU VV XX ZZ
AA BB VV XX ZZ UU CC DD EE
....
I want the script to give me only one line based on duplicate contents:
AA BB CC DD EE FF GG HH KK
AA BB CC DD EE UU VV XX ZZ (7 Replies)
Discussion started by: adsforall
7 Replies
2. Shell Programming and Scripting
Hi Guys...
Please Could you help me with the following ?
aaaa bbbb cccc sdsd
aaaa bbbb cccc qwer
as you can see, the 2 lines are matched in three fields...
how can I delete this pupicate ? I mean to delete the second one if 3 fields were duplicated ?
Thanks (14 Replies)
Discussion started by: yahyaaa
14 Replies
3. Shell Programming and Scripting
I have million's of records each containing exactly 50 characters and have to check the uniqueness of 4 character substring of 50 character (postion known prior) and report if any duplicates are found.
Eg. data...
AAAA00000000000000XXXX0000 0000000000... upto50 chars... (2 Replies)
Discussion started by: gapprasath
2 Replies
4. Shell Programming and Scripting
please help me in getting following:
Input Desired output
x="foo" foo
x="foo foo" foo
x="foo foo" foo
x="foo abc foo" foo abc
x="foo foo1 foo2" foo foo1 foo2
I need to remove duplicated from string.. (8 Replies)
Discussion started by: vickylife
8 Replies
5. Shell Programming and Scripting
Hi team,
I have 20 columns csv files. i want to find the duplicates in that file based on the column1 column10 column4 column6 coulnn8 coulunm2 . if those columns have same values . then it should be a duplicate record.
can one help me on finding the duplicates,
Thanks in advance.
... (2 Replies)
Discussion started by: baskivs
2 Replies
6. Shell Programming and Scripting
I have an input file abc.txt with info like:
abcd
rateuse
inklite
robet
rateuse
abcd
I need to remove duplicates from the file (eg: abcd,rateuse) from the file and need to place the contents in same file abc.txt if needed can be placed in another file.
can anyone help me in this :( (4 Replies)
Discussion started by: rkrish
4 Replies
7. Shell Programming and Scripting
Hi All ,
I have a requirement where I need to remove duplicates from a fixed width file which has multiple key columns .Also , need to capture the duplicate records into another file .
File has 8 columns.
Key columns are col1 and col2.
Col1 has the length of 8 col 2 has the length of 3.
... (5 Replies)
Discussion started by: saj
5 Replies
8. Shell Programming and Scripting
Hi,
I have a requirement.for eg: i have a text file with pipe symbol as delimiter(|) with 4 columns a,b,c,d. Here a and b are primary key columns..
i want to process that file to find the duplicates and null values are in primary key columns(a,b) . I want to write the unique records in which... (5 Replies)
Discussion started by: praveenraj.1991
5 Replies
9. Shell Programming and Scripting
Hi guys,Got a bit of a bind I'm in. I'm looking to remove duplicates from a pipe delimited file, but do so based on 2 columns. Sounds easy enough, but here's the kicker...
Column #1 is a simple ID, which is used to identify the duplicate.
Once dups are identified, I need to only keep the one... (2 Replies)
Discussion started by: kevinprood
2 Replies
10. Shell Programming and Scripting
Hello Gurus,
I have a multiple pipe separated files which have records going over multiple Lines. End of line separator is \n and records going over multiple lines have <CR> as separator. below is example from one file.
1|ABC DEF|100|10
2|PQ
RS
T|200|20
3| UVWXYZ|300|30
4| GHIJKL|400|40... (7 Replies)
Discussion started by: dJHa
7 Replies
cdoc(1) General Commands Manual cdoc(1)
Name
cdoc - invokes CDA Converter
Syntax
cdoc [ -s format ] [ -d format ] [ -O options_file ] [ -o outputfile ] inputfile
Description
The command converts the revisable format file, inputfile, to another revisable format or to a final form file. If inputfile is not speci-
fied, reads from standard input. Unless a destination file is specified with the -o option, the command writes files to standard output.
Options
-s format Specifies the format of inputfile and invokes an appropriate input converter as part of CDA. The ddif, dtif, dots (for
analysis output only) and text converters are provided in the base system kit. Additional converters can be added by
the CDA Converter Library and other layered products. Converter Library and other layered products. Contact your sys-
tem manager for a complete list of the input formats supported on your system. The default format is ddif.
-d format Specifies the format of outputfile and invokes an appropriate output converter as part of CDA. The ddif, dtif, text,
analysis, and ps converters are provided in the base system kit. Additional converters can be added by the CDA Con-
verter Library and other layered products. Contact your system manager for a complete list of the output formats sup-
ported on your system. The default format is ddif.
-O options_file Names the file passed to the input and output converters to control specific processing options for each converter.
Refer to your documentation set for a description of converter options.
The options file has a default file type of .cda_options. Each line of the options file specifies a format name that
can optionally be followed by _input or _output to restrict the option to either an input or output converter. The sec-
ond word is a valid option preceded by one or more spaces, tabs, or a slash (/) and can contain upper- and lowercase
letters, numbers, dollar signs, and underlines. The case of letters is not significant. If an option requires a value,
then spaces, tabs, or an equal sign can separate the option from the value.
Each line can optionally be preceded by spaces and tabs and can be terminated by any character other than those that
can be used to specify the format names and options. The syntax and interpretation of the text that follows the format
name is specified by the supplier of the front and back end converters for the specified format.
To specify several options for the same input or output format, specify one option on a line. If an invalid option for
an input or output format or an invalid value for an option is specified, the option may be ignored or an error message
may be returned. Each input or output format that supports processing options specifies any restrictions or special
formats required when specifying options.
By default, any messages that occur during processing of the options file are written to the system standard error
location. For those input and output formats that support a LOG option, messages can be directed to a log file.
-o outputfile Specifies the name of the output file. If not specified, writes to standard output.
See Also
vdoc(1), dxvdoc(1X), DDIF(5), DTIF(5), DOTS(5), CDA(5)
cdoc(1)