04-24-2008
Hi Totus,
from aigles solution.... delimitter is ,
so, if you have tabs/spaces...i think you can use it as
awk -F " " '!mail[$4]++' inputfile
(logic is you have to specify the column correctly; i hope you noticed that i am using $4)
-ilan
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
I have a file which looks like
AA BB CC DD EE FF GG HH KK
AA BB GG HH KK FF CC DD EE
AA BB CC DD EE UU VV XX ZZ
AA BB VV XX ZZ UU CC DD EE
....
I want the script to give me only one line based on duplicate contents:
AA BB CC DD EE FF GG HH KK
AA BB CC DD EE UU VV XX ZZ (7 Replies)
Discussion started by: adsforall
7 Replies
2. Shell Programming and Scripting
Hi Guys...
Please Could you help me with the following ?
aaaa bbbb cccc sdsd
aaaa bbbb cccc qwer
as you can see, the 2 lines are matched in three fields...
how can I delete this pupicate ? I mean to delete the second one if 3 fields were duplicated ?
Thanks (14 Replies)
Discussion started by: yahyaaa
14 Replies
3. Shell Programming and Scripting
I have million's of records each containing exactly 50 characters and have to check the uniqueness of 4 character substring of 50 character (postion known prior) and report if any duplicates are found.
Eg. data...
AAAA00000000000000XXXX0000 0000000000... upto50 chars... (2 Replies)
Discussion started by: gapprasath
2 Replies
4. Shell Programming and Scripting
please help me in getting following:
Input Desired output
x="foo" foo
x="foo foo" foo
x="foo foo" foo
x="foo abc foo" foo abc
x="foo foo1 foo2" foo foo1 foo2
I need to remove duplicated from string.. (8 Replies)
Discussion started by: vickylife
8 Replies
5. Shell Programming and Scripting
Hi team,
I have 20 columns csv files. i want to find the duplicates in that file based on the column1 column10 column4 column6 coulnn8 coulunm2 . if those columns have same values . then it should be a duplicate record.
can one help me on finding the duplicates,
Thanks in advance.
... (2 Replies)
Discussion started by: baskivs
2 Replies
6. Shell Programming and Scripting
I have an input file abc.txt with info like:
abcd
rateuse
inklite
robet
rateuse
abcd
I need to remove duplicates from the file (eg: abcd,rateuse) from the file and need to place the contents in same file abc.txt if needed can be placed in another file.
can anyone help me in this :( (4 Replies)
Discussion started by: rkrish
4 Replies
7. Shell Programming and Scripting
Hi All ,
I have a requirement where I need to remove duplicates from a fixed width file which has multiple key columns .Also , need to capture the duplicate records into another file .
File has 8 columns.
Key columns are col1 and col2.
Col1 has the length of 8 col 2 has the length of 3.
... (5 Replies)
Discussion started by: saj
5 Replies
8. Shell Programming and Scripting
Hi,
I have a requirement.for eg: i have a text file with pipe symbol as delimiter(|) with 4 columns a,b,c,d. Here a and b are primary key columns..
i want to process that file to find the duplicates and null values are in primary key columns(a,b) . I want to write the unique records in which... (5 Replies)
Discussion started by: praveenraj.1991
5 Replies
9. Shell Programming and Scripting
Hi guys,Got a bit of a bind I'm in. I'm looking to remove duplicates from a pipe delimited file, but do so based on 2 columns. Sounds easy enough, but here's the kicker...
Column #1 is a simple ID, which is used to identify the duplicate.
Once dups are identified, I need to only keep the one... (2 Replies)
Discussion started by: kevinprood
2 Replies
10. Shell Programming and Scripting
Hello Gurus,
I have a multiple pipe separated files which have records going over multiple Lines. End of line separator is \n and records going over multiple lines have <CR> as separator. below is example from one file.
1|ABC DEF|100|10
2|PQ
RS
T|200|20
3| UVWXYZ|300|30
4| GHIJKL|400|40... (7 Replies)
Discussion started by: dJHa
7 Replies
fspec(4) Kernel Interfaces Manual fspec(4)
NAME
fspec - format specification in text files
DESCRIPTION
It is sometimes convenient to maintain text files on the HP-UX system with non-standard tabs, (meaning tabs that are not set at every
eighth column). Generally, such files must be converted to a standard format - frequently by replacing all tabs with the appropriate num-
ber of spaces - before they can be processed by HP-UX system commands. A format specification occurring in the first line of a text file
specifies how tabs are to be expanded in the remainder of the file.
A format specification consists of a sequence of parameters separated by blanks and surrounded by the brackets and Each parameter consists
of a keyletter, possibly followed immediately by a value. The following parameters are recognized:
The parameter specifies tab settings for the file. The value of tabs must be one of the following:
1. A list of column numbers separated by commas, indicating tabs set at the specified columns;
2. A followed immediately by an integer n, indicating tabs at intervals of n columns;
3. A followed by the name of a ``canned'' tab specification.
Standard tabs are specified by or equivalently, etc. Recognized canned tabs are defined by the command (see
tabs(1)).
The parameter specifies a maximum line size. The value of size must be an integer. Size checking is performed after
tabs have been expanded, but before the margin is inserted at the beginning of the line.
The parameter specifies a number of spaces to be inserted at the beginning of each line. The value of margin must be an
integer.
The parameter takes no value. Its presence indicates that the line containing the format specification is to be deleted
from the converted file.
The parameter takes no value. Its presence indicates that the current format is to prevail only until another format
specification is encountered in the file.
Default values (assumed for parameters not supplied) are and If the parameter is not specified, no size checking is performed. If the
first line of a file does not contain a format specification, the above defaults are assumed for the entire file. The following is an
example of a line containing a format specification:
If a format specification can be disguised as a comment, it is not necessary to code the parameter.
Several HP-UX system commands correctly interpret the format specification for a file. Among them is which can be used to convert files to
a standard format acceptable to other HP-UX system commands.
SEE ALSO
ed(1), newform(1), tabs(1).
fspec(4)