12-04-2008
awk -F"," '{ my_array[$1","$2]+=$3 } END { for (each_rec in my_array) printf("%s,%s\n", each_rec,my_array[each_rec])}' input_file > output_file
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have a input file with formating:
6000000901 ;36200103 ;h3a01f496 ;
2000123605 ;36218982 ;heefa1328 ;
2000273132 ;36246985 ;h08c5cb71 ;
2000041207 ;36246985 ;heef75497 ;
Each fields is seperated by semi-comma. Sometime, the second files is... (6 Replies)
Discussion started by: anhtt
6 Replies
2. Shell Programming and Scripting
Dear experts,
I have a file1 that looks like
60127930928 2091
60129382039 2092
60126382937 2091
60128937928 2061
60127329389 2062
60123748730 2061
60128730293 2061
and file 2 that looks like
60127930928 2091
60129382039 2092
60126382937 2093
60128937928 2061
60127329389... (2 Replies)
Discussion started by: aismann
2 Replies
3. Shell Programming and Scripting
I have a file (key.dat) that contains two columns:
AA|1234|
BB|567|
CC|8910|
I have another file (extract.dat) that contains some data:
SD|458|John|Smith|
AA|3345|Frank|Williams|
AA|1234|Bill|Garner|
BD|0098|Yu|Lin|
BB|567|Gail|Hansen|
CC|8910|Ken|Nielsen|
I want to compare the... (5 Replies)
Discussion started by: ChicagoBlues
5 Replies
4. Linux
Hi
I am having 2 fields and if f1=f2 i wanna print that line
eg
1 2
1 3
1 9
2 2
3 5
9 9
In the abov eg. the highlighted lines shud be printed
2 2
9 9
Thanking u (3 Replies)
Discussion started by: binnybio
3 Replies
5. Shell Programming and Scripting
Hi all,
say for example i have the next input file
30
Au 7.500000 7.500000 5.774000
Au 7.500000 8.995000 8.363000
Au 7.500000 6.005000 8.363000
Au 20.633000 7.500000 9.226000
Au 20.632000 6.005000 6.637000
Au 20.632000 ... (4 Replies)
Discussion started by: ezitoc
4 Replies
6. Shell Programming and Scripting
Hi,
I am trying to use awk to remove all rows where the first 4 fields are duplicates. e.g. in the following data lines 6-9 would be removed, leaving one copy of the duplicated row (row 5)
Borgarhraun FH9822 ol24 FH9822_ol24_m20 ol Deformed c
Borgarhraun FH9822 ol24 ... (3 Replies)
Discussion started by: tomahawk
3 Replies
7. Shell Programming and Scripting
Hi,
I have data like below.
SID=D6EB96CC0
HID=9C246D6
CSource=xya
Cappe=1
Versionc=3670
MAR1=STL
MARS2=STL
REQ_BUFFER_ENCODING=UTF-8
REQ_BUFFER_ORIG_ENCODING=UTF-8
RESP_BODY_ENCODING=UTF-8
CON_ID=2713
I want to select
CSource=xya (18 Replies)
Discussion started by: chetan.c
18 Replies
8. Shell Programming and Scripting
Hello,
I met a challenge to extract part of the table. I'd like to grep the first three matches based on field1 and field2. Input:
D A 92.85 1315 83 11
D A 95.90 757 28 3
D A 94.38 480 20 7
D A 91.21 307 21 6
D A 94.26 244 ... (6 Replies)
Discussion started by: yifangt
6 Replies
9. Shell Programming and Scripting
Hello,
I need this output. thank you very much.
input:
Code:
***table***wood
***snack***top
***table***garfield
***big***zen
***table***cars
output:
Code:
***table***wood2345garfield2345cars
***snack***top
***big***zen (7 Replies)
Discussion started by: tara123
7 Replies
10. UNIX for Dummies Questions & Answers
I have a file like this
AFF3 BCL2
AGTRAP BRAF
AHRR NCOA2
AKAP9 BRAF
And second input file like this
chromosome start end gene
chr1 38177326 38664955 AFF3
chr4 148077060 148088064 AGTRAP
chr13 74211117 74292309 AHRR
chr5 3928185 ... (4 Replies)
Discussion started by: raj_k
4 Replies
uniq(1) General Commands Manual uniq(1)
NAME
uniq - report repeated lines in a file
SYNOPSIS
fields] chars] [input_file [output_file]]
DESCRIPTION
reads the input text file input_file, comparing adjacent lines, and copies the result to output_file. If input_file is not specified, the
standard input and standard output are used. If input_file is specified, but output_file is not, results are printed to standard output.
input_file and output_file must not be the same file.
Line-Comparison Options
recognizes the following options when comparing adjacent lines:
Print those lines that are repeated in the original file.
Print copy only of each repeated line in the input file.
Generate an output report in default style
except that each line is preceded by a count of the number of times it occurred. If this option is specified, the and
options are ignored if either or both are also present.
If none of the options or are present, prints the results of the union of the and options, producing a copy of the original input file with
the second and succeeding copies of any repeated lines removed. (Note that repeated lines must be adjacent in order to be found -- see
sort(1)).
Field-Skip Options
Two options are provided for skipping an initial portion of each line when making comparisons:
Ignore the first
fields fields, together with any blanks before each. fields is a positive decimal integer. A field is defined as a
string of non-space, non-tab characters separated by tabs and/or spaces from its neighbors.
Ignore the first
chars characters. chars is a positive decimal integer. Each line in the input is assumed to be terminated with a
new line character for purposes of comparison. Fields are skipped before characters.
EXTERNAL INFLUENCES
Environment Variables
must be equal to the value it had when the input files were sorted.
determines the interpretation of text within files as single- and/or multi-byte characters, and defines a space character when the or
option is used.
determines the language in which messages are displayed.
If or is not specified in the environment or is set to the empty string, the value of is used as a default for each unspecified or empty
variable. If is not specified or is set to the empty string, a default of "C" (see lang(5)) is used instead of If any internationalization
variable contains an invalid setting, behaves as if all internationalization variables are set to "C". See environ(5).
International Code Set Support
Single- and multi-byte character code sets are supported.
RETURN VALUE
Exit values are:
0 Successful completion.
>0 Error condition occurred.
AUTHOR
was developed by OSF and HP.
SEE ALSO
comm(1), sort(1).
STANDARDS CONFORMANCE
uniq(1)