08-28-2009
Merge two rows using awk or python
Hi,
Suppose I have a space delimited file like this:
Serial# 1970 1971 1972 1973 1974
193532 21 2 X X X
200201 20 30 X X 40
200201 X X 13 15 X
393666 66 3 X X 5
393666 77 X X X X
First, I want to check the serial#, if any two lines have the same serial#,(in this case line 2+3, and line 4+5 qualify), then merge these two lines by replacing X with the value of the other line.
Also, when there is a conflict, in this case line 4+5 have the first column as 66,77(rather than having X in either line or in both lines), then do not merge even though they have the same serial#, but flag both lines with FLAGGED on the CONFLICT_FLAG column.
The result would be:
Serial# 1970 1971 1972 1973 1974 CONFLICT_FLAG
193532 21 2 X X X
200201 20 30 13 15 40
393666 66 3 X X 5 FLAGGED
393666 77 X X X X FLAGGED
Is it possible to do this in either python or awk? Thank you.
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi guys,
Please guide me if you have a solution to this problem. I have tried paste -s but it's not giving the desired output.
I have a file with the following content-
A123 box1
B345 bat2
C431 my_id
A123 service
C431 box1
A123 my_id
I need two different outputs-
OUTPUT1
A123... (6 Replies)
Discussion started by: smriti_shridhar
6 Replies
2. Shell Programming and Scripting
Hi guz I want to merge multiple rows into a multiple columns based on the first column.
The file has symbol //
I want to break the symbool // and I nedd exactlynew column at that point
the output will be like this
please guyz help in this isssue!!!!!
merging rows into columns ... (4 Replies)
Discussion started by: bogu0001
4 Replies
3. Shell Programming and Scripting
I have a large file (10M lines) that contains two columns: a frequency and a string, ex:
3 aaaaa
4 bbbbb
2 ccccc
5 aaaaa
1 ddddd
4 ccccc
I need to merge the lines whose string part is the same, while updating the frequency. The output should look like this:
8 aaaaa
4 bbbbb
5 ccccc... (2 Replies)
Discussion started by: tootles564
2 Replies
4. UNIX for Dummies Questions & Answers
Dear all
I have big file with two columns
A_AA960715 GO:0006952
A_AA960715 GO:0008152
A_AA960715 GO:0016491
A_AA960715 GO:0007165
A_AA960715 GO:0005618
A_AA960716 GO:0006952
A_AA960716 GO:0005618
A_AA960716... (15 Replies)
Discussion started by: AAWT
15 Replies
5. UNIX for Dummies Questions & Answers
Dear all,
Please help me ,,,,
if I have input file like this
A_AA960715 leucine-rich repeat-containing protein GO:0006952 defense response P
A_AA960715 leucine-rich repeat-containing protein GO:0008152 metabolic process P
A_AA960715 leucine-rich... (5 Replies)
Discussion started by: AAWT
5 Replies
6. Shell Programming and Scripting
Hi,
I have two files A (2190 rows) and file B (1100 rows). I want to merge the contents of two files based on common field, also I need the unmatched rows from file A
file A:
ABC
XYZ
PQR
file B:
>LMN|chr1:11000-12456:
>ABC|chr15:176578-187678:
>PQR|chr3:14567-15866:
output... (3 Replies)
Discussion started by: Diya123
3 Replies
7. Shell Programming and Scripting
Hello,
I need this output. thank you very much.
input:
Code:
***table***wood
***snack***top
***table***garfield
***big***zen
***table***cars
output:
Code:
***table***wood2345garfield2345cars
***snack***top
***big***zen (7 Replies)
Discussion started by: tara123
7 Replies
8. Shell Programming and Scripting
In a folder I'll several times daily receive new files that I want to combine into one big file, without any duplicate rows.
The file name in the folder will look like e.q:
MissingData_2014-08-25_09-30-18.txt
MissingData_2014-08-25_09-30-14.txt
MissingData_2014-08-26_09-30-12.txt
The content... (9 Replies)
Discussion started by: Bergans
9 Replies
9. UNIX for Dummies Questions & Answers
Hi,
I wanted to merge the content and below is input and required output info.
Input:
/hello,a,r
/hello,a,L
/hello,a,X
/hi,b,v
/hi,b,c
O/p:
/hello,a,r:L:X
/hi,v,:v:c
Use code tags, thanks. (6 Replies)
Discussion started by: ankitas
6 Replies
10. Programming
First off I am very new to python but not to scripting I have done a lot of bash scripting.
I need to create a python script for work that will combine multiple pdf files into one pdf file and archive both the combined file and the original pdf files.
So we receive zip files from a client... (6 Replies)
Discussion started by: SaltCityScripts
6 Replies
IGAWK(1) Utility Commands IGAWK(1)
NAME
igawk - gawk with include files
SYNOPSIS
igawk [ all gawk options ] -f program-file [ -- ] file ...
igawk [ all gawk options ] [ -- ] program-text file ...
DESCRIPTION
Igawk is a simple shell script that adds the ability to have ``include files'' to gawk(1).
AWK programs for igawk are the same as for gawk, except that, in addition, you may have lines like
@include getopt.awk
in your program to include the file getopt.awk from either the current directory or one of the other directories in the search path.
OPTIONS
See gawk(1) for a full description of the AWK language and the options that gawk supports.
EXAMPLES
cat << EOF > test.awk
@include getopt.awk
BEGIN {
while (getopt(ARGC, ARGV, "am:q") != -1)
...
}
EOF
igawk -f test.awk
SEE ALSO
gawk(1)
Effective AWK Programming, Edition 1.0, published by the Free Software Foundation, 1995.
AUTHOR
Arnold Robbins (arnold@skeeve.com).
Free Software Foundation Nov 3 1999 IGAWK(1)