match columns using awk


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting match columns using awk
# 1  
Old 02-17-2009
match columns using awk

Hi All,

I need some help in writing a small script using Awk.

My input file has following deatils
A,B,C,D
8239359,8239359,8388125,8388125
8239359,8239359,8388125,8388125
7165981,7165981,8363138,8363138
8283830,8283830,8382987,8382987
8209964,8209964,8367098,8367098
8279587,8279587,8345975,8345975
8283830,8283830,8382987,8382987

My aim is to get the following output
If a=b(any line column) then search (c=d) and print a,b,c,d
please note that the numbers a,b,c,d are not sorted

I would appreciate the help.

Best Regards
Pistachio
# 2  
Old 02-17-2009
If you mean a==b and c==d on the same line:
Code:
awk -F, '{ if($1==$2 && $3==$4) {print $0}  }'  inputfile

# 3  
Old 02-17-2009
... and if you mean something different, please post an example of the desired output.
# 4  
Old 02-17-2009
Thanks

Hi Jim ,

My input file basically consist of two files where a,c are part of one file and b and d are part of other file.I need to get only thoes records which bascally match a is equal to b (anywhere in the column) and c is in the same line as b is should be equal to d(anywhere in the coulmn) and then print all a,b,c,d

Appreciate the help in advance
# 5  
Old 02-17-2009
Please give two short input file examples, plus the expected output.
# 6  
Old 02-17-2009
Thanks

He guys thanks for the help

but let me try to explain this better.

If a==b then only check if c==d otherwise skip the record.
input
8239359,8239359,8388125,8388125
8283830,8283831,8382988,8382988
8283830,,8382987,
8283831,8283830,8382987,8382987
8209964,8209965,8367099,8367098
8209965,8209964,8367098,8367098
8209964,8209965,8367098,8367099

output
8239359,8239359,8388125,8388125
8283830,8283830,8382987,8382987
8283831,8283831,8382988,8382988
8209964,8209964,8367098,8367098
8209965,8209965,8367098,8367098


Best Regards
Pistachio
# 7  
Old 02-17-2009
You said at the start: two input files. one output file. This is very confusing for me.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Data match 2 files based on first 2 columns matching only and join if match

Hi, i have 2 files , the data i need to match is in masterfile and i need to pull out column 3 from master if column 1 and 2 match and output entire row to new file I have tried with join and awk and i keep getting blank outputs or same file is there an easier way than what i am... (4 Replies)
Discussion started by: axis88
4 Replies

2. Shell Programming and Scripting

Request: How to Parse dynamic SQL query to pad extra columns to match the fixed number of columns

Hello All, I have a requirement in which i will be given a sql query as input in a file with dynamic number of columns. For example some times i will get 5 columns, some times 8 columns etc up to 20 columns. So my requirement is to generate a output query which will have 20 columns all the... (7 Replies)
Discussion started by: vikas_trl
7 Replies

3. Shell Programming and Scripting

awk pattern match not printing desired columns

Hi all, I'm trying to match the following two files with the code below: awk -F, 'NR==FNR {a=$0; next} ($12,$4) in a {print $12,$1,a}' OFS="," file4.csv file3.csv but the code does not print the entire row from file4 in addition to column 12 and 1 of file3. file4: o,c,q,co,ov,b... (1 Reply)
Discussion started by: bkane3
1 Replies

4. Shell Programming and Scripting

Evaluate 2 columns, add sum IF two columns match on two rows

Hi all, I know this sounds suspiciously like a homework course; but, it is not. My goal is to take a file, and match my "ID" column to the "Date" column, if those conditions are true, add the total number of minutes worked and place it in this file, while not printing the original rows that I... (6 Replies)
Discussion started by: mtucker6784
6 Replies

5. Shell Programming and Scripting

Match files based on either of the two columns awk

Dear Shell experts, I have 2 files with structure: File 1: ID and count head test_GI_count1.txt 1000094 2 10039307 1 10039641 1 10047177 11 10047359 1 1008555 2 10120302 1 10120672 13 10121776 1 10121865 32 And 2nd file: head Protein_gi_GeneID_symbol.txt protein_gi GeneID... (11 Replies)
Discussion started by: smitra
11 Replies

6. Shell Programming and Scripting

Match on columns and replace other columns

Hi Friends, I have the following input file cat input chr1 100 200 0.1 0.2 na 1 na nd chr1 105 200 0.1 0.2 1 1 na 98 chr1 110 290 nf 1 na nd na 1 chr2 130 150 12 3 na 1 na 1 chr3 450 600 nf nf na 10 na nd chr4 300 330 1 1 10 11 23 34 My requirement is 1. If $6 is na make $7 nd and... (2 Replies)
Discussion started by: jacobs.smith
2 Replies

7. Shell Programming and Scripting

match two key columns in two files and print output (awk)

I have two files... file1 and file2. Where columns 1 and 2 of file1 match columns 1 and 2 of file2 I want to create a new file that is all file1 + columns 3 and 4 of file2 :b: Many thanks if you know how to do this.... :b: file1 31-101 106 0 92 31-101 106 29 ... (2 Replies)
Discussion started by: pelhabuan
2 Replies

8. UNIX for Dummies Questions & Answers

Awk match on columns and delete line

Hi, I have a file like this a 1 2 b 2 2 c 2 3 d 4 5 f 5 6 output a 1 2 c 2 3 d 4 5 f 5 6 Basically, I want to delete the whole line if $2 and $3 are the same. Thanks (5 Replies)
Discussion started by: jacobs.smith
5 Replies

9. Shell Programming and Scripting

Awk match multiple columns in multiple lines in single file

Hi, Input 7488 7389 chr1.fa chr1.fa 3546 9887 chr5.fa chr9.fa 7387 7898 chrX.fa chr3.fa 7488 7389 chr21.fa chr3.fa 7488 7389 chr1.fa chr1.fa 3546 9887 chr9.fa chr5.fa 7898 7387 chrX.fa chr3.fa Desired Output 7488 7389 chr1.fa chr1.fa 2 3546 9887 chr5.fa chr9.fa 2... (2 Replies)
Discussion started by: jacobs.smith
2 Replies

10. Shell Programming and Scripting

awk to match a numeric range specified by two columns

Hi Everyone, Here's a snippet of my data: File 1 = testRef2: A1BG - 13208 13284 AAA1 - 34758475 34873943 AAAS - 53701240 53715412File 2 = 42MLN.3.bedS2: 13208 13208 13360 13363 13484 13518 13518My awk script: awk 'NR == FNR{a=$1;next} {$1>=a}{$1<=a}{print... (5 Replies)
Discussion started by: heecha
5 Replies
Login or Register to Ask a Question

Featured Tech Videos