Lookup name from another file


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Lookup name from another file
# 1  
Old 01-28-2014
Lookup name from another file

Hi All,

I want to lookup name for an id in col2 input from another file and add the name to each line.

Input 1
Code:
 comp100001_c0_seq1      At1g31340       30.40   569     384     11      3       1673    313     834     7e-62    237
 comp100003_c0_seq1      At1g35370_2     35.00   80      50      2       597     364     678     753     1e-09   42.7

Input 2

Code:
       [R] KOG0017 FOG: Transposon-encoded proteins with TYA, reverse transcriptase, integrase domains in various combinations
         ath:  At1g10260
         ath:  At1g11265
         ath:  At1g35050
         ath:  At1g35370_2
           ath:  At1g35647
         
         [OR] KOG0001 Ubiquitin and ubiquitin-like proteins
         ath:  At1g31340
         ath:  At1g53930
         ath:  At1g53950
         ath:  At1g53980
             ath:  At1g64470

Expected output


Code:
comp100001_c0_seq1      At1g31340       30.40   569     384     11      3       1673    313     834     7e-62    237    [OR] KOG0001 Ubiquitin and ubiquitin-like proteins
 comp100003_c0_seq1      At1g35370_2     35.00   80      50      2       597     364     678     753     1e-09   42.7    [R] KOG0017 FOG: Transposon-encoded proteins with TYA, reverse transcriptase, integrase domains in various combinations

# 2  
Old 01-28-2014
Hello,

Following may help.


Code:
awk 'NR==FNR{a[$2];next} ($2 in a) {print $0}' file2  file1

Output will be as follows.


Code:
comp100001_c0_seq1      At1g31340       30.40   569     384     11      3       1673    313     834     7e-62    237
 comp100003_c0_seq1      At1g35370_2     35.00   80      50      2       597     364     678     753     1e-09   42.7


EDIT: What is the logic to get the last column data in your expected Output. Sorry I have noticed just now the last column.


Thanks,
R. Singh
# 3  
Old 01-28-2014
The last column is the name corresponding with col2 of input 1 which is at the last header at the top starting with [some alphabets] some description.

So for At1g31340 it is [OR] Ubiquitin and ubiquitin-like proteins and for At1g35370_2 it is [R] KOG0017 FOG: Transposon-encoded proteins with TYA, reverse transcriptase, integrase domains in various combinations.

Please note that names for some ids may not be found. they should be left as it is, that is no name is added to the last column.
# 4  
Old 01-28-2014
Here is an awk program based on some assumptions:
Code:
awk '
        NR == FNR {
                if ( $0 ~ /\[[A-Z]*\]/ )
                        D = $0
                else
                        A[$NF] = D
                next
        }
        $2 in A {
                $0 = $0 FS A[$2]
        }
        1
' input2 input1

This User Gave Thanks to Yoda For This Post:
# 5  
Old 01-28-2014
Thanks a lot Yoda, you are too good boss. Could you please exaplain the code please.



Thanks,
R. Singh
# 6  
Old 01-28-2014
thank you, this looks perfect,
# 7  
Old 01-28-2014
Thank you, this looks perfect
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Lookup value from another file

Hi Experts, I need you help in coding the below scenario in unix. Please help Lets say i have below 2 records in my file1 a;b;c a;d and in file 2 , i have the below information a:alpha b:beta c:code d:delta then i want to lookup on file2 and replace my content in file... (2 Replies)
Discussion started by: ashishagg2005
2 Replies

2. Shell Programming and Scripting

Lookup Within a file

A text file has two logs of same event & both logs have to be correlated into a single line before same can be processed further RAW OPERATING LOG Date Month Year Time Event 10 JAN 2014 1000 4 11 APR 2013 1230 2 12 FEB 2014 ... (7 Replies)
Discussion started by: newageBATMAN
7 Replies

3. UNIX for Dummies Questions & Answers

Help with AWK - Compare a field in a file to lookup file and substitute if only a match

I have the below 2 files: 1) Third field from file1.txt should be compared to the first field of lookup.txt. 2) If match found then third field, file1.txt should be substituted with the second field from lookup.txt. 3)Else just print the line from file1.txt. File1.txt:... (4 Replies)
Discussion started by: venalla_shine
4 Replies

4. Shell Programming and Scripting

Lookup file

Hi, need your help to lookup these 2 files main.txt RNPMS01,PMS717W_Marasi,CXP9016141/1_R7G04,EXECUTING RNPMS01,RAP765W_BakaranBatu,CXP9014346/1_R6AG03,EXECUTING RNPMS01,RNPMS01,CXP9014711/2_R5Z,EXECUTING RNPMS01,TBT510W_Bandar_Utama,CXP9014346/1_R6AG03,EXECUTING... (8 Replies)
Discussion started by: singgih
8 Replies

5. UNIX for Advanced & Expert Users

Clueless about how to lookup and reverse lookup IP addresses under a file!!.pls help

Write a quick shell snippet to find all of the IPV4 IP addresses in any and all of the files under /var/lib/output/*, ignoring whatever else may be in those files. Perform a reverse lookup on each, and format the output neatly, like "IP=192.168.0.1, ... (0 Replies)
Discussion started by: choco4202002
0 Replies

6. Shell Programming and Scripting

Lookup file

I have two files ,File1 lookup to file2 based on 1st&3rd against 1st and 2nd column and produce the following o/p File1 450000|USD|USD 450006|GKSGD|SGD 450002|XSGD|SGD File2 ----- 450000|USD|2000.00|10000.000 450006|SGD|1000.200|3000.000 450002|SGD|3000.000|20000.00 O/p ... (4 Replies)
Discussion started by: mohan705
4 Replies

7. Shell Programming and Scripting

Lookup on a file

Hi, I have the following requirement. I have one lookup file which contains 15 columns and 7000 records. Ex: 123,MEDICA,134,145,1178,123,678,345,2345,HP,COL,K12,SR,OX,78919 I have input file which contains 14 columns and 20 million records.Some times the record count is more... (4 Replies)
Discussion started by: ukatru
4 Replies

8. UNIX for Advanced & Expert Users

Lookup on a file

Hi, I have the following requirement. I have one lookup file which contains 15 columns and 7000 records. Ex: 123,MEDICA,134,145,1178,123,678,345,2345,HP,COL,K12,SR,OX,78919 I have input file which contains 14 columns and 20 million records.Some times the record count is more than 20... (1 Reply)
Discussion started by: ukatru
1 Replies

9. UNIX for Dummies Questions & Answers

Lookup with a file

Hi All, i have a variable which has a value in it. RETAILER='JEWL' i have a text file. Name: file.txt file.txt ________ WLG 150 JEWL 60 CVS 240 FLN 120 WND 120 I am trying to write a korn script.the script, based on the value in the RETAILER will do a look up against the... (5 Replies)
Discussion started by: pavan_test
5 Replies

10. UNIX for Dummies Questions & Answers

file lookup

I need to do a text lookup for multiple records against a file that contains a key and two results. I found this code: str=`awk '$1 == search' search=$1 lkup.tbl It's a little tricky because the first $1 is the key variables location in the lkup.tbl and the second $1 is the parameter passed... (6 Replies)
Discussion started by: gillbates
6 Replies
Login or Register to Ask a Question