comparing columns of a file with another file

01-11-2011

Registered User

2, 0

Join Date: Jan 2011

Last Activity: 13 January 2011, 9:28 AM EST

Posts: 2

Thanks Given: 0

Thanked 0 Times in 0 Posts

comparing columns of a file with another file

Hello experts,

It has been just one month since I started with perl. I have the following doubt.

I have two files

File 1 looks like this

Code:

GKHGGSS0098       PPP.100.F.LE
GKHYXDF9081       KKK.100.F.LE
GKHSDFT6546       JKL.100.F.LE
GKHGGHJ3123       ABC.100.F.LE

File 2 looks like this

Code:

>GKHGGSS0098 
atatatatagacagatgaacagat
>GKHGGSS0098 
atatatatatatatatatatatatatatata
>GKHYXDF9081
gggacacatagacagatagaca
>GKHSDFT6546
gacagatatatatatatatatata
>GKHGGHJ3123
ggccgcgcgcatagacaccagatagacagat

So I want to write a script that reads column 2 in file 1 and considers only those entries whose first 3 letters are PPP or KKK.

In the above there is one PPP and one KKK so the script will take two entries from the file. The the script needs to see the first column value for PPP and KKK, which are GKHGGSS0098 and GKHYXDF9081.

Finally the script will compare these first column values of file one with that of file two (with the part after the > sign).

Once they match, the script will extract them from file two and store them in a result file.

So in this case, the output file will have

Code:

>GKHGGSS0098 
atatatatagacagatgaacagat
>GKHGGSS0098 
 atatatatatatatatatatatatatatata
>GKHYXDF9081
gggacacatagacagatagaca

I donot know how to go about with this. Should I use a hash. Please help

Newbie

Last edited by Franklin52; 01-11-2011 at 04:51 PM.. Reason: Please use code tags

newbie_perl2011

View Public Profile for newbie_perl2011

Find all posts by newbie_perl2011

01-11-2011

Registered User

1,271, 299

Join Date: Sep 2009

Last Activity: 17 July 2019, 5:46 PM EDT

Location: ./India/Bangalore

Posts: 1,271

Thanks Given: 70

Thanked 299 Times in 290 Posts

Try this,

Code:

#!/usr/bin/perl
open(FH,"<","file1");
open(FH1,"<","file2");
while (<FH>) {
if(/(.+?)\s+(PPP|KKK).*/) {$hs{$1}=$2;}
}
while(<FH1>) {
if ($p) {print $_;$p=0;}
if(/\>(.+?)\s+/) {if ($hs{$1}) {print "\>",$1,"\n";$p=1;}}
}

pravin27

View Public Profile for pravin27

Find all posts by pravin27

01-13-2011

Registered User

2, 0

Join Date: Jan 2011

Last Activity: 13 January 2011, 9:28 AM EST

Posts: 2

Thanks Given: 0

Thanked 0 Times in 0 Posts

Hey thank you so much. But however I want to create two resulting files. like a file which will be like file 1 but only contain
GKHGGSS0098 PPP.100.F.LE
GKHYXDF9081 KKK.100.F.LE

And another file like file 2 that will contain the corresponding data as printed on the screen now..

Please do let me know if I am clear

newbie_perl2011

View Public Profile for newbie_perl2011

Find all posts by newbie_perl2011

Shell Programming and Scripting

comparing columns of a file with another file

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Match Columns in one file and extract columns from another file

Discussion started by: genehunter

2. Shell Programming and Scripting

Comparing Select Columns from two CSV files in UNIX and create a third file based on comparision

Discussion started by: ady_koolz

3. Shell Programming and Scripting

Comparing Columns and writing a new file

Discussion started by: cs_novice

4. Shell Programming and Scripting

Comparing columns in a file

Discussion started by: shoaibjameel123

5. Shell Programming and Scripting

Remove duplicate lines from first file comparing second file

Discussion started by: gani_85

6. UNIX for Dummies Questions & Answers

Comparing the 2nd column in two different files and printing corresponding 9th columns in new file

Discussion started by: Unilearn

7. UNIX Desktop Questions & Answers

COMPARING COLUMNS IN A TEXT FILE

Discussion started by: whitecross

8. Shell Programming and Scripting

Replace specific columns in one file with columns in another file

Discussion started by: mehdib

9. Shell Programming and Scripting

Comparing Columns and printing the difference from a particular file

Discussion started by: buzzusa

10. Shell Programming and Scripting

Help with comparing columns from a csv file

Discussion started by: sickboy