Compare two files


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Compare two files
# 1  
Old 07-25-2012
Compare two files

Hi all

I have to compare two files and find out common entries and write the data before it

I have to compare these two files:

One file with 3 columns

Quote:
Lepirudin:Refludan:F2
Cetuximab:Erbitux:FCGR2A,FCGR2B,FCGR2C,EGFR,FCGR3B,C1R,C1QA,C1QB,C1QC,FCGR3A,C1S,FCGR1A
Dornase Alfa:Pulmozyme:Not Available
Denileukin diftitox:Ontak:IL2RA,IL2RB,IL2RG
Etanercept:Enbrel:C1S,C1R,C1QA,C1QB,C1QC,TNF,TNFRSF1B,FCGR1A,FCGR3A,FCGR2A,FCGR2B,FCGR2C,LTA,FCGR3B
Bivalirudin:Angiomax:F2
Leuprolide:Eligard:GNRHR
Peginterferon alfa-2a:Pegasys:IFNAR2,IFNAR1
Alteplase:Activase (Genentech Inc):PLG,FGA,PLAUR,SERPINE1
Sermorelin:Geref:GHRHR
Interferon alfa-n1:Wellferon (GlaxoSmithKline):IFNAR2,IFNAR1
Darbepoetin alfa:Aranesp:EPOR
Urokinase:Abbokinase:NID1,PLG,PLAUR,PLAU,PLAT,SERPINE1,SERPINB2,SERPINA5,LRP2,ST14
Goserelin:Zoladex:LHCGR,GNRHR
Reteplase:Retavase (Centocor):PLG,FGA,PLAUR,SERPINE1
Epoetin alfa:Epogen:EPOR
Salmon Calcitonin:Calcimar:CALCR
Interferon alfa-n3:Alferon (Interferon Sciences Inc.):IFNAR1,IFNAR2
Pegfilgrastim:Neulasta (Amgen Inc.):CSF3R,ELANE
Sargramostim:Immunex:CSF2RA,IL3RA,CSF2RB,SDC2,PRG2
Secretin:SecreFlo:SCTR
Peginterferon alfa-2b:PEG-Intron (Schering Corp):IFNAR1,IFNAR2
Asparaginase:Elspar (Merck & Co. Inc):Not Available
Thyrotropin Alfa:Thyrogen (Genzyme Inc):TSHR
Antihemophilic Factor:Advate:LRP1,MCFD2,F10,F9,VWF,PHYH,ASGR2,HSPA5,CALR,CANX,LMAN1
Anakinra:Kineret (Amgen Inc):IL1R1
Gramicidin D:Neosporin:
Intravenous Immunoglobulin:Civacir:C4B,C5,FCGR1A,FCGR1B,FCGR2A,FCGR2B,FCGR2C,FCGR3A,FCGR3B,C3,C4A
Anistreplase:Eminase (Wulfing Pharma GmbH):PLG,FGA,PLAUR,SERPINE1
Insulin recombinant:Novolin R (Novo Nordisk):LRP2,IGFBP7,SYTL4,INSR,IGF1R,RB1,CTSD,IDE,PCSK2,CPE,PCSK1,NOV
Tenecteplase:TNKase (Genentech Inc):CANX,LRP1,PLG,FGA,PLAUR,SERPINE1,SERPINB2,CLEC3B,KRT8,ANXA2,CALR
Menotropins:Repronex:FSHR,LHCGR
second file with two columns

Quote:
ARFGEF2 CAD
DDEF2 CAD
PSCD3 CAD
PSCD4 CAD
CAMK1 CAD,HT,HT,HT,HT,HT,HT,HT,HT,HT,HT,HT,HT
HSP90AA1 CAD,CAD,CAD,T2D,T2D
KDR CAD,CD,CD
VEGF CAD,CAD,CAD,CAD,T2D,T2D,T2D
CTNNA3 CAD,HT,T2D
PTPRM CAD,T2D
RAC2 CAD,CAD,T1D,T1D
SMAD3 CAD,T2D,T2D,T2D,T2D,T2D,T2D,T2D
SORBS1 CAD,CAD,CAD
CD36 CAD
IRS1 CAD,CAD,CAD
IRS2 CAD,CAD,CAD,CAD
MTFMT CAD,CAD,CAD,T1D,T1D,T1D
SARS CAD
GNPDA2 CAD
NANS CAD
SRD5A1 CAD


I have to compare column 1 of second file and colmn 3 of first file and
then if there is something similar then write colmn2 of second file in front of it and column 1 and colmn2 of first file in front of it

so tht result shuld be

if any entry in colmn3 of first file=column1 of second file
then, colun2 of second file and column 1 and column2 of first file should be written in front of it

for example:

Quote:
SARS CD,HT Refuldan Lepritdun
# 2  
Old 07-28-2012
what you required is not clear pls specify how you want below data
Code:
SARS CD,HT Refuldan Lepritdun

and more example for output
# 3  
Old 07-28-2012
Your samples seem mismatched, but I think this is what you were intending.

Code:
awk '
    NR == FNR {
        hash[$1] = $2;
        next;
    }

    {
        split( $0, a, ":" );
        n = split( a[3], b, "," );
        for( i = 1; i <= n; i++ )
        {
            if( b[i] in hash )
            {
                printf( "%s %s %s %s\n", b[i], hash[b[i]], a[2], a[1] );
                next;
            }
        }
    }
'  file2  file1

Note that the second file is placed before the first as the programme needs to read the second file before reading the first.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Compare files and share output from both files

hi all, Thanks to all for your great help... I have a scenario that I have two files (file1 & file2). I need to compare two files entire row by row and share the output if any discrepancies within two files. File1: DB1|TB1|C1,C3 DB2|TB2|C1,C2 DB3|TB3|C1,C2,C3,C4 File2: ... (2 Replies)
Discussion started by: Selva_2507
2 Replies

2. Shell Programming and Scripting

Compare multiple files, and extract items that are common to ALL files only

I have this code awk 'NR==FNR{a=$1;next} a' file1 file2 which does what I need it to do, but for only two files. I want to make it so that I can have multiple files (for example 30) and the code will return only the items that are in every single one of those files and ignore the ones... (7 Replies)
Discussion started by: castrojc
7 Replies

3. Shell Programming and Scripting

Compare two files, then overwrite first file with only that in both files

I want to compare two files, and search for items that are in both. Then override the first file with that containing only elements which were in both files. I imagine something with diff, but not sure. File 1 One Two Three Four Five File 2 One Three Four Six Eight (2 Replies)
Discussion started by: castrojc
2 Replies

4. Shell Programming and Scripting

Compare files

Please help me with awk.I have two files with the below details file1 123456789 2012 987654321 2011 a1234567892012 a1234abcde2012 b1234567892012 c1234567892012 98765a12342012 file2 a1234 01234 b1234 33333 I need to check whether the items in file2 is present in file1 .If it is... (2 Replies)
Discussion started by: Mary James
2 Replies

5. Shell Programming and Scripting

Require compare command to compare 4 files

I have four files, I need to compare these files together. As such i know "sdiff and comm" commands but these commands compare 2 files together. If I use sdiff command then i have to compare each file with other which will increase the codes. Please suggest if you know some commands whcih can... (6 Replies)
Discussion started by: nehashine
6 Replies

6. Shell Programming and Scripting

Compare 2 folders to find several missing files among huge amounts of files.

Hi, all: I've got two folders, say, "folder1" and "folder2". Under each, there are thousands of files. It's quite obvious that there are some files missing in each. I just would like to find them. I believe this can be done by "diff" command. However, if I change the above question a... (1 Reply)
Discussion started by: jiapei100
1 Replies

7. Shell Programming and Scripting

How to compare 2 files & get only few columns based on a condition related to both files?

Hiiiii friends I have 2 files which contains huge data & few lines of it are as shown below File1: b.dat(which has 21 columns) SSR 1976 8 12 13 10 44.00 39.0700 70.7800 7.0 0 0.00 0 2.78 0.00 0.00 0 0.00 2.78 0 NULL ISC 1976 8 12 22 32 37.39 36.2942 70.7338... (6 Replies)
Discussion started by: reva
6 Replies

8. Shell Programming and Scripting

compare files in two directories and output changed files to third directory

I have searched about 30 threads, a load of Google pages and cannot find what I am looking for. I have some of the parts but not the whole. I cannot seem to get the puzzle fit together. I have three folders, two of which contain different versions of multiple files, dist/file1.php dist/file2.php... (4 Replies)
Discussion started by: bkeep
4 Replies

9. Shell Programming and Scripting

compare two files and to remove the matching lines on both the files

I have two files and need to compare the two files and to remove the matching lines from both the files (4 Replies)
Discussion started by: shellscripter
4 Replies

10. Shell Programming and Scripting

compare two files

I have file1 and file2: file1: 11 xxx kksd ... 22 kkk kdsglg... 33 sss kdfjdksa... 44 kdsf dskjfkas ... hh kdkf kdkkd.. jg dkf dfkdk ... ... file2: jg 22 hh ... I need to check each line of file1. if the field one is in file2, I will keep it; if not, the whole line will be... (17 Replies)
Discussion started by: fredao
17 Replies
Login or Register to Ask a Question