Compare fields in files


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Compare fields in files
# 1  
Old 02-05-2012
Compare fields in files

Hi,

I need the most efficient way of comparing the following and arriving at the result

I have a file which has entries like,

File1:

1|2|5|7|8|2|3|6|3|1

File2:

1|2|3|1|2|7|9|2

I need to compare the entries in these two file with those of a general file,

1|2|3|5|2|5|6|9|3|1 ---> constant file

My need is to calculate the similar entries between file1 and constant file and the same for file2 and so on.
Only matching.

In my case,
the compare with file1 and constant file will account to: "2"
similarly for file2 it will be: "4"

Please help me to implement this in an easy and effective way.

thanks.
# 2  
Old 02-05-2012
Quote:
the compare with file1 and constant file will account to: "2"
Why this result?

Regards,
Birei
# 3  
Old 02-05-2012
The result seems to be the number of fields (pipe separated) that match the constant file. The question I have is: is there just one record in the constant file, or multiple records?
# 4  
Old 02-05-2012
Assuming agama's assumption is correct.
The constant file is like this
Code:
1|2|4|6|5|7|2|2|9|8

Here is the code:
Code:
paste constant file1 file2 |
awk '
BEGIN {
        FS="\t"
        sep="|"
}
{
        nc=split($1, c, sep)
        nf1=split($2, f1, sep)
        nf2=split($3, f2, sep)

        k=0
        for (i=1; i<=nf1; ++i ) {
                if ( f1[i] == c[i] ) { k+=1 }
        }
        print "File1:", k

        k=0
        for (i=1; i<=nf2; ++i ) {
                if ( f2[i] == c[i] ) { k+=1 }
        }
        print "File2:", k

}'

# 5  
Old 02-06-2012
Is there multiple files like file1,2,3 and so on?
Is there a way to do this in python.

Thanks in advance,
Uma
# 6  
Old 02-06-2012
Assuming - as per your example - that all your files contain 1 line :

Code:
$ cat f1
1|2|5|7|8|2|3|6|3|1
$ cat f2
1|2|3|1|2|7|9|6
$ cat f3
1|0|5|0|8|2|1|1|1
$ nawk -F\| 'NR==FNR{n=NF;split($0,A,"\|");next}FNR==1{d=0}{for(i=0;++i<=n;)if($i==A[i]) ++d;print d}' f*
3
4
$ nawk -F\| 'NR==FNR{n=NF;split($0,A,"\|");next}FNR==1{d=0}{for(i=0;++i<=n;)if($i==A[i]) ++d;print d}' f1 f2 f3
3
4
$

This User Gave Thanks to ctsgnb For This Post:
# 7  
Old 02-06-2012
Is f1 - constant here?

I think the need is to compare f1 with f2,f3,f4 and so on.. and get the matched count in R2,R3 and so on.

f1 -> 1|2|3|4
f2 -> 1|2|3|1
f3 -> 1|3|3|2

so the result must be,

R2 -> 3 (as 1,2, and 3 matches)
R4 -> 2 (as 1 and 3 matches)

Am i correct here!
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Is there a UNIX command that can compare fields of files with differing number of fields?

Hi, Below are the sample files. x.txt is from an Excel file that is a list of users from Windows and y.txt is a list of database account. $ head -500 x.txt y.txt ==> x.txt <== TEST01 APP_USER_PROFILE USER03 APP_USER_PROFILE TEST02 APP_USER_EXP_PROFILE TEST04 APP_USER_PROFILE USER01 ... (3 Replies)
Discussion started by: newbie_01
3 Replies

2. Shell Programming and Scripting

Compare fields in two files

Hi, I am trying to check two files based on certain string and field. cat f1 source=\GREP\" hi this \\ source=\SED\" skdmsmd dnksdns source=\PERL\" cat f2 source=\SED\" source=\GREP\" vlamskds amdksk m source=\AWK\" awk \here\" (3 Replies)
Discussion started by: greet_sed
3 Replies

3. UNIX for Dummies Questions & Answers

Compare 2 fields in 2 files

I am trying to compare two files (separted by a pipe) using 2 fields (field 1,3 from fileA and 1,2 from fileB) if the two files match i want the whole record of fileA adding the extra fields left from fileB. 1. A.txt cat|floffy|12|anything|anythings cat|kitty|15|lala|lalala... (6 Replies)
Discussion started by: sabercats
6 Replies

4. Shell Programming and Scripting

Compare three files based on two fields

Guys, I tried searching on the internet and I couldn't get the answer for this problem. I have 3 files. First 2 fields of all of them are of same type, say they come from various databases but first two fields in the 3 files means the same. I need to verify the entries that are not present... (4 Replies)
Discussion started by: PikK45
4 Replies

5. Shell Programming and Scripting

Compare two fields in text files?

Hi, I have two text files, compare column one in both the files and if it matches then the output should contain the id in column one, the number and the description. Both the files are sorted. Is there a one liner to get this done, kindly help. Thank you File 1: NC_000964 92.33 ... (2 Replies)
Discussion started by: pulikoti
2 Replies

6. Shell Programming and Scripting

compare fields in different files

HI I'm having some troubles to compare and permut diffrent fields indexed with another filed like the following example `: file1 1 1 2 2 3 3 file2 7 1 9 2 10 3 result------------------- (6 Replies)
Discussion started by: yassinegoth
6 Replies

7. Shell Programming and Scripting

Compare files with fields separated with semicolon

Dear experts I have files like ABD : 5869 events, relative ratio : 1.173800E-01 , sum of ratios : 1.173800E-01 VBD : 12147 events, relative ratio : 2.429400E-01 , sum of ratios : 3.603200E-01 SDF : 17000 events, relative ratio : 3.400000E-01 , sum of ratios : 7.003200E-01 OIP: 14984... (9 Replies)
Discussion started by: Alkass
9 Replies

8. Shell Programming and Scripting

AWK Compare files, different fields, output

Hi All, Looking for a quick AWK script to output some differences between two files. FILE1 device1 1.1.1.1 PINGS device1 2.2.2.2 PINGS FILE2 2862 SITE1 device1-prod 1.1.1.1 icmp - 0 ... (4 Replies)
Discussion started by: stacky69
4 Replies

9. Shell Programming and Scripting

Compare fields in 2 files using AWK

Hi unix gurus, I have a urgent requirement, I need to write a AWK script to compare each fields in 2 files using AWK. Basically my output should be like this. file1 row|num1|num2|num3 1|one|two|three 2|one|two|three file2 row|num1|num2|num3 1|one|two|three 2|one|two|four ... (5 Replies)
Discussion started by: rashmisb
5 Replies

10. Shell Programming and Scripting

Compare two files based on values of fields.

Hi All, I have two files and data looks like this: File1 Contents #Field1,Field2 Dist_Center_file1.txt;21 Dist_Center_file3.txt;20 Dist_Center_file2.txt;20 File2 Contents (*** No Header ***) Dist_Center_file1.txt;23 Dist_Center_file2.txt;20 Dist_Center_file3.txt;20 I have... (4 Replies)
Discussion started by: Hangman2
4 Replies
Login or Register to Ask a Question