Compare files using awk


 
# 1  
Old 06-04-2012

Please help me compare two files and remove from file1 the rows whose first field appears in file2.

file1 is delimited using pipe (|).

file1
Code:
 
00012|Description - 1|||||AA12345|1|AB12345|2|2012/06/03
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
11111|Description - 3|||||AP00012|1|AB12345|2|2012/06/03
ABCDE|Description,description - 4|||||PA12345|10|AB12345|20|2012/06/03

file2
Code:
 
11111
o1234
00012

Expected output

output
Code:
 
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
11111|Description - 3|||||AP00012|1|AB12345|2|2012/06/03
ABCDE|Description,description - 4|||||PA12345|10|AB12345|20|2012/06/03

Here the output does not contain the first row of file1 (the row starting with 00012), since file2 contains 00012. The comparison should happen only between the first field of file1 and the items in file2. So even though 00012 also appears inside the 3rd row (as part of AP00012), that does not cause the row to be removed.

Also, is there any way to direct the removed rows/items to another output file?
# 2  
Old 06-04-2012
But how come the line starting with "11111" in file1 is not removed?
This number is present in both files, so it should be removed, right?
# 3  
Old 06-04-2012
It depends on what you want to select. If you want only those rows of file1 where field 1 is in file2, then use:

Code:
awk 'BEGIN{FS="|"}NR==FNR{a[$1]=$1;next}a[$1]' file2 file1 >selected_lines

If you want to drop the rows of file1 where field 1 is in file2 (keeping only the rows where field 1 is not in file2), then use:

Code:
awk 'BEGIN{FS="|"}NR==FNR{a[$1]=$1;next}!a[$1]' file2 file1 >selected_lines
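
On the second part of the original question (sending the removed rows to another file), here is a minimal sketch of one way to do both in a single pass. It is not part of the commands above: the output names kept_rows and removed_rows and the scratch directory are assumptions made for illustration, and the sample data is copied from post #5 of this thread. It tests membership with `$1 in keys` instead of the stored value, so an empty or zero-valued key would still count as a match.

```shell
# Work in a scratch directory so no existing file1/file2 is overwritten.
workdir=$(mktemp -d) && cd "$workdir"

# Sample data from this thread (post #5), recreated so the sketch runs on its own.
cat > file1 <<'EOF'
00012|Description - 1|||||AA12345|1|AB12345|2|2012/06/03
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
11111|Description - 3|||||AP00012|1|AB12345|2|2012/06/03
ABCDE|Description,description - 4|||||PA12345|10|AB00012|20|2012/06/03
EOF
cat > file2 <<'EOF'
11111
o1234
00012
EOF

# kept_rows and removed_rows are hypothetical output names.
: > kept_rows; : > removed_rows           # make sure both files exist, even if empty
awk -F'|' -v kept=kept_rows -v removed=removed_rows '
    NR == FNR  { keys[$1]; next }             # first file (file2): remember each key
    $1 in keys { print > (removed); next }    # field 1 is listed in file2: removed row
               { print > (kept) }             # every other row of file1 is kept
' file2 file1
```

Note that the `NR == FNR` idiom assumes file2 is not empty; with an empty file2, awk would treat file1 as the first file.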

# 4  
Old 06-04-2012
Code:
[root@ jun4]# cat file1
00012|Description - 1|||||AA12345|1|AB12345|2|2012/06/03
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
11111|Description - 3|||||AP00012|1|AB12345|2|2012/06/03
ABCDE|Description,description - 4|||||PA12345|10|AB12345|20|2012/06/03
[root@ jun4]# cat file2
11111
o1234
00012
[root@jun4]# ./com.sh
[root@jun4]# cat file1
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
ABCDE|Description,description - 4|||||PA12345|10|AB12345|20|2012/06/03

my primitive code :-p

Code:
#!/bin/bash
# Rows of file1 whose first field is listed in file2 go to file3; the rest stay in file1.
: >file1.new                            # collects the rows to keep
while IFS= read -r line1
do
        first=${line1%%|*}              # first pipe-delimited field
        matched=no
        while IFS= read -r line2
        do
                if [[ "$first" == "$line2" ]]; then
                        matched=yes
                        break
                fi
        done <file2
        if [[ "$matched" == yes ]]; then
                printf '%s\n' "$line1" >>file3  # redirecting deleted line to new file: file3
        else
                printf '%s\n' "$line1" >>file1.new
        fi
done <file1
mv file1.new file1                      # replace file1 only after it has been read completely

# 5  
Old 06-05-2012
Thanks Vivek, it was my mistake. 11111 should be removed from the output, as it is present in file2.

file1

Code:
 
00012|Description - 1|||||AA12345|1|AB12345|2|2012/06/03
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
11111|Description - 3|||||AP00012|1|AB12345|2|2012/06/03
ABCDE|Description,description - 4|||||PA12345|10|AB00012|20|2012/06/03

file2

Code:
 
11111
o1234
00012

Expected output

output

Code:
 
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
ABCDE|Description,description - 4|||||PA12345|10|AB00012|20|2012/06/03

# 6  
Old 06-05-2012
oh okay.. :-)
# 7  
Old 06-09-2012
Quote:
Originally Posted by sdf
It depends on what you want to select. If you want only those rows of file1 where field 1 is in file2, then use:

Code:
awk 'BEGIN{FS="|"}NR==FNR{a[$1]=$1;next}a[$1]' file2 file1 >selected_lines

If you want to drop the rows of file1 where field 1 is in file2 (keeping only the rows where field 1 is not in file2), then use:

Code:
awk 'BEGIN{FS="|"}NR==FNR{a[$1]=$1;next}!a[$1]' file2 file1 >selected_lines

I used the code below:
Code:
 
awk 'BEGIN{FS="|"}NR==FNR{a[$1]=$1;next}!a[$1]'

This gives me the error

Code:
 
a[$1]': Event not found

I changed the code by adding '\' before the '!':
Code:
 
nawk 'BEGIN{FS="|"}NR==FNR{a[$1]=$1;next}\!a[$1]'

It worked. Hope this is correct. Thanks to sdf and Vivek.
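
The "Event not found" message looks like csh/tcsh history expansion, which treats `!` specially even inside single quotes, so escaping it as `\!` is a reasonable fix in that shell. Another option, sketched here under that assumption, is to keep the awk program in a file so the shell never sees the `!`. The file name drop_listed.awk and the scratch directory are hypothetical; the here-documents are only there so the sketch runs as written under sh or bash, and in csh/tcsh the program file could simply be created in an editor.

```shell
# Work in a scratch directory so no existing files are overwritten.
workdir=$(mktemp -d) && cd "$workdir"

# Sample data from this thread (post #5).
cat > file1 <<'EOF'
00012|Description - 1|||||AA12345|1|AB12345|2|2012/06/03
AB123|Description - 2|||||AA12345|3|ZA11111|4|2012/06/04
11111|Description - 3|||||AP00012|1|AB12345|2|2012/06/03
ABCDE|Description,description - 4|||||PA12345|10|AB00012|20|2012/06/03
EOF
cat > file2 <<'EOF'
11111
o1234
00012
EOF

# drop_listed.awk is a hypothetical name for the program file.
cat > drop_listed.awk <<'EOF'
BEGIN { FS = "|" }
NR == FNR { a[$1] = $1; next }   # first file (file2): store each key
!a[$1]                           # second file (file1): print rows whose field 1 was not stored
EOF

# No "!" appears on the command line, so there is nothing to escape.
awk -f drop_listed.awk file2 file1 > selected_lines
```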