![]() |
Hello and Welcome from United States to the UNIX and Linux Forums! Thank You for Visiting and Joining Our Global Community.
|
|
google unix.com
|
|||||||
| Forums | Register | Forum Rules | Links | Albums | FAQ | Members List | Calendar | Search | Today's Posts | Mark Forums Read |
| Shell Programming and Scripting Post questions about KSH, CSH, SH, BASH, PERL, PHP, SED, AWK and OTHER shell scripts and shell scripting languages here. |
More UNIX and Linux Forum Topics You Might Find Helpful
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Sed Help in Updating something only in one particular file. | bisla.yogender | Shell Programming and Scripting | 4 | 03-24-2008 11:48 AM |
| TO know whether file is updating or not | raj333 | Shell Programming and Scripting | 2 | 11-05-2007 05:29 PM |
| Updating file in loop | braindrain | Shell Programming and Scripting | 4 | 06-20-2007 01:37 PM |
| Why my sar is not updating the output file. | skneeli | UNIX for Advanced & Expert Users | 1 | 11-12-2006 08:29 PM |
| Updating a File in a Zip Archive | dbridle | AIX | 6 | 09-27-2006 03:29 PM |
![]() |
|
|
LinkBack | Thread Tools | Search this Thread | Rate Thread | Display Modes |
|
|
|
||||
|
awk updating one file with another, comparing, updating
Hello,
I read and search through this wonderful forum and tried different approaches but it seems I lack some knowledge and neurones ^^ Here is what I'm trying to achieve : file1: test filea 3495; test fileb 4578; test filec 7689; test filey 9978; test filez 12300; file2: test filea 3495; test filed 4578; test filec 7689; test filex 8978; results: test filea 3495; test filed 4578; test filec 7689; test filex 8978; test filey 9978; test filez 12300; comparison in based on last field (field $3), new content from file2 (here content with "key" 8978 is new) should be added to final output and content that is different in file2 (test filed 4578; here) should replace file1 one. here is where I am now: awk 'NF { key=$NF;keys[key]++ } NR == FNR { key1[key] = $NF ORS;rec1[key] = $0 ORS;next } { key2[key] = $NF ORS;rec2[key] = $0 ORS;next } END { for (k in keys) { if (key1[k] == key2[k]) { print rec2[k] } else { print rec1[k] } } }' $file1 $file2 > $file1.updated for readability: awk ' NF { key=$NF; keys[key]++ } NR == FNR { key1[key] = $NF ORS; rec1[key] = $0 ORS; next } { key2[key] = $NF ORS; rec2[key] = $0 ORS; next } END { for (k in keys) { if (key1[k] == key2[k]) { print rec2[k] } else { print rec1[k] } } }' $file1 $file2 > $file1.updated but.. this doesn't work well :/ |
|
||||
|
oh my....
![]() thanks a lot ! I thought the solution was something like store keys from file1, iterate them on file2, then reverse the iteration to find missing records... I was far far away from the beauty of awk... if I understand correctly, awk reads the two files and automagically merged records itself ? It means that there is no need to store values from file1 to compare them to file2 ? Beautifull... Two things I don't get: the use of the underscore (while i guess it stands for "all read records" ?), and why is END not at the end ? About the sort command wouldn't it fail on the ';' ? Do you know how to specify 'last field' of line with sort ? Or is something like : | awk '{ printf substr($NF, 1, length($NF)-1);$NF = "";printf " %s\n",$0 }' | sort -n | awk '{ printf "%s%s;\n",$0,$1 }' | awk '{$1="";sub(/^ +/, "");printf "%s\n",$0}' preferable ? Thanks a lot again radoulov ^^ |
|
|||||
|
Quote:
of the key ($NF in this case) and the value is alway the last one it sees (the one in file2). It will associate to every key ($NF) the entire record ($0) and it will update the value when it sees the same key. Quote:
it you want the code more readable, you could use this instead (and this is compatible even with the old plain Solaris awk): Code:
awk '{
key_record[$NF] = $0 # associate key ($NF) with entire record ($0)
}
END {
# after the entire input has been read
for (key in key_record) # for every key stored
print key_record[key] # print the associated value
}' file1 file2
Quote:
Quote:
In that case I would go with: Code:
perl -lane'
$h{$F[-1]} = $_;
print join "\n", map $h{$_}, sort {$a <=> $b} keys %h
if eof'
Code:
perl -lane'
chop $F[-1] and $h{$F[-1]} = $_;
print join "\n", map $h{$_}, sort {$a <=> $b} keys %h
if eof'
Code:
read<file;set -- $REPLY;sort -k$#n file |
|
||||
|
Thanks a lot for taking the time to explain all this radoulov ^^ that's really great !
Quote:
27384; 7384; or 384; but I tried so many different things, I guess this should be a remain of some mistypes/mistakes on my side or because of the Windows line endings some files seems to have (some files are created on Windows and some on Unix) ? No the last field is not fixed because I'm on a bash script utility for sql queries files sorting/updating, this have to be used on several different files where the number of fields is not always the same and where the key value can be, rarely but happens, in the middle of the line. So in this case taking a $key arguments from cli: awk 'END{for(k in _)print _[k]}{_[$'"$key"']=$0}' $file1 $file2 > $file1.updated with an additionnal conditional on argument '0' for the end of the line (because I didn't get $key to turn into NF and awk taking '"$key"'). I'm making it for a small community and it has to be really simple. If you're not afraid to read awfull code I can post it ^^ |
|
||||
|
Hi am very new to awk & unix, my requirement is very similar to this..
I want to compare by first column, file1: 0000-00058|Green 0000-00059|Green 0000-00060|Green 0402-01055|Green 0402-01058|Green 0402-01059|Green 0402-01061|Green 0402-01065|Green file2: 0000-00057|Red 0000-00058|Blue 0000-00059|Red 0000-00060|Blue My result should be 0000-00058|Blue 0000-00059|Red 0000-00060|Blue 0402-01055|Green 0402-01058|Green 0402-01059|Green 0402-01061|Green 0402-01065|Green 0000-00057|Red awaiting for your reply... Thank u. |
![]() |
| Bookmarks |
| Tags |
| solaris |
| Thread Tools | Search this Thread |
| Display Modes | Rate This Thread |
|
|