You could use the 'grep -v word ' or 'nawk '$0 !~ /word/' to filter files, redirect output into another files, and after complete filtering, compare files.
That is for word Tran and file fl1 you would use comands:
grep -v Tran fl1 > fltr_fl
or
nawk '$0 !~ /Tran/' fl1 > fltr_fl
For "tokenize" I would use the nawk -F[ ' {if ($0 ~ "Credit") { sub($1,"",$0);} print }' fl1 (should be remuwed else first '[' after that, maybe someone else will help on that)
I would put filtering commands in script and on end of that run the diff on filtered files.
Also the filtering the lines with worlds I would do in : for wrd in ...all words...; do .. done
So, it would be this way:
Code:
words="Tran Loc Addr Charge"
for wrd im $words ; do
nawk -v chk=#wrd '$0 !~ chk' in_fl >tmp;
cp tmp in_fl;
done