Compare two files word by word Post: 302676173

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Can a shell script pull the first word (or nth word) off each line of a text file?

Greetings. I am struggling with a shell script to make my life simpler, with a number of practical ways in which it could be used. I want to take a standard text file, and pull the 'n'th word from each line such as the first word from a text file. I'm struggling to see how each line can be...

2. Shell Programming and Scripting

To read data word by word from given file & storing in variables

File having data in following format : file name : file.txt -------------------- 111111;name1 222222;name2 333333;name3 I want to read this file so that I can split these into two paramaters i.e. 111111 & name1 into two different variables(say value1 & value2). i.e val1=11111 &...

3. UNIX for Dummies Questions & Answers

Script to search for a particular word in files and print the word and path name

Hi, i am new to unix shell scripting and i need a script which would search for a particular word in all the files present in a directory. The output should have the word and file path name. For example: "word" "path name". Thanks for the reply in adv,:)

4. Programming

Python: Compare 2 word lists

Hi. I am trying to write a Python programme that compares two different text files which both contain a list of words. Each word has its own line worda wordb wordc I want to compare textfile 2 with textfile 1, and if there's a word in textfile 2 that is NOT in textfile 1, I want to...

5. Shell Programming and Scripting

Find and replace a word in all the files (that contain the word) under a directory

Hi Everyone, I am looking for a simple way for replacing all the files under a directory that use the server "xsgd1234dap" with "xsdr3423pap". For Example: In the Directory, $pwd /home/nick $ grep -l "xsgd1234dap" *.sh | wc -l 119 I have "119" files that are still using...

6. UNIX for Dummies Questions & Answers

Find EXACT word in files, just the word: no prefix, no suffix, no 'similar', just the word

I have a file that has the words I want to find in other files (but lets say I just want to find my words in a single file). Those words are IDs, so if my word is ZZZ4, outputs like aaZZZ4, ZZZ4bb, aaZZZ4bb, ZZ4, ZZZ, ZyZ4, ZZZ4.8 (or anything like that) WON'T BE USEFUL. I need the whole word...

7. Shell Programming and Scripting

Search for a specific word and print only the word from the input file

Hi, I have a sample file as shown below, I am looking for sed or any command which prints the complete word only from the input file. Ex: $ cat "sample.log" I am searching for a word which is present in this file We can do a pattern search using grep but I need to cut only the word which...

8. Shell Programming and Scripting

Find a word and increment the number in the word & save into new files

Hi All, I am looking for a perl/awk/sed command to auto-increment the numbers line in file, P1.tcl: run_build_model sparc_ifu_dec run_drc set_faults -model path_delay -atpg_effectiveness -fault_coverage add_delay_paths P1 set_atpg -abort_limit 1000 run_atpg -ndetects 1000 I would like...

9. UNIX for Beginners Questions & Answers

UNIX script to check word count of each word in file

I am trying to figure out to find word count of each word from my file sample file hi how are you hi are you ok sample out put hi 1 how 1 are 1 you 1 hi 1 are 1 you 1 ok 1 wc -l filename is not helping , i think we will have to split the lines and count and then print and also...

10. UNIX for Beginners Questions & Answers

How to search for a word in column header that fully matches the word not partially in awk?

I have a multicolumn text file with header in the first row like this The headers are stored in an array called . which contains I want to search for each elements of this array from that multicolumn text file. And I am using this awk approach for ii in ${hdr} do gawk -vcol="$ii" -F...

LEARN ABOUT REDHAT

diff

DIFF(1) 							     GNU Tools								   DIFF(1)

NAME

       diff - find differences between two files

SYNOPSIS

       diff [options] from-file to-file

DESCRIPTION

       In  the	simplest  case, diff compares the contents of the two files from-file and to-file.  A file name of - stands for text read from the
       standard input.	As a special case, diff - - compares a copy of standard input to itself.

       If from-file is a directory and to-file is not, diff compares the file in from-file whose file name is that of  to-file,  and  vice  versa.
       The non-directory file must not be -.

       If  both from-file and to-file are directories, diff compares corresponding files in both directories, in alphabetical order; this compari-
       son is not recursive unless the -r or --recursive option is given.  diff never compares the actual contents of a directory as if it were  a
       file.   The  file  that	is fully specified may not be standard input, because standard input is nameless and the notion of ``file with the
       same name'' does not apply.

       diff options begin with -, so normally from-file and to-file may not begin with -.  However, -- as an argument by itself treats the remain-
       ing arguments as file names even if they begin with -.

   Options
       Below  is  a  summary of all of the options that GNU diff accepts.  Most options have two equivalent names, one of which is a single letter
       preceded by -, and the other of which is a long name preceded by --.  Multiple single letter options (unless they take an argument) can	be
       combined  into a single command line word: -ac is equivalent to -a -c.  Long named options can be abbreviated to any unique prefix of their
       name.  Brackets ([ and ]) indicate that an option takes an optional argument.

       -lines Show lines (an integer) lines of context.  This option does not specify an output format by itself; it has no effect  unless  it	is
	      combined with -c or -u.  This option is obsolete.  For proper operation, patch typically needs at least two lines of context.

       -a     Treat all files as text and compare them line-by-line, even if they do not seem to be text.

       -b     Ignore changes in amount of white space.

       -B     Ignore changes that just insert or delete blank lines.

       --brief
	      Report only whether the files differ, not the details of the differences.

       -c     Use the context output format.

       -C lines
       --context[=lines]
	      Use  the	context output format, showing lines (an integer) lines of context, or three if lines is not given.  For proper operation,
	      patch typically needs at least two lines of context.

       --changed-group-format=format
	      Use format to output a line group containing differing lines from both files in if-then-else format.

       -d     Change the algorithm to perhaps find a smaller set of changes.  This makes diff slower (sometimes much slower).

       -D name
	      Make merged if-then-else format output, conditional on the preprocessor macro name.

       -e
       --ed   Make output that is a valid ed script.

       --exclude=pattern
	      When comparing directories, ignore files and subdirectories whose basenames match pattern.

       --exclude-from=file
	      When comparing directories, ignore files and subdirectories whose basenames match any pattern contained in file.

       --expand-tabs
	      Expand tabs to spaces in the output, to preserve the alignment of tabs in the input files.

       -f     Make output that looks vaguely like an ed script but has changes in the order they appear in the file.

       -F regexp
	      In context and unified format, for each hunk of differences, show some of the last preceding line that matches regexp.

       --forward-ed
	      Make output that looks vaguely like an ed script but has changes in the order they appear in the file.

       -h     This option currently has no effect; it is present for Unix compatibility.

       -H     Use heuristics to speed handling of large files that have numerous scattered small changes.

       --horizon-lines=lines
	      Do not discard the last lines lines of the common prefix and the first lines lines of the common suffix.

       -i     Ignore changes in case; consider upper- and lower-case letters equivalent.

       -I regexp
	      Ignore changes that just insert or delete lines that match regexp.

       --ifdef=name
	      Make merged if-then-else format output, conditional on the preprocessor macro name.

       --ignore-all-space
	      Ignore white space when comparing lines.

       --ignore-blank-lines
	      Ignore changes that just insert or delete blank lines.

       --ignore-case
	      Ignore changes in case; consider upper- and lower-case to be the same.

       --ignore-matching-lines=regexp
	      Ignore changes that just insert or delete lines that match regexp.

       --ignore-space-change
	      Ignore changes in amount of white space.

       --initial-tab
	      Output a tab rather than a space before the text of a line in normal or context format.  This causes the alignment of  tabs  in  the
	      line to look normal.

       -l     Pass the output through pr to paginate it.

       -L label
       --label=label
	      Use label instead of the file name in the context format and unified format headers.

       --left-column
	      Print only the left column of two common lines in side by side format.

       --line-format=format
	      Use format to output all input lines in in-then-else format.

       --minimal
	      Change the algorithm to perhaps find a smaller set of changes.  This makes diff slower (sometimes much slower).

       -n     Output RCS-format diffs; like -f except that each command specifies the number of lines affected.

       -N
       --new-file
	      In directory comparison, if a file is found in only one directory, treat it as present but empty in the other directory.

       --new-group-format=format
	      Use format to output a group of lines taken from just the second file in if-then-else format.

       --new-line-format=format
	      Use format to output a line taken from just the second file in if-then-else format.

       --old-group-format=format
	      Use format to output a group of lines taken from just the first file in if-then-else format.

       --old-line-format=format
	      Use format to output a line taken from just the first file in if-then-else format.

       -p     Show which C function each change is in.

       -P     When comparing directories, if a file appears only in the second directory of the two, treat it as present but empty in the other.

       --paginate
	      Pass the output through pr to paginate it.

       -q     Report only whether the files differ, not the details of the differences.

       -r     When comparing directories, recursively compare any subdirectories found.

       --rcs  Output RCS-format diffs; like -f except that each command specifies the number of lines affected.

       --recursive
	      When comparing directories, recursively compare any subdirectories found.

       --report-identical-files
       -s     Report when two files are the same.

       -S file
	      When comparing directories, start with the file file.  This is used for resuming an aborted comparison.

       --from-file=file
	      Compare file to all operands.  file can be a directory.

       --to-file=file
	      Compare all operands to file. file can be a directory.

       --sdiff-merge-assist
	      Print  extra  information  to  help  sdiff.  sdiff uses this option when it runs diff.  This option is not intended for users to use
	      directly.

       --show-c-function
	      Show which C function each change is in.

       --show-function-line=regexp
	      In context and unified format, for each hunk of differences, show some of the last preceding line that matches regexp.

       --side-by-side
	      Use the side by side output format.

       --speed-large-files
	      Use heuristics to speed handling of large files that have numerous scattered small changes.

       --starting-file=file
	      When comparing directories, start with the file file.  This is used for resuming an aborted comparison.

       --suppress-common-lines
	      Do not print common lines in side by side format.

       -t     Expand tabs to spaces in the output, to preserve the alignment of tabs in the input files.

       -T     Output a tab rather than a space before the text of a line in normal or context format.  This causes the alignment of  tabs  in  the
	      line to look normal.

       --text Treat all files as text and compare them line-by-line, even if they do not appear to be text.

       -u     Use the unified output format.

       --unchanged-group-format=format
	      Use format to output a group of common lines taken from both files in if-then-else format.

       --unchanged-line-format=format
	      Use format to output a line common to both files in if-then-else format.

       --unidirectional-new-file
	      When comparing directories, if a file appears only in the second directory of the two, treat it as present but empty in the other.

       -U lines
       --unified[=lines]
	      Use  the	unified output format, showing lines (an integer) lines of context, or three if lines is not given.  For proper operation,
	      patch typically needs at least two lines of context.

       -v
       --version
	      Output the version number of diff.

       -w     Ignore white space when comparing lines.

       -W columns
       --width=columns
	      Use an output width of columns in side by side format.

       -x pattern
	      When comparing directories, ignore files and subdirectories whose basenames match pattern.

       -X file
	      When comparing directories, ignore files and subdirectories whose basenames match any pattern contained in file.

       -y     Use the side by side output format.

SEE ALSO

       cmp(1), comm(1), diff3(1), ed(1), patch(1), pr(1), sdiff(1).

DIAGNOSTICS

       An exit status of 0 means no differences were found, 1 means some differences were found, and 2 means trouble.

GNU Tools							     22sep1993								   DIFF(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Can a shell script pull the first word (or nth word) off each line of a text file?

Discussion started by: tricky

2. Shell Programming and Scripting

To read data word by word from given file & storing in variables

Discussion started by: sjoshi98

3. UNIX for Dummies Questions & Answers

Script to search for a particular word in files and print the word and path name

Discussion started by: virtual_45

4. Programming

Python: Compare 2 word lists

Discussion started by: Bloomy