Using diff for spellchecking


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Using diff for spellchecking
# 1  
Old 10-07-2011
Using diff for spellchecking

Hi,

I am attempting to use diff for the purpose of spell checking. Here is what I have thus far. I am using the Bourne shell (sh).

Code:
diff -i list.txt '/usr/share/dict/words'

I am comparing my list of words (list.txt) with another list (a dictionary).

Here is what is contained in list.txt:
Code:
brown
brown
brown
fjkjk
fox
fox
I
I
I
I
is
is
is
is
the
the
The
this
this
this
this
Thursday
Today

I get output like the following: (just a part of the output)

Code:
> browis
> browless
> browman
3,4c53558,75240
< brown
---
> brown-armed
> brownback
> brown-backed

I thought that the "<" meant that the word was found in my file but not in the dictionary and thus not a correctly spelled word. However, it is doing this for "brown" as well as others later down the list. I also think that the ">" means that the word is in the dictionary but not in my file, meaning just that I can ignore it. I need some way of getting words that aren't in the dictionary using the diff utility.

Thanks for your time
# 2  
Old 10-07-2011
Do you need look for the words in list.txt which are not in dictory?

Code:
awk 'NR==FNR{a[$0];next} ! tolower($1) in a' /usr/share/dict/words list.txt

# 3  
Old 10-07-2011
Quote:
Do you need look for the words in list.txt which are not in dictory?
Yes, this is what I need to do. I need to find the words that are in list.txt, but not in the dictionary. After I find these words, I need to display them and the line number they occur on (in list.txt).

---------- Post updated at 01:58 PM ---------- Previous update was at 01:08 AM ----------

I ended up figuring out my issue. I needed to remove duplicates after sorting my list of words.
# 4  
Old 10-09-2011
export the line number as well.
Code:
awk 'NR==FNR{a[$0];next} ! tolower($1) in a {print NR, $0}' /usr/share/dict/words list.txt

Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Diff 3 files, but diff only their 2nd column

Guys i have 3 files, but i want to compare and diff only the 2nd column path=`/home/whois/doms` for i in `cat domain.tx` do whois $i| sed -n '/Registry Registrant ID:/,/Registrant Email:/p' > $path/$i.registrant whois $i| sed -n '/Registry Admin ID:/,/Admin Email:/p' > $path/$i.admin... (10 Replies)
Discussion started by: kenshinhimura
10 Replies

2. Shell Programming and Scripting

serach diff filename in diff location using shell scripting

Hi, I am new to shell scripting. please help me to find out the solution. I need a script where we need to read the text file(consists of all file names) and get the file names one by one and append the date suffix for each file name as 'yyyymmdd' . Then search each file if exists... (1 Reply)
Discussion started by: Lucky123
1 Replies

3. Shell Programming and Scripting

.procmailrc and uudeview (put attachments from diff senders to diff folders)

Moderator, please, delete this topic (1 Reply)
Discussion started by: optik77
1 Replies

4. UNIX for Dummies Questions & Answers

Diff between more and less

Hi, Can anyone tell me the diff between the two filters "more" and "less"? Many thanks. Regards, Venkat. (4 Replies)
Discussion started by: venkatesht
4 Replies

5. UNIX for Dummies Questions & Answers

Using diff

is there any way to make the diff function compare 1 folder to another instead of just file to file? also, can binary files be compared? (2 Replies)
Discussion started by: puzzler
2 Replies

6. Shell Programming and Scripting

Simulate SVN diff using plain diff

Hi, svn diff does not work very well with 2 local folders, so I am trying to do this diff using diff locally. since there's a bunch of meta files in an svn directory, I want to do a diff that excludes everything EXCEPT *.java files. there seems to be only an --exclude option, so I'm not sure... (3 Replies)
Discussion started by: ackbarr
3 Replies

7. UNIX for Dummies Questions & Answers

diff

hi all, i want to do this shell script. create a script that will check the transferred file vs. orig file. 1. diff the file1 and file2 2. if difference found, retain the original file and email to netcracker team. 3. if no difference found, delete the previous file and retain... (3 Replies)
Discussion started by: tungaw2004
3 Replies

8. Shell Programming and Scripting

Diff b/w $@ and $#

Hello, Pls explain the difference between $# and $@, and how its used in shell scripting . Thanks in advance (4 Replies)
Discussion started by: PradeepRed
4 Replies

9. Shell Programming and Scripting

diff 2 files; output diff's to 3rd file

Hello, I want to compare two files. All records in file 2 that are not in file 1 should be output to file 3. For example: file 1 123 1234 123456 file 2 123 2345 23456 file 3 should have 2345 23456 I have looked at diff, bdiff, cmp, comm, diff3 without any luck! (2 Replies)
Discussion started by: blt123
2 Replies

10. UNIX for Dummies Questions & Answers

diff and ed?

I am trying to use the diff command to find the differences between two txt files. From here, I wish to use the ed command to create the first file from the second file. I am fairly new to unix, and I haven't got a clue how to do this. Can anyone help me please? Cheers (2 Replies)
Discussion started by: Brototype
2 Replies
Login or Register to Ask a Question