UNIX for Beginners Questions & Answers: post 302991682 by imranrasheedamu on Wednesday 15th of February 2017 10:06:48 AM
Count unique words

Dear all,

I would like to know how to list and count the unique words in thousands of text files.

Please help me out.
Thanks in advance.
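
One possible approach is sketched below. It assumes the files end in .txt and sit under a single directory tree, that a "word" is a run of ASCII letters, and that case should be ignored. None of this is stated in the post, and the function names are made up for illustration. Using find with -exec cat {} + avoids the "argument list too long" error that cat *.txt can hit with thousands of files.

```shell
#!/bin/sh
# Sketch only: the function names and the *.txt filter are assumptions,
# not something taken from the thread.

# all_words DIR
# Print every word from the *.txt files under DIR, one per line, lower-cased.
# Each run of non-letters becomes a single newline; empty lines are dropped.
all_words() {
    find "${1:-.}" -type f -name '*.txt' -exec cat {} + |
        tr -cs '[:alpha:]' '[\n*]' |
        tr '[:upper:]' '[:lower:]' |
        grep -v '^$'
}

# list_unique_words DIR
# Print each distinct word once, sorted.
list_unique_words() {
    all_words "$1" | sort -u
}

# count_unique_words DIR
# Print the number of distinct words.
count_unique_words() {
    list_unique_words "$1" | wc -l | tr -d ' '
}

# word_frequencies DIR
# Print each distinct word preceded by its occurrence count, most frequent first.
word_frequencies() {
    all_words "$1" | sort | uniq -c | sort -rn
}
```

For example, count_unique_words /path/to/files prints a single number, and list_unique_words /path/to/files > words.txt saves the list. Text that is not plain ASCII would need a different definition of a word.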
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

how to read all the unique words in a text file

How can I read all the unique words in a file? I used cat comment_file.txt | /usr/xpg6/bin/tr -sc 'A-Za-z' '/012' and cat comment_file.txt | /usr/xpg6/bin/tr -sdc 'A-Za-z' '/012', but they didn't work..... (5 Replies)
Discussion started by: aditya.ece1985
5 Replies

2. Shell Programming and Scripting

Finding the number of unique words in a file

Find the number of unique words in a file using the sort command. (7 Replies)
Discussion started by: abhikamune
7 Replies

3. Shell Programming and Scripting

Shell script to find out words, replace them and count words

Hello, I'd like your help with a bash script which: 1. finds inside the HTML file (it is attached to my post) the code number of the Latest Stable Kernel, 2. finds the link which leads to the download location of the Latest Stable Kernel version, (the right link should lead to the file... (3 Replies)
Discussion started by: alex83
3 Replies

4. Homework & Coursework Questions

unique words in files of folder and its subfolders

Hello, I tried to count all unique words of all files in one folder and its subfolders. Can anybody tell me why this doesn't work: ls| find -d | cat | tr "\ " "\n"| uniq -u | wc -l ??? Cat writes only the names of those files, but not the words that should be in them. Thanks for any advice. ... (9 Replies)
Discussion started by: Dworza
9 Replies
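
A likely explanation for the pipeline in this excerpt: find does not read the output of ls, it simply prints path names, and cat with no arguments copies those names through instead of opening the files. In addition, uniq only compares adjacent lines, so it needs sorted input, and uniq -u prints only the lines that occur exactly once, which is not the same as the set of distinct words. The sketch below shows both readings of "unique"; the function names are hypothetical and words are taken to be whitespace-separated, as in the original tr call.

```shell
#!/bin/sh
# Sketch only: function names are made up for illustration.

# split_words
# Read stdin and print one whitespace-separated word per line.
split_words() {
    tr -s '[:space:]' '[\n*]' | grep -v '^$'
}

# distinct_words DIR
# Every different word in the regular files under DIR (subfolders included).
distinct_words() {
    find "${1:-.}" -type f -exec cat {} + | split_words | sort -u
}

# singleton_words DIR
# Only the words that occur exactly once overall, which is what
# "uniq -u" reports once its input has been sorted.
singleton_words() {
    find "${1:-.}" -type f -exec cat {} + | split_words | sort | uniq -u
}
```

Piping either function into wc -l gives the corresponding count.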

5. Shell Programming and Scripting

display unique words.

I have a file with duplicate words; how can I eliminate them? Input: ant,bat bat,cat. I tried: cat a.txt | grep -bat | awk '{print $1}'. Expected output: ant,bat,cat. How can I display the output as ant,bat,cat on a single line with no duplicates? (2 Replies)
Discussion started by: shikshavarma
2 Replies
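
A minimal sketch for this excerpt, assuming a.txt holds comma-separated words on one or more lines (for instance ant,bat on the first line and bat,cat on the second). The function name is hypothetical. Words keep the order in which they first appear.

```shell
#!/bin/sh
# Sketch only: join_unique_words is a made-up name.

# join_unique_words FILE
# Split FILE on commas, keep the first occurrence of each word,
# and join the survivors back into one comma-separated line.
join_unique_words() {
    tr ',' '\n' < "$1" | awk 'NF && !seen[$0]++' | paste -s -d ',' -
}
```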

6. Shell Programming and Scripting

Unique words in each line

In each row there could be repetition of a word. I want to delete all repetitions and keep unique occurrences. Example: a+b+c ab+c ab+c abbb+c ab+bbc a+bbbc aaa aaa aaa Output: a+b+c ab+c abbb+c ab+bbc a+bbbc aaa (6 Replies)
Discussion started by: Viernes
6 Replies

7. Shell Programming and Scripting

awk to count using each unique value

I'm looking for an awk script that will take the unique values in column 5, then print and count the unique values in column 6. CA001011500 11111 11111 -9999 201301 AAA CA001012040 11111 11111 -9999 201301 AAA CA001012573 11111 11111 -9999 201301 BBB CA001012710 11111 11111 -9999 201301... (4 Replies)
Discussion started by: ncwxpanther
4 Replies

8. Shell Programming and Scripting

How count the number of two words associated with the two words occurring in the file?

Hi, I need to count the number of errors associated with two words occurring in the file. It's about counting the occurrences of the word "error" where the word "index.js" also appears. What should the command look like? Please kindly help. I was trying: grep "error" log.txt | wc -l (1 Reply)
Discussion started by: jmarx
1 Reply
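
A hedged sketch for this excerpt, assuming each log entry is one line and that "associated" means both words appear on the same line. The function names and argument order are made up. Note that grep -c counts matching lines, not individual occurrences; the second function uses grep -o, which GNU and BSD grep provide but POSIX does not require.

```shell
#!/bin/sh
# Sketch only: function names and argument order are assumptions.

# count_lines_with_both FILE WORD1 WORD2
# Number of lines in FILE that contain both fixed strings.
count_lines_with_both() {
    grep -F -- "$2" "$1" | grep -c -F -- "$3"
}

# count_occurrences_on_lines FILE WORD1 WORD2
# Number of times WORD2 occurs on the lines that contain WORD1.
count_occurrences_on_lines() {
    grep -F -- "$2" "$1" | grep -o -F -- "$3" | wc -l | tr -d ' '
}
```

With the names from the excerpt, the call would be count_lines_with_both log.txt index.js error.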

9. Shell Programming and Scripting

Count occurrence of column one unique value having unique second column value

Hello Team, I need your help on the following: My input file a.txt is as below: 3330690|373846|108471 3330690|373846|108471 0640829|459725|100001 0640829|459725|100001 3330690|373847|108471 Here row 1 and row 2 of column 1 are identical but corresponding column 2 value are... (4 Replies)
Discussion started by: angshuman
4 Replies

10. Shell Programming and Scripting

Regex to identify unique words in a dictionary database

Hello, I have a dictionary which I am building for the Open Source Community. The data structure is as under HEADWORD=PARTOFSPEECH=ENGLISH MEANING as shown in the example below अ=m=Prefix signifying negation. अँहँ=ind=Interjection expressing disapprobation. अं=int=An interjection... (2 Replies)
Discussion started by: gimley
2 Replies
spell(1)						      General Commands Manual							  spell(1)

Name
       spell, spellin, spellout - check text for spelling errors

Syntax
       spell [-v] [-b] [-x] [-d hlist] [+local-file] [-s hstop] [-h spellhist] [file...]

       spellin [list]

       spellout [-d] list

Description
       The spell command collects words from the named documents, and looks them up in a spelling list.  Words that are not on the spelling list
       and are not derivable from words on the list (by applying certain inflections, prefixes or suffixes) are printed on the standard output.
       If no files are specified, words are collected from the standard input.
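
The lookup described above can be approximated with standard tools. The sketch below is not how spell is implemented: it swaps the hashed list for a plain text word list (one word per line), ignores derivations such as prefixes and suffixes, and uses a made-up function name. Like spell, it reads the standard input when no files are named.

```shell
#!/bin/sh
# Sketch only: unknown_words is a hypothetical helper, not part of spell.

# unknown_words WORDLIST [FILE...]
# Print each distinct word from the FILEs (or stdin) that does not
# appear, ignoring case, as a whole line in WORDLIST.
unknown_words() {
    wordlist=$1
    shift
    cat "$@" |
        tr -cs '[:alpha:]' '[\n*]' |
        tr '[:upper:]' '[:lower:]' |
        grep -v '^$' |
        sort -u |
        grep -v -i -x -F -f "$wordlist"
}
```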

       The spell command ignores most troff, tbl, and eqn constructions.

       Two routines help maintain the hash lists used by spell.  Both expect a set of words, one per line, from the standard input.  The spellin
       command combines the words from the standard input and the preexisting list file and places a new list on the standard output.  If no
       list file is specified, a new list is generated.  The spellout command looks up each word from the standard input and prints on the
       standard output those that are missing from (or present on, with option -d) the hashed list file.  For example, to verify that hookey
       is not on the default spelling list, add it to your own private list, and then use it with spell:

              echo hookey | spellout /usr/dict/hlista
              echo hookey | spellin /usr/dict/hlista > myhlist
              spell -d myhlist <filename>

Options
       -v	      Displays words not found in spelling list with all plausible derivations from spelling list.

       -b	      Checks  data  according  to British spelling.  Besides preferring centre, colour, speciality, travelled, this option insists
		      upon -ise instead of -ize in words like standardise.

       -x	      Precedes each word with an equal sign (=) and displays all plausible derivations.

       -d hlist       Specifies the file used for the spelling list.

       -h spellhist   Specifies the file used as the history file.

       -s hstop       Specifies the file used for the stop list.

       +local-file    Removes words found in local-file from the output of the spell command.  The argument local-file is the name of a file
                      provided by the user that contains a sorted list of words, one per line.  With this option, the user can specify a list
                      of words for a particular job that are spelled correctly.

       The auxiliary files used for the spelling list, stop list, and history file may be specified by arguments following  the  -d,  -s,  and	-h
       options.   The  default files are indicated below.  Copies of all output may be accumulated in the history file.  The stop list filters out
       misspellings (for example, thier=thy-y+ier) that would otherwise pass.

Restrictions
       The coverage of the spelling list is uneven; new installations will probably wish to monitor the output for several months to gather  local
       additions.

       The spell command works only with ASCII text files.

Files
       /usr/dict/hlist[ab]  hashed spelling lists, American & British, default for -d
       /usr/dict/hstop      hashed stop list, default for -s
       /dev/null            history file, default for -h
       /tmp/spell.$$*       temporary files
       /usr/lib/spell

See Also
       deroff(1), sed(1), sort(1), tee(1)
