02-15-2017
Count unique words
Dear all,
I would like to know how to list and count the unique words across thousands of text files.
Please help me out.
Thanks in advance.
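One possible approach, as a minimal sketch rather than a definitive answer. It rests on assumptions the post does not state: the files end in .txt and sit under a single directory tree, a "word" is a run of ASCII letters, and case is ignored. The function name count_unique_words is made up for illustration.

```shell
# Sketch: list every unique word with its number of occurrences,
# most frequent first.
# Assumptions (not from the original post): files are named *.txt,
# a word is a run of ASCII letters, and case does not matter.
count_unique_words() {
    dir=${1:-.}   # directory to scan; defaults to the current one

    # find ... -exec cat {} + batches the file names, so thousands of
    # files do not overflow the command-line length limit.
    find "$dir" -type f -name '*.txt' -exec cat {} + |
        tr -cs 'A-Za-z' '\n' |            # every non-letter run becomes one newline
        tr '[:upper:]' '[:lower:]' |      # fold case so "Apple" and "apple" match
        sed '/^$/d' |                     # drop the empty line a leading non-letter leaves
        sort |                            # uniq only merges adjacent duplicates
        uniq -c |                         # prefix each word with its count
        sort -k1,1nr -k2                  # highest count first, then alphabetical
}
```

Calling count_unique_words /path/to/files prints one "count word" line per unique word, and piping that into wc -l gives the number of unique words. The sort before uniq matters: uniq on unsorted input only collapses neighbouring repeats.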