10-13-2010
Count and print all repeating words in a line
Gurus,
I have a file containing lines like this :
Quote:
GO:0005874 GO:0005634 GO:0007067 GO:0000778 GO:0005876 GO:0005874
GO:0005938 GO:0000776 GO:0007067 GO:0000092 GO:0007067
GO:0043332 GO:0008017 GO:0005737 GO:0008017 GO:0051301 GO:0005737
GO:0005874 GO:0007067 GO:0000555 GO:0005123
Now, number of words in each line varies. My need is, if a word repeats in a line get it printed. Also total number of repeats.
So, the output would be :
Quote:
GO:0005874 1
GO:0007067 1
GO:0008017 GO:0005737 2
Any help would be highly appreciated.
Thanks & Regards
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
hi
i have a string like
str=abc def ghi jkl
now i want to count the no of words in the string
please help (7 Replies)
Discussion started by: satish@123
7 Replies
2. Shell Programming and Scripting
can u help me out to print last two words of each sentence of a file.
for example.
contents of input file:
i love songs
my favourite songs
sent
songs all kind
good buddy
Ouput file should contain:
love songs
favourite songs
sent
all kind
good buddy (5 Replies)
Discussion started by: pradeepreddy
5 Replies
3. Shell Programming and Scripting
hi
i have a line
"abc,def,ghi,abc,def ,ghi,abc,def,ghi,abc,def ,ghi,abc,def,ghi,abc"
I want to print the no of words, words separated by comma
please help (3 Replies)
Discussion started by: Satyak
3 Replies
4. Shell Programming and Scripting
Hi, im having a problem with xargs, i want to cout word of each line in file, and i HAVE to use xargs, i tried:
cat file | xargs wc -w .....that uses all words in file like name of files and passed then to wc so it worte wc :somewordformfile is not i afile or directory
cat file | xargs -I{} wc... (3 Replies)
Discussion started by: Qwetek
3 Replies
5. Shell Programming and Scripting
Hi experts
I need to pick 2 matched words from the same line.....
I have given below an example file
eg:
O14757 hsa04110 hsa04115 2 P38398 hsa04120 1
O15111 hsa04010 hsa04210 hsa04920 hsa04620 hsa04660 hsa04662 hsa05200 hsa05212 hsa05221 hsa05220 hsa05215 hsa05222 hsa05120 13 O14920... (4 Replies)
Discussion started by: binnybio
4 Replies
6. Shell Programming and Scripting
HI,
cat test
abc
echo "def" >> test
output is
cat test
abc
def
the needed output is
cat test
abc def
and so on (5 Replies)
Discussion started by: jobycxa
5 Replies
7. UNIX for Advanced & Expert Users
I would like to cut words based on the word count of a line. This over here inspired me with some ideas but I wasn't able to get what I needed.
https://www.unix.com/shell-programming-scripting/105841-count-words-each-line-file-using-xargs.html
If the line has 6 words I would like to use this.... (8 Replies)
Discussion started by: cokedude
8 Replies
8. Shell Programming and Scripting
Hi,
Please suggest a way to print number of words in the end of each line.
<input file>
red aunt house
blue sky
bat and ball game
<output file>
red aunt house 3
blue sky 2
bat and ball game 4
Thanks! (2 Replies)
Discussion started by: mira
2 Replies
9. Shell Programming and Scripting
Hi all,
I have a file in which each string from column 1 is associated with one or multiple strings from column 2. For an example, in the sample input below, Gene1 from column1 is associated with two different strings from column 2 (BP1 and BP2).For every unique string from column 1, I need to... (9 Replies)
Discussion started by: AshwaniSharma09
9 Replies
10. UNIX for Advanced & Expert Users
Hi Folks :)
I have a .txt file with thousands of words. I'm trying to sort the lines in order based on number of words per line.
Example
from:
word
word word word
word word
word word word word
word
word word word
word word
to desired output:
word (2 Replies)
Discussion started by: martinsmith
2 Replies
GREP(1) General Commands Manual GREP(1)
NAME
grep - search a file for a pattern
SYNOPSIS
grep [ option ... ] pattern [ file ... ]
DESCRIPTION
Grep searches the input files (standard input default) for lines (with newlines excluded) that match the pattern, a regular expression as
defined in regexp(6). Normally, each line matching the pattern is `selected', and each selected line is copied to the standard output.
The options are
-c Print only a count of matching lines.
-h Do not print file name tags (headers) with output lines.
-i Ignore alphabetic case distinctions. The implementation folds into lower case all letters in the pattern and input before interpre-
tation. Matched lines are printed in their original form.
-l (ell) Print the names of files with selected lines; don't print the lines.
-L Print the names of files with no selected lines; the converse of -l.
-n Mark each printed line with its line number counted in its file.
-s Produce no output, but return status.
-v Reverse: print lines that do not match the pattern.
Output lines are tagged by file name when there is more than one input file. (To force this tagging, include /dev/null as a file name
argument.)
Care should be taken when using the shell metacharacters $*[^|()= and newline in pattern; it is safest to enclose the entire expression in
single quotes '...'.
SOURCE
/sys/src/cmd/grep.c
SEE ALSO
ed(1), awk(1), sed(1), sam(1), regexp(6)
DIAGNOSTICS
Exit status is null if any lines are selected, or non-null when no lines are selected or an error occurs.
GREP(1)