How to count number of results found?

03-27-2013

Registered User

23, 0

Join Date: Jul 2012

Last Activity: 7 April 2017, 10:39 AM EDT

Posts: 23

Thanks Given: 28

Thanked 0 Times in 0 Posts

I actually prefer counting the words even if its between other words, so RudiC seems the best option, anyways all the replies helped me a lot, so thanks to all!

---------- Post updated at 06:49 PM ---------- Previous update was at 05:44 PM ----------

Quote:

Originally Posted by RudiC

Using gary_w's files, would this satisfy your needs:

Code:

$ grep -of x1.dat x2.dat| sort |uniq -c
      3 word1
      5 word2
      8 word3

---------- Post updated at 12:50 ---------- Previous update was at 12:44 ----------

That wouldn't help. Remove the space in grep's "$i " parameter...

Unfortunately the grep -o is not installed in my server, and I cant do anything about it.

grep: illegal option -- o

Do you know if its possible to do something similar?

---------- Post updated at 08:58 PM ---------- Previous update was at 06:49 PM ----------

I got the result expected by using the following:

Code:

for i in $(cat x1.dat); do echo "$i ";tr -s ' ' '\n' < x2.dat| grep -c "$i";done

However, the result is coming up like this:

word1
3
word2
5
word3
8

But I expected to be like this:

3 word1
5 word2
8 word3

Can anyone help further?

demmel

View Public Profile for demmel

Find all posts by demmel

03-27-2013

Moderator

3,689, 1,352

Join Date: Jan 2012

Last Activity: 22 August 2020, 11:29 PM EDT

Location: Galactic Empire

Posts: 3,689

Thanks Given: 268

Thanked 1,352 Times in 1,258 Posts

Here is a KSH script using Associative Arrays for counting words:

Code:

#!/bin/ksh

typeset -A word_ARR

while read line
do
        for word in $line
        do
                (( word_ARR[$word]++ ))
        done
done < file.txt

for key in ${!word_ARR[*]}
do
        print ${word_ARR[$key]} $key
done

This User Gave Thanks to Yoda For This Post:

Yoda

View Public Profile for Yoda

Visit Yoda's homepage!

Find all posts by Yoda

03-27-2013

Registered User

23, 0

Join Date: Jul 2012

Last Activity: 7 April 2017, 10:39 AM EDT

Posts: 23

Thanks Given: 28

Thanked 0 Times in 0 Posts

Quote:

Originally Posted by Yoda

Here is a KSH script using Associative Arrays for counting words:

Code:

#!/bin/ksh

typeset -A word_ARR

while read line
do
        for word in $line
        do
                (( word_ARR[$word]++ ))
        done
done < file.txt

for key in ${!word_ARR[*]}
do
        print ${word_ARR[$key]} $key
done

I may be doing something wrong, but I'm unable to get any results from this script.

I ran the same , only replacing the file input name.
Tried in 2 dif envs:

1-$ ./array
./array[3]: typeset: bad option(s)
2-$./array
bash: ./array: /bin/ksh: bad interpreter: No such file or directory

Any clue where is the problem?

demmel

View Public Profile for demmel

Find all posts by demmel

03-28-2013

Registered User

858, 184

Join Date: Mar 2013

Last Activity: 12 May 2013, 11:33 PM EDT

Posts: 858

Thanks Given: 18

Thanked 184 Times in 179 Posts

The problem is that /bin/ksh apparently does not exist on your system.

Please try the following, assuming it runs on your system:

Code:

$ cat file1
word1
word2
word3

Code:

$ cat file2
word1
word2 word3
word2 word3 word3 word4

Code:

$ cat temp.sh
grep -f file1 file2 > good_lines
sed "s/\<[a-zA-Z0-9_]\+\>/&\n/g" good_lines > split_lines
grep -f file1 split_lines | sed "s/^ *//; s/ *$//" > matched_words
sort matched_words | uniq -c

Code:

$ ./temp.sh
      1 word1
      2 word2
      3 word3

I defined a "Word" as the standard [a-zA-Z0-9_].
So this includes "Words" with numbers and underscores.
Alternatively, you could use [a-zA-Z].
Or maybe you want to count "auto-correct" as one word.
In that case, [a-zA-Z-] would work.

This User Gave Thanks to hanson44 For This Post:

hanson44

View Public Profile for hanson44

Find all posts by hanson44

03-28-2013

Moderator

3,689, 1,352

Join Date: Jan 2012

Last Activity: 22 August 2020, 11:29 PM EDT

Location: Galactic Empire

Posts: 3,689

Thanks Given: 268

Thanked 1,352 Times in 1,258 Posts

I forgot to mention that you require KSH93 to support this code.

KSH88 does not support typeset option -a to define arrays.

This User Gave Thanks to Yoda For This Post:

Yoda

View Public Profile for Yoda

Visit Yoda's homepage!

Find all posts by Yoda

03-28-2013

Registered User

23, 0

Join Date: Jul 2012

Last Activity: 7 April 2017, 10:39 AM EDT

Posts: 23

Thanks Given: 28

Thanked 0 Times in 0 Posts

Quote:

Originally Posted by hanson44

The problem is that /bin/ksh apparently does not exist on your system.

Please try the following, assuming it runs on your system:

Code:

$ cat file1
word1
word2
word3

Code:

$ cat file2
word1
word2 word3
word2 word3 word3 word4

Code:

$ cat temp.sh
grep -f file1 file2 > good_lines
sed "s/\<[a-zA-Z0-9_]\+\>/&\n/g" good_lines > split_lines
grep -f file1 split_lines | sed "s/^ *//; s/ *$//" > matched_words
sort matched_words | uniq -c

Code:

$ ./temp.sh
      1 word1
      2 word2
      3 word3

The standard word you defined is great as it is.

I created the temp script but it did not work as expected in one of my systems

Code:

 $ ./temp.sh
sed: Function s/\<[a-zA-Z0-9_]\+\>/& cannot be parsed.

I'm not sure why some sed functions are not functioning/installed here. Any ideas to circumvent this error?

However in my other system the result was as expected, so thanks a lot!

---------- Post updated at 06:59 PM ---------- Previous update was at 06:47 PM ----------

Quote:

Originally Posted by demmel

The standard word you defined is great as it is.

I created the temp script but it did not work as expected in one of my systems

Code:

 $ ./temp.sh
sed: Function s/\<[a-zA-Z0-9_]\+\>/& cannot be parsed.

I'm not sure why some sed functions are not functioning/installed here. Any ideas to circumvent this error?

However in my other system the result was as expected, so thanks a lot!

I was able to prevent the error by using single quotes instead of double quotes, still the result did not come right, see below:

Code:

$ ./temp.sh
   1 word1
   1 word2 word3
   1 word2 word3 word3 word4

This is the content of the file split_lines:

Code:

word1
word2 word3
word2 word3 word3 word4

Any ideas?

Last edited by demmel; 03-28-2013 at 07:09 PM..

demmel

View Public Profile for demmel

Find all posts by demmel

03-28-2013

Registered User

858, 184

Join Date: Mar 2013

Last Activity: 12 May 2013, 11:33 PM EDT

Posts: 858

Thanks Given: 18

Thanked 184 Times in 179 Posts

It's something to do with the sed line. The best way to figure it out is to copy and paste the temp.sh shell script, exactly as it is on your system, and include it with the message. No point in guessing.

hanson44

View Public Profile for hanson44

Find all posts by hanson44

Shell Programming and Scripting

How to count number of results found?

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Grep command to show the number of results

Discussion started by: abdossamad2003

2. Shell Programming and Scripting

Count occurences of a character in a file by sorting results

Discussion started by: destin45

3. Shell Programming and Scripting

awk Help -- If match found return the count

Discussion started by: bbc17484

4. Shell Programming and Scripting

how to add the number of row and count number of rows

Discussion started by: juelillo

5. Shell Programming and Scripting

Awk - Count instances of a number in col1 and put results in a col2 (new) of diff file

Discussion started by: jontjioe

6. Shell Programming and Scripting

found count of them

Discussion started by: Skipper

7. Shell Programming and Scripting

count the number of lines that start with the number

Discussion started by: grajp002

8. UNIX for Dummies Questions & Answers

putting grep -c results number in a variable

Discussion started by: busdude

9. Shell Programming and Scripting

Number count per number ranges

Discussion started by: shirleyeow

10. UNIX for Dummies Questions & Answers

awk | stop after specified number of results

Discussion started by: evan108