Determining Word Frequency of Specific Terms

03-06-2009

Registered User

135, 0

Join Date: Feb 2009

Last Activity: 10 February 2016, 9:34 PM EST

Posts: 135

Thanks Given: 7

Thanked 0 Times in 0 Posts

Hi, Can we take out:

Total number of SOA records = 30

I only need records showing the below in each db.x

PTR
MX
NS
CNAME
A

The code must be smart to look at tabs/spaces I guess??

a copy of a db.x looks like

;
; THIS FILE IS AUTOMATICALLY GENERATED. DO NOT EDIT IT.
; THIS FILE IS AUTOMATICALLY GENERATED. DO NOT EDIT IT.
; THIS FILE IS AUTOMATICALLY GENERATED. DO NOT EDIT IT.
; THIS FILE IS AUTOMATICALLY GENERATED. DO NOT EDIT IT.
;
; generated from: $Id: master.txt,v 2.1230 2009/01/05 22:29:21 root Exp $
;

$TTL 3600

beerprime.com. IN SOA iqedns1.internet.com. hostmaster.beer.com. (
2009010501 ; Serial
900 ; Refresh
300 ; Retry
1209600 ; Expire
3600 ) ; Minimum
beerprime.com. IN NS iqdns1.internet.com.

integ4 IN A 192.168.205.156
beerprime.com. IN A 192.168.205.175
www IN CNAME intg4.beerprime.com.
86.96.168.192.in-addr.arpa. IN PTR sepapp.beerprime.com

;
; END OF beerprime.com
;

Thanks

richsark

View Public Profile for richsark

Find all posts by richsark

03-06-2009

Registered User

5,690, 630

Join Date: Jan 2007

Last Activity: 9 January 2017, 4:40 AM EST

Location: Варна, България / Milano, Italia

Posts: 5,690

Thanks Given: 184

Thanked 630 Times in 587 Posts

It's smart enough

Try this and let me know if the output is OK:

Code:

awk 'END {
  print f ":"
    for (Z in z)
      printf "Total number of %s records = %d\n", \
      Z, z[Z]
    print RS
    }
FNR == 1 {
  if (f) {
    print f ":"
    for (Z in z)
      printf "Total number of %s records = %d\n", \
      Z, z[Z]
    print RS
    }
    f = FILENAME
  }    
$3 ~ /^(PTR|MX|NS|CNAME|A)$/ { z[$3]++ }' db*

Last edited by radoulov; 03-06-2009 at 08:47 AM.. Reason: corrected $2 -> $3

radoulov

View Public Profile for radoulov

Find all posts by radoulov

03-06-2009

Registered User

135, 0

Join Date: Feb 2009

Last Activity: 10 February 2016, 9:34 PM EST

Posts: 135

Thanks Given: 7

Thanked 0 Times in 0 Posts

Hi !

OK, looks like we got a count issue, see db.beerstearns.com

beerstearns.com. IN SOA iqedns1.internet.com. hostmaster.beer.com. (
2009010501 ; Serial
900 ; Refresh
300 ; Retry
1209600 ; Expire
3600 ) ; Minimum
bearstearns.com. IN NS iqedns1.internet.com.

fbhp IN A 192.168.205.124
futures IN A 192.168.205.165
bigdog IN A 192.168.205.195
bigdog2 IN A 192.168.205.196

; SPECIALS
;
situnifiedportal.bearstearns.com. IN NS whdgss1cnis-pri1.clearco.com.
situnifiedportal.bearstearns.com. IN NS metgss1cnis-sec1.clearco.com.
qa.bearstearns.com. IN NS whdgss1cnis-pri1.clearco.com.
qa.bearstearns.com. IN NS metgss1cnis-sec1.clearco.com.

The output came out as:

db.bearstearns.com:

Total number of CNAME records = 1
Total number of A records = 6
Total number of NS records = 26
Total number of PTR records = 166

There is 4 A records, I dont see CNAME.

I also need a count if it detects the word "Special"
So maybe
Total number of Special records = 4
Sorry, I just noticed that

richsark

View Public Profile for richsark

Find all posts by richsark

03-06-2009

Registered User

5,690, 630

Join Date: Jan 2007

Last Activity: 9 January 2017, 4:40 AM EST

Location: Варна, България / Milano, Italia

Posts: 5,690

Thanks Given: 184

Thanked 630 Times in 587 Posts

You're right, I have to empty the array at the beginning of every file. Try this one:

Code:

awk 'END {
  print f ":"
    for (Z in z)
      printf "Total number of %s records = %d\n", \
      Z, z[Z]
    print RS
    }
FNR == 1 {
  if (f) {
    print f ":"
    for (Z in z)
      printf "Total number of %s records = %d\n", \
      Z, z[Z]
    if (sc) printf "Total number of Special records = %d\n", \
    sc    
    print RS
    split(x, z)
    s = sc = 0
    }
    f = FILENAME
  }    
$3 ~ /^(PTR|MX|NS|CNAME|A)$/ { z[$3]++; s && sc++ }
/SPECIALS/ { s = 1 }' db*

Do you want the special records in the total or you want a separate count for them?
For db.beerstearns.com you want this:

Code:

db.beerstearns.com:
Total number of A records = 4
Total number of NS records = 5
Total number of Special records = 4

Or this:

Code:

db.beerstearns.com:
Total number of A records = 4
Total number of NS records = 1
Total number of Special records = 4

Last edited by radoulov; 03-06-2009 at 09:55 AM.. Reason: corrected again: reset s at the beginning of every file

radoulov

View Public Profile for radoulov

Find all posts by radoulov

03-06-2009

Registered User

135, 0

Join Date: Feb 2009

Last Activity: 10 February 2016, 9:34 PM EST

Posts: 135

Thanks Given: 7

Thanked 0 Times in 0 Posts

Hi, I would like to have it like this:

Or this:

Code:
db.beerstearns.com:
Total number of A records = 4
Total number of NS records = 1
Total number of Special records = 4

richsark

View Public Profile for richsark

Find all posts by richsark

03-06-2009

Registered User

5,690, 630

Join Date: Jan 2007

Last Activity: 9 January 2017, 4:40 AM EST

Location: Варна, България / Milano, Italia

Posts: 5,690

Thanks Given: 184

Thanked 630 Times in 587 Posts

Is this OK?
Do you want the IN strings (I don't know the exact word

) for the special records too or the count is sufficient?

Code:

awk 'END {
  print f ":"
    for (Z in z)
      printf "Total number of %s records = %d\n", \
      Z, z[Z]
    print RS
    }
FNR == 1 {
  if (f) {
    print f ":"
    for (Z in z)
      printf "Total number of %s records = %d\n", \
      Z, z[Z]
    if (sc) printf "Total number of Special records = %d\n", \
    sc    
    print RS
    split(x, z)
    s = sc = 0
    }
    f = FILENAME
  }    
$3 ~ /^(PTR|MX|NS|CNAME|A)$/ {  
  if (s) sc++ 
  else z[$3]++
  }
/SPECIALS/ { s = 1 }' db*

radoulov

View Public Profile for radoulov

Find all posts by radoulov

03-06-2009

Registered User

135, 0

Join Date: Feb 2009

Last Activity: 10 February 2016, 9:34 PM EST

Posts: 135

Thanks Given: 7

Thanked 0 Times in 0 Posts

Awesome !

Thanks a whole bunch !!

richsark

View Public Profile for richsark

Find all posts by richsark

Shell Programming and Scripting

Determining Word Frequency of Specific Terms

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Search for a specific word and print only the word from the input file

Discussion started by: mohan_kumarcs

2. Shell Programming and Scripting

Count frequency of unique values in specific column

Discussion started by: owwow14

3. Shell Programming and Scripting

Shell scripting: frequency of specific word in a string and statistics

Discussion started by: kraterions

4. Shell Programming and Scripting

Convert a list of word/terms into their Regexp representation

Discussion started by: oly_r

5. Shell Programming and Scripting

Fetch entries in front of specific word till next word

Discussion started by: Priyanka Chopra

6. Shell Programming and Scripting

Help with calculating frequency of specific word in a string

Discussion started by: perl_beginner

7. UNIX for Dummies Questions & Answers

How to print line starts with specific word and contains specific word using sed?

Discussion started by: tmalik79

8. Shell Programming and Scripting

Word Frequency Sort

Discussion started by: gimley

9. Shell Programming and Scripting

word frequency counter - awk solution?

Discussion started by: irrevocabile

10. Shell Programming and Scripting

Word frequency with additional information

Discussion started by: ToeLint