Determining Word Frequency of Specific Terms
Shell Programming and Scripting, post 302294749 by richsark on Thursday 5th of March 2009 06:25:24 PM
I have many DNS zone files that contain various record types.

Is it too much to ask to add some finesse to my request?

Example: I could have a file named

db.208.199.11.0

that contains the following records:

Code:
224.199.207.IN-ADDR.ARPA. IN NS AIM1.internet.com.
4.200.162.207.in-addr.arpa. IN PTR beeriftw.internet.com.
arroyoeinternet.com. IN A 200.199.227.49

Then another file, db.explorer.com, would contain:

Code:
224.162.207.IN-ADDR.ARPA.       IN NS   pwedns1.internet.com.
224.162.207.IN-ADDR.ARPA.       IN NS   pmedns1.internet.com.
224.162.207.IN-ADDR.ARPA.       IN NS   phedns1.internet.com.
224.162.207.IN-ADDR.ARPA.       IN NS   auth100.ns.aut.net.

So what I am requesting is to create an input file that lists these zone file names, and to have your script run its counts against each file named there.

So the output, for each file name in my input file, might look like this:

Code:
db.208.199.11.0:
Total number of A records = 684
Total number of PTR records = 306
Total number of CNAME records = 58
Total number of NS records = 1352

db.explorer.com:
Total number of A records = 6
Total number of PTR records = 30
Total number of CNAME records = 88
Total number of NS records = 55

So rather than having the script look for each txt file, as I originally thought, I would have it reference a master input file.

Thanks in advance!
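
To make the request concrete, here is a minimal sketch of the kind of script I have in mind. It is not a finished solution. The function name `count_zone_records` and the master file name `zones.list` are made up for illustration. It assumes one zone file name per line in the master file, that every record carries an explicit `IN` class, and that no owner name is literally `in`.

```shell
#!/bin/sh
# Sketch only: count A, PTR, CNAME and NS records in every zone file
# named in a master list. count_zone_records is a hypothetical name.

count_zone_records() {
    list=$1    # master input file: one zone file name per line

    while IFS= read -r zone || [ -n "$zone" ]; do
        [ -n "$zone" ] || continue              # skip blank lines
        if [ ! -r "$zone" ]; then
            echo "$zone: cannot read file" >&2
            continue
        fi

        echo "$zone:"
        awk '
            # Skip zone file comment lines, which start with a semicolon.
            /^[ \t]*;/ { next }

            # Find the class field "IN"; the field after it is the record type.
            {
                for (i = 1; i < NF; i++)
                    if (toupper($i) == "IN") {
                        count[toupper($(i + 1))]++
                        break
                    }
            }

            # Report the four requested types, printing 0 for any not seen.
            END {
                n = split("A PTR CNAME NS", types, " ")
                for (t = 1; t <= n; t++)
                    printf "Total number of %s records = %d\n", types[t], count[types[t]]
            }
        ' "$zone"
        echo
    done < "$list"
}
```

With the function loaded, `count_zone_records zones.list` would print one block per file in the format shown above. For the four-line db.explorer.com sample, the NS line would read `Total number of NS records = 4`. Matching on the field after `IN` rather than grepping for the bare word avoids counting an `A` or `NS` that appears inside a host name.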

Last edited by radoulov; 03-06-2009 at 08:22 AM. Reason: added code tags
 
