Word frequency with additional information


 
# 1  
Old 02-12-2010

Hello everyone,

I am using a chunk of code to count how often each file name occurs across a set of directories. The code looks like this:
Code:
find . -name "*.log" | cut -d/ -f6 | cut -d. -f1 | sort | uniq -c | sort -nr

The file paths would look something like this:
Code:
./clogs/20091201/2/205/54353.log
./clogs/20091201/12/201/99001.log
./clogs/20091202/2/205/54353.log
etc.
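
To see which `/`-separated field holds which part of such a path, one of the sample paths can be split and numbered. This is only a sketch; the helper name `show_fields` is made up for illustration and is not part of the original code:

```shell
# Hypothetical helper: print each "/"-separated field of a path with its index.
show_fields() {
    printf '%s\n' "$1" | awk -F/ '{ for (i = 1; i <= NF; i++) print i, $i }'
}

show_fields "./clogs/20091201/2/205/54353.log"
```

With this layout the date is field 3 and the file name is the last field (field 6), which is what any cut or awk field number has to match.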

The 20091201 represents the date. Running the code would produce something like:
Code:
2  54353
1  99001

This is good, but I'd like some extra information displayed as well, specifically the dates. Is it possible to add the dates so that the output looks like this:
Code:
2 54353 20091201 20091202
1 99001 20091201

Any suggestions would be appreciated. Thanks.
# 2  
Old 02-12-2010
This does what you asked for. But as Oscar Wilde put it, there are two tragedies in life: not getting what you want, and getting it.

This has the potential to produce VERY long lines in report.txt:
Code:
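find . -type f -name "*.log" | awk -F'/' '{ # $NF is the file name; keep the part before the first dot
                               tmp = substr($NF, 1, index($NF, ".") - 1)
                               # $3 is the date directory in ./clogs/DATE/...
                               dir[tmp] = dir[tmp] " " $3
                               cnt[tmp]++
                             }
                        END { # print count, name, then the collected dates
                              for (i in cnt) print cnt[i], i dir[i] }' | sort -nr > report.txt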

Note: find . -type f returns only regular files, not directories.
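
If the long lines in report.txt become a problem, one option is to record each date only once per file name. The following is a minimal sketch, assuming the ./clogs/DATE/.../NAME.log layout from the first post (so the date is the third `/`-separated field). The function name `log_freq` and the arrays `seen` and `dates` are made up for illustration:

```shell
# Hypothetical helper: count *.log base names under a directory and list
# each date directory once per name. Assumes paths look like
# ./clogs/DATE/.../NAME.log relative to the directory passed as $1.
log_freq() {
    ( cd "${1:-.}" && find . -type f -name '*.log' ) | sort |
    awk -F/ '{
        name = $NF
        sub(/\.log$/, "", name)        # strip the extension
        cnt[name]++
        if (!seen[name, $3]++)         # first time this date shows up for this name
            dates[name] = dates[name] " " $3
    }
    END { for (n in cnt) print cnt[n], n dates[n] }' |
    sort -k1,1nr -k2,2                 # highest count first, then by name
}
```

Run as `log_freq .` from the directory that contains clogs. The first sort feeds the paths to awk in date order, so the dates on each line come out ascending; the last sort puts the most frequent names at the top.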