07-30-2009
How to count unique strings
How do I count the total number of unique strings from a file using Perl? Any help is appreciated..
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
I need to grep for a pattern in a file. Files are huge and have several repeated occurances of the strings which match pattern. I just need the strings which contain the pattern in the output.
For eg.
The contents of my file are as follows. The pattern I want to match by is ABCD
... (5 Replies)
Discussion started by: tektips
5 Replies
2. Shell Programming and Scripting
I have a sorted file like:
Apple 3
Apple 5
Apple 8
Banana 2
Banana 3
Grape 31
Orange 7
Orange 13
I'd like to search $1 and if $1 is not the same as $1 in the previous row print that row and print the number of times $1 was found.
so the output would look like:
Apple 8 3
Banana... (2 Replies)
Discussion started by: dcfargo
2 Replies
3. Shell Programming and Scripting
Hello Guys
I have a flat file with '|~|' delimited
When I use to record count using below command
awk -FS"+" ' {print $colno}' filename | wc -l
the count is fine
But when I am trying to find the unique number of record the o/p is always 1
awk -FS"+" ' {print $colno}'... (11 Replies)
Discussion started by: Pratik4891
11 Replies
4. Shell Programming and Scripting
Hi,
Im looking for a script which will calculate the unique strings column 2 & 3 values in a log as mentioned in example
eg:-
bag 12 12
bag 18 15
bags 15 13
bags 15 14
blazer 24 24
blazer 33 32
boots 19 15
Result should be:-
bag 30 27
bags 30 27... (9 Replies)
Discussion started by: Paulwintech
9 Replies
5. Shell Programming and Scripting
how to display the unique strings in two files using shell script or commands.
I tried diff and cmp but it shows the entire line, i need only the mismatched strings.
File1:
sat,sun,mon,tue
rose,lilly,lotus
white,red,blue,green,pink
File2:
sat,sun,mon,tue
rose,sunflower,lotus... (4 Replies)
Discussion started by: Arun_Linux
4 Replies
6. Shell Programming and Scripting
Im looking for an awk script that will take the unique values in column 5, then print and count the unique values in column 6.
CA001011500 11111 11111 -9999 201301 AAA
CA001012040 11111 11111 -9999 201301 AAA
CA001012573 11111 11111 -9999 201301 BBB
CA001012710 11111 11111 -9999 201301... (4 Replies)
Discussion started by: ncwxpanther
4 Replies
7. Shell Programming and Scripting
When I use the below awk to count the unique lines in $4 for the input it seems to work. The answer is 3 because $4 is only unique 3 times in all the entries. However, when I use the same on actual data I get 56,536 and I know the answer should be 56,548. My question is there a better way to... (8 Replies)
Discussion started by: cmccabe
8 Replies
8. Shell Programming and Scripting
Hello Team,
I need your help on the following:
My input file a.txt is as below:
3330690|373846|108471
3330690|373846|108471
0640829|459725|100001
0640829|459725|100001
3330690|373847|108471
Here row 1 and row 2 of column 1 are identical but corresponding column 2 value are... (4 Replies)
Discussion started by: angshuman
4 Replies
9. UNIX for Beginners Questions & Answers
Dear all,
I would like to know how to list and count unique words in thousands number of text files.
Please help me out
thanks in advance (9 Replies)
Discussion started by: imranrasheedamu
9 Replies
10. UNIX for Beginners Questions & Answers
Hello,
I am trying to count unique rows in my file based on 4 columns (2-5) and to output its frequency in a sixth column. My file is tab delimited
My input file looks like this:
Colum1 Colum2 Colum3 Colum4 Coulmn5
1.1 100 100 a b
1.1 100 100 a c
1.2 200 205 a d
1.3 300 301 a y
1.3 300... (6 Replies)
Discussion started by: nans
6 Replies
VM_STAT(1) BSD General Commands Manual VM_STAT(1)
NAME
vm_stat -- show Mach virtual memory statistics
SYNOPSIS
vm_stat [[-c count] interval]
DESCRIPTION
vm_stat displays Mach virtual memory statistics. If the optional interval is specified, then vm_stat will display the statistics every
interval seconds. In this case, each line of output displays the change in each statistic (an interval count of 1 displays the values per
second). However, the first line of output following each banner displays the system-wide totals for each statistic. If a count is pro-
vided, the command will terminate after count intervals. The following values are displayed:
Pages free
the total number of free pages in the system.
Pages active
the total number of pages currently in use and pageable.
Pages inactive
the total number of pages on the inactive list.
Pages speculative
the total number of pages on the speculative list.
Pages throttled
the total number of pages on the throttled list (not wired but not pageable).
Pages wired down
the total number of pages wired down. That is, pages that cannot be paged out.
Pages purgeable
the total number of purgeable pages.
Translation faults
the number of times the "vm_fault" routine has been called.
Pages copy-on-write
the number of faults that caused a page to be copied (generally caused by copy-on-write faults).
Pages zero filled
the total number of pages that have been zero-filled on demand.
Pages reactivated
the total number of pages that have been moved from the inactive list to the active list (reactivated).
Pages purged
the total number of pages that have been purged.
File-backed pages
the total number of pages that are file-backed (non-swap)
Anonymous pages
the total number of pages that are anonymous
Uncompressed pages
the total number of pages (uncompressed) held within the compressor
Pages used by VM compressor:
the number of pages used to store compressed VM pages.
Pages decompressed
the total number of pages that have been decompressed by the VM compressor.
Pages compressed
the total number of pages that have been compressed by the VM compressor.
Pageins
the total number of requests for pages from a pager (such as the inode pager).
Pageouts
the total number of pages that have been paged out.
Swapins
the total number of compressed pages that have been swapped out to disk.
Swapouts
the total number of compressed pages that have been swapped back in from disk.
If interval is not specified, then vm_stat displays all accumulated statistics along with the page size.
Mac OS X August 13, 1997 Mac OS X