Top Forums Shell Programming and Scripting Count the occurences of strings Post 302889799 by gomez on Monday 24th of February 2014 01:42:55 AM
Old 02-24-2014
Count the occurrences of strings

I have some text files in a folder f1, each with 10 columns. The first five columns of one file are shown below.

Code:
aab abb  263-455  263  455
aab abb  263-455  263  455
aab abb  263-455  263  455
bbb abb  26-455   26   455
bbb abb  26-455   26   455
bbb aka  264-266  264  266
bga bga  230-232  230  232
bga bga  230-232  230  232

For each string in the first and second columns, I want to count how many distinct values it occurs with in the third column (or, equivalently, the fourth and fifth columns). I used the following program.

Code:
for i in f1/*.txt
do
    awk '
    BEGIN {
        path = "/home/gomez/Desktop/f2/"    # output directory
    }
    # Count each (column, string, column-3 value) combination only once
    !s[1":"$1":"$3]++ { sU[$1]++; tot++ }
    !s[2":"$2":"$3]++ { sU[$2]++; tot++ }
    END {
        sub(/.*\//, "", FILENAME)           # strip the directory from the input name
        out = path FILENAME
        for (x in sU)
            print x, sU[x] > out
        print "Total No -", tot > out
    }' "$i"
done

Output of the program

Code:
bga 1
aab 1
bbb 2
aka 1
bga 1
abb 2
Total No - 8

But I would like to get output like this:

Code:
bga 230  1
bga 232  1
aab 263  1
abb 455  2
aka 266  1
bbb 26    1
bbb 264  1

Total No - 8

I would like to print the value associated with each string as a second column. The values come from the third column (or the fourth and fifth columns) of the input files.

Your help would be appreciated!

Last edited by gomez; 02-24-2014 at 04:04 AM..
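
One possible approach, offered as a sketch rather than a definitive answer: keep the same de-duplication on column 3, but use the string together with its number as the counting key. This assumes, based on the desired output above, that a string in column 1 is paired with column 4 and a string in column 2 with column 5. The function name count_pairs and the array names seen and cnt are made up for illustration.

```shell
# Sketch: for each (string, number) pair, count the distinct column-3
# values it occurs with. Assumption: column 1 pairs with column 4 and
# column 2 pairs with column 5.
count_pairs() {
    awk '
    # seen[] ensures each (column position, string, column 3) is counted once
    !seen[1, $1, $3]++ { cnt[$1 " " $4]++; tot++ }
    !seen[2, $2, $3]++ { cnt[$2 " " $5]++; tot++ }
    END {
        for (k in cnt)
            print k, cnt[k]          # "for (k in ...)" order is unspecified
        print ""
        print "Total No -", tot
    }' "$1"
}

# Hypothetical usage, mirroring the loop in the question:
#   for f in f1/*.txt; do count_pairs "$f" > "f2/$(basename "$f")"; done
```

The per-string lines come out in whatever order awk iterates the array, so pipe them through sort if a fixed order matters.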
 
