Sponsored Content
Top Forums Shell Programming and Scripting Count occurrence of column one unique value having unique second column value Post 302978669 by RudiC on Tuesday 2nd of August 2016 11:27:04 AM
Old 08-02-2016
If the index constructed from $1 and $2 does not exist in the temp array T, its a new combination, and the counter for $1 is incremented. When the input file ends, all these counters and the corresponding $1 values are printed.

More detailed:
For the first occurrence of the $1,$2 combination, T[$1,$2] doesn't exist, so !T[$1,$2] is true, and the counter C[$1] is incremented. Due to the increment of T , the next time the combination is encountered, nothing will happen. C[$1] thus counts up the different $2s for every single $1. In the end, the count for every single $1 is printed.

Last edited by RudiC; 08-02-2016 at 03:41 PM..
This User Gave Thanks to RudiC For This Post:
 

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

print unique values of a column and sum up the corresponding values in next column

Hi All, I have a file which is having 3 columns as (string string integer) a b 1 x y 2 p k 5 y y 4 ..... ..... Question: I want get the unique value of column 2 in a sorted way(on column 2) and the sum of the 3rd column of the corresponding rows. e.g the above file should return the... (6 Replies)
Discussion started by: amigarus
6 Replies

2. Shell Programming and Scripting

awk pattern match and count unique in column

Hi all I have a need of searching some pattern in file by month and then count unique records D11 G11 R11 -------> Pattern available in file S11 Jan$1 to $5 column contains some records in which I want to find unique for this purpose I have written script like below awk '/Jan/ ||... (4 Replies)
Discussion started by: nex_asp
4 Replies

3. Shell Programming and Scripting

Count frequency of unique values in specific column

Hi, I have tab-deliminated data similar to the following: dot is-big 2 dot is-round 3 dot is-gray 4 cat is-big 3 hot in-summer 5 I want to count the frequency of each individual "unique" value in the 1st column. Thus, the desired output would be as follows: dot 3 cat 1 hot 1 is... (5 Replies)
Discussion started by: owwow14
5 Replies

4. UNIX for Dummies Questions & Answers

Grep unique 1st column

Hello, I'm trying to used awk but am new to this. I have a file like this: Bob is a good boy Bob is a strange person Bob is a good dancer Jane can party Jane is a good girl Jane is batty I'd like to get this: Bob is a good boy is a strange person is a good dancer Jane... (4 Replies)
Discussion started by: Billyjo
4 Replies

5. Shell Programming and Scripting

Sorting unique by column

I am trying to sort, do uniq by 1st column and report this 4 columns tab delimiter table , eg chr10:112174128 rs2255141 2E-10 Cholesterol, total chr10:112174128 rs2255141 7E-16 LDL chr10:17218291 rs10904908 3E-11 HDL Cholesterol chr10:17218291 rs970548 8E-9 TG... (4 Replies)
Discussion started by: fat
4 Replies

6. UNIX for Dummies Questions & Answers

Removing rows that contain non-unique column entry

Background: I have a file of thousands of potential SSR primers from Batch Primer 3. I can't use primers that will contain the same sequence ID or sequence as another primer. I have some basic shell scripting skills, but not enough to handle this. What you need to know: I need to remove the... (1 Reply)
Discussion started by: msatseqs
1 Replies

7. UNIX for Beginners Questions & Answers

Find unique values but only in column 1

Hi All, Does anyone have any suggestions/examples of how i could show only lines where the first field is not duplicated. If the first field is listed more than once it shouldnt be shown even if the other columns make it unique. Example file : 876,RIBDA,EC2 876,RIBDH,EX7 877,RIBDF,E28... (4 Replies)
Discussion started by: mutley2202
4 Replies

8. UNIX for Beginners Questions & Answers

Count unique column

Hello, I am trying to count unique rows in my file based on 4 columns (2-5) and to output its frequency in a sixth column. My file is tab delimited My input file looks like this: Colum1 Colum2 Colum3 Colum4 Coulmn5 1.1 100 100 a b 1.1 100 100 a c 1.2 200 205 a d 1.3 300 301 a y 1.3 300... (6 Replies)
Discussion started by: nans
6 Replies

9. Shell Programming and Scripting

Count number of unique values in each column of array

What is an efficient way of counting the number of unique values in a 400 column by 1000 row array and outputting the counts per column, assuming the unique values in the array are: A, B, C, D In other words the output should look like: Value COL1 COL2 COL3 A 50 51 52... (16 Replies)
Discussion started by: Geneanalyst
16 Replies
counter(n)						      Counters and Histograms							counter(n)

NAME
counter - Procedures for counters and histograms SYNOPSIS
package require Tcl 8 package require counter ?2.0? ::counter::init tag args ::counter::count tag ?delta? ?instance? ::counter::start tag instance ::counter::stop tag instance ::counter::get tag args ::counter::exists tag ::counter::names ::counter::histHtmlDisplay tag args DESCRIPTION
The counter package provides a counter facility and can compute statistics and histograms over the collected data. ::counter::init tag args This defines a counter with the name tag. The args determines the characteristics of the counter. The args are -group name Keep a grouped counter where the name of the histogram bucket is passed into ::counter::count. -hist bucketsize Accumulate the counter into histogram buckets of size bucketsize. For example, if the samples are millisecond time values and bucketsize is 10, then each histogram bucket represents time values of 0 to 10 msec, 10 to 20 msec, 20 to 30 msec, and so on. -hist2x bucketsize Accumulate the statistic into histogram buckets. The size of the first bucket is bucketsize, each other bucket holds values 2 times the size of the previous bucket. For example, if bucketsize is 10, then each histogram bucket represents time values of 0 to 10 msec, 10 to 20 msec, 20 to 40 msec, 40 to 80 msec, and so on. -hist10x bucketsize Accumulate the statistic into histogram buckets. The size of the first bucket is bucketsize, each other bucket holds values 10 times the size of the previous bucket. For example, if bucketsize is 10, then each histogram bucket represents time val- ues of 0 to 10 msec, 10 to 100 msec, 100 to 1000 msec, and so on. -lastn N Save the last N values of the counter to maintain a "running average" over the last N values. -timehist secsPerMinute Keep a time-based histogram. The counter is summed into a histogram bucket based on the current time. There are 60 per- minute buckets that have a size determined by secsPerMinute, which is normally 60, but for testing purposes can be less. Every "hour" (i.e., 60 "minutes") the contents of the per-minute buckets are summed into the next hourly bucket. Every 24 "hours" the contents of the per-hour buckets are summed into the next daily bucket. The counter package keeps all time-based histograms in sync, so the first secsPerMinute value seen by the package is used for all subsequent time-based histograms. ::counter::count tag ?delta? ?instance? Increment the counter identified by tag. The default increment is 1, although you can increment by any value, integer or real, by specifying delta. You must declare each counter with ::counter::init to define the characteristics of counter before you start to use it. If the counter type is -group, then the counter identified by instance is incremented. ::counter::start tag instance Record the starting time of an interval. The tag is the name of the counter defined as a -hist value-based histogram. The instance is used to distinguish this interval from any other intervals that might be overlapping this one. ::counter::stop tag instance Record the ending time of an interval. The delta time since the corresponding ::counter::start call for instance is recorded in the histogram identified by tag. ::counter::get tag args Return statistics about a counter identified by tag. The args determine what value to return: -total Return the total value of the counter. This is the default if args is not specified. -totalVar Return the name of the total variable. Useful for specifying with -textvariable in a Tk widget. -N Return the number of samples accumulated into the counter. -avg Return the average of samples accumulated into the counter. -avgn Return the average over the last N samples taken. The N value is set in the ::counter::init call. -hist bucket If bucket is specified, then the value in that bucket of the histogram is returned. Otherwise the complete histogram is returned in array get format sorted by bucket. -histVar Return the name of the histogram array variable. -histHour Return the complete hourly histogram in array get format sorted by bucket. -histHourVar Return the name of the hourly histogram array variable. -histDay Return the complete daily histogram in array get format sorted by bucket. -histDayVar Return the name of the daily histogram array variable. -resetDate Return the clock seconds value recorded when the counter was last reset. -all Return an array get of the array used to store the counter. This includes the total, the number of samples (N), and any type-specific information. This does not include the histogram array. ::counter::exists tag Returns 1 if the counter is defined. ::counter::names Returns a list of all counters defined. ::counter::histHtmlDisplay tag args Generate HTML to display a histogram for a counter. The args control the format of the display. They are: -title string Label to display above bar chart -unit unit Specify minutes, hours, or days for the time-base histograms. For value-based histograms, the unit is used in the title. -images url URL of /images directory. -gif filename Image for normal histogram bars. The filename is relative to the -images directory. -ongif filename Image for the active histogram bar. The filename is relative to the -images directory. -max N Maximum number of value-based buckets to display. -height N Pixel height of the highest bar. -width N Pixel width of each bar. -skip N Buckets to skip when labeling value-based histograms. -format string Format used to display labels of buckets. -text boolean If 1, a text version of the histogram is dumped, otherwise a graphical one is generated. counter 2.0 counter(n)
All times are GMT -4. The time now is 01:30 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy