Finding the most common entry in a column Post: 302146769

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Finding longest common substring among filenames

I will be performing a task on several directories, each containing a large number of files (2500+) that follow a regular naming convention: YYYY_MM_DD_XX.foo_bar.A.B.some_different_stuff.EXT What I would like to do is automatically discover the part of the filenames that are common to all...

2. Shell Programming and Scripting

Finding Authors in Common Across Dozens of Lists

I currently have publication lists for ~3 dozen faculty members. I need to find out how many publications are in common across all faculty members - person 1 with person 2, person 1 with person 3, person 2 with person 3, person 1 with both person 2 and person 3, etc. One person may have Last1,...

3. Shell Programming and Scripting

finding common numbers (contents) across 2 or 3 files

I have 3 files which are tab delimited and have numbers in it. file 1 1 2 3 4 5 6 7 File 2 3 5 7 8 File 3 1

4. Shell Programming and Scripting

for each different entry in column 1 extract maximum values from column 2 in unix/awk

Hello, I have 2 columns (1st column has multiple entries but the corresponding values in the column 2 may be the same or different.) however I want to extract unique values for each entry in column 1 by assigning the max value from column 2 SDF4 -0.211654 SDF4 0.978068 ...

5. Shell Programming and Scripting

Rename a header column by adding another column entry to the header column name URGENT!!

Hi All, I have a file example.csv which looks like this GrpID,TargetID,Signal,Avg_Num CSCH74_1_1,2007,61,256 CSCH74_1_1,212007,647,679 CSCH74_1_1,12007,3,32 CSCH74_1_1,207,299,777 I want the output as GrpID,TragetID,Signal-CSCH74_1_1,Avg_Num CSCH74_1_1,2007,61,256...

6. Shell Programming and Scripting

Finding most repeated entry in a column and giving the count

Please can you help in providing the most repeated entry in the 2nd column and give its count Here is an input file 1, This , is a forum 2, This , is a forum 1, There , is a forum 2, This , is not right Here the most repeated entry is "This" and count is 3 So output...

7. Shell Programming and Scripting

Finding most common substrings

Hello, I would like to know what is the three most abundant substrings of length 6 from col2. The file is quite large and looks like this col1 col2 EN03 typehellobyedogcatcatdog EN09 typehellobyebyebyebye EN08 dogcatcatdogbyebyebyebye EN09 catcattypehellobyebyebyebye...

8. Shell Programming and Scripting

Sum column values based in common identifier in 1st column.

Hi, I have a table to be imported for R as matrix or data.frame but I first need to edit it because I've got several lines with the same identifier (1st column), so I want to sum the each column (2nd -nth) of each identifier (1st column) The input is for example, after sorted: K00001 1 1 4 3...

9. UNIX for Beginners Questions & Answers

Finding common entries between 10 columns

Hello, I need to find the intersection across 10 columns. Kindly help. my file (INPUT.csv) looks like this 4_R 4_S 8_R 8_S 12_R 12_S 24_R 24_S LOC_Os01g01010 LOC_Os01g01010 LOC_Os01g01010 LOC_Os04g48290 LOC_Os01g01010 LOC_Os01g01010...

10. UNIX for Beginners Questions & Answers

Awk/sed summation of one column based on some entry in first column

Hi All , I am having an input file as stated below Input file 6 ddk/djhdj/djhdj/Q 10 0.5 dhd/jdjd.djd.nd/QB 01 0.5 hdhd/jd/jd/jdj/Q 10 0.5 512 hd/hdh/gdh/Q 01 0.5 jdjd/jd/ud/j/QB 10 0.5 HD/jsj/djd/Q 01 0.5 71 hdh/jjd/dj/jd/Q 10 0.5 ...

LEARN ABOUT REDHAT

slabinfo

SLABINFO(5)							   Linux manual 						       SLABINFO(5)

NAME

       /proc/slabinfo - Kernel slab allocator statistics

SYNOPSIS

       cat /proc/slabinfo

DESCRIPTION

       Frequently  used  objects  in  the Linux kernel (buffer heads, inodes, dentries, etc.)  have their own cache. The file /proc/slabinfo gives
       statistics. For example:

	      % cat /proc/slabinfo
	      slabinfo - version: 1.1
	      kmem_cache	    60	   78	 100	2    2	  1
	      blkdev_requests	  5120	 5120	  96  128  128	  1
	      mnt_cache 	    20	   40	  96	1    1	  1
	      inode_cache	  7005	14792	 480 1598 1849	  1
	      dentry_cache	  5469	 5880	 128  183  196	  1
	      filp		   726	  760	  96   19   19	  1
	      buffer_head	 67131	71240	  96 1776 1781	  1
	      vm_area_struct	  1204	 1652	  64   23   28	  1
	      ...
	      size-8192 	     1	   17	8192	1   17	  2
	      size-4096 	    41	   73	4096   41   73	  1
	      ...

       For each slab cache, the cache name, the number of currently active objects, the total number of available objects, the size of each object
       in  bytes,  the	number of pages with at least one active object, the total number of allocated pages, and the number of pages per slab are
       given.

       Note that because of object alignment and slab cache overhead, objects are not normally packed tightly into pages.  Pages with even one in-
       use object are considered in-use and cannot be freed.

       Kernels	compiled with slab cache statistics will also have "(statistics)" in the first line of output, and will have 5 additional columns,
       namely: the high water mark of active objects; the number of times objects have been allocated; the number of times  the  cache	has  grown
       (new  pages  added  to this cache); the number of times the cache has been reaped (unused pages removed from this cache); and the number of
       times there was an error allocating new pages to this cache.  If slab cache statistics are not enabled for this kernel, these columns  will
       not be shown.

       SMP  systems  will  also  have  "(SMP)" in the first line of output, and will have two additional columns for each slab, reporting the slab
       allocation policy for the CPU-local cache (to reduce the need for inter-CPU synchronization when allocating objects from the  cache).   The
       first  column  is  the per-CPU limit: the maximum number of objects that will be cached for each CPU.  The second column is the batchcount:
       the maximum number of free objects in the global cache that will be transferred to the per-CPU cache if it  is  empty,  or  the	number	of
       objects to be returned to the global cache if the per-CPU cache is full.

       If  both  slab  cache  statistics  and SMP are defined, there will be four additional columns, reporting the per-CPU cache statistics.  The
       first two are the per-CPU cache allocation hit and miss counts: the number of times an object was or was not available in the per-CPU cache
       for  allocation.   The  next  two are the per-CPU cache free hit and miss counts: the number of times a freed object could or could not fit
       within the per-CPU cache limit, before flushing objects to the global cache.

       It is possible to tune the SMP per-CPU slab cache limit and batchcount via:

       echo "cache_name limit batchcount" > /proc/slabinfo

AVAILABILITY

       /proc/slabinfo exists since Linux 2.1.23.  SMP per-CPU caches exist since Linux 2.4.0-test3.

FILES

       <linux/slab.h>

								    2001-06-19							       SLABINFO(5)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Finding longest common substring among filenames

Discussion started by: cmcnorgan

2. Shell Programming and Scripting

Finding Authors in Common Across Dozens of Lists

Discussion started by: Peggy White

3. Shell Programming and Scripting

finding common numbers (contents) across 2 or 3 files

Discussion started by: Lucky Ali

4. Shell Programming and Scripting

for each different entry in column 1 extract maximum values from column 2 in unix/awk

Discussion started by: Diya123