Awk/sed/cut to filter out records from a file based on criteria Post: 302999772

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Selecting records from file on criteria.

Can I have 2 files as in input to the awk command? Situation is somewhat below, File A contains number & value delimited by a space. File B contains number as a part of a line. I am not supposed to retrieve more than 1 number from a line. If number from file B matches with number from...

2. UNIX for Dummies Questions & Answers

Select records based on search criteria on first column

Hi All, I need to select only those records having a non zero record in the first column of a comma delimited file. Suppose my input file is having data like: "0","01/08/2005 07:11:15",1,1,"Created",,"01/08/2005" "0","01/08/2005 07:12:40",1,1,"Created",,"01/08/2005"...

3. Shell Programming and Scripting

Filter records in a file using AWK

I want to filter records in one of my file using AWK command (or anyother command). I am using the below code awk -F@ '$1=="0003"&&"$2==20100402" print {$0}' $INPUT > $OUTPUT I want to pass the 0003 and 20100402 values through a variable. How can I do this? Any help is much...

4. Shell Programming and Scripting

awk - splitting 1 large file into multiple based on same key records

Hello gurus, I am new to "awk" and trying to break a large file having 4 million records into several output files each having half million but at the same time I want to keep the similar key records in the same output file, not to exist accross the files. e.g. my data is like: Row_Num,...

5. Shell Programming and Scripting

Filter/remove duplicate .dat file with certain criteria

I am a beginner in Unix. Though have been asked to write a script to filter(remove duplicates) data from a .dat file. File is very huge containig billions of records. contents of file looks like 30002157,40342424,OTC,mart_rec,100, ,0 30002157,40343369,OTC,mart_rec,95, ,0...

6. Shell Programming and Scripting

Extract error records based on specific criteria from Unix file

Hi, I look for a awk one liner for below issue. input file ABC 1234 abc 12345 ABC 4567 678 XYZ xyz ght 678 ABC 787 yyuu ABC 789 7890 777 zxr hyip hyu mno uii 678 776 ABC ty7 888 All lines should be started with ABC as first field. If a record has another value for 1st...

7. Shell Programming and Scripting

Filter records based on 2nd file

Hello, I want to filter records of a file if they fall in range associated with a second file. First the chr number (2nd col of 1st file and 1st col of 2nd file) needs to be matched. Then if the 3rd col of the first file falls within any of the ranges specified by the 2nd and 3rd cols , then...

8. Shell Programming and Scripting

awk to print specific line in file based on criteria

In the file below I am trying to extract a specific instance of path, if the adjacent plugin": "/rundb/api/v1/plugin/49/. Thank you :). file "path": "/results/analysis/output/Home/Auto_user_S5-00580-4-Medexome_65_028/plugin_out/FileExporter_out.52", "plugin": "/rundb/api/v1/plugin/49/",...

9. Shell Programming and Scripting

awk to filter file based on seperate conditions

The below awk will filter a list of 30,000 lines in the tab-delimited file. What I am having trouble with is adding a condition to SVTYPE=CNV that will only print that line if CI= must be >.05 . The other condition to add is if SVTYPE=Fusion, then in order to print that line READ_COUNT must...

10. UNIX for Beginners Questions & Answers

Filter records from a log file based on timestamp

Dear Experts, I have a log file that contains a timestamp, I would like to filter record from that file based on timestamp. For example refer below file - cat sample.txt Jan 19 20:51:48 mukul-Vostro-14-3468 systemd: pam_unix(systemd-user:session): session opened for user root by (uid=0)...

LEARN ABOUT XFREE86

histo

HISTO(1)						      General Commands Manual							  HISTO(1)

NAME

       histo - compute 1-dimensional histogram of N data columns

SYNOPSIS

       histo [-c][-p] xmin xmax nbins
       histo [-c][-p] imin imax

DESCRIPTION

       Histo  bins columnular data on the standard input between the given minimum and maximum values.	If three command line arguments are given,
       the third is taken as the number of data bins between the first two real numbers.  If only two arguments are given, they are  both  assumed
       to be integers, and the number of data bins will be equal to their difference plus one.	The bins are always of equal size.

       The  output is N+1 columns of data (for N columns input), where the first column is the centroid of each division, and each row corresponds
       to the frequencies for each column around that value.

       If the -c option is present, then histo computes the cumulative histogram for each column instead of the straight frequencies.	The  upper
       value  of  each	bin  is printed also instead of the centroid.  This may be useful in computing percentiles, for example.  Values below the
       minimum specified are still counted in the cumulative total.

       The -p option tells histo to report the percentage of the total number of input lines rather than the absolute counts.  In the  case  of  a
       cumulative total, this yields the percentile values directly.  Values above the maximum are counted as well as values below in this case.

       All  input data is interpreted as real values, and columns must be white-space separated.  If any value is less than the minimum or greater
       than the maximum, it will be ignored unless the -c option is specified.

EXAMPLE

       To count data values between -1 and 1 in 50 bins:

	 histo -1 1 50 < input.dat

       To count frequencies of integers between 0 and 255:

	 histo 0 255 < input.dat

AUTHOR

       Greg Ward

SEE ALSO

       cnt(1), neaten(1), rcalc(1), rlam(1), tabfunc(1), total(1)

RADIANCE
							      9/6/96								  HISTO(1)