I have a very large tab-delimited, ~2GB file2 that I am trying to filter using $2 of file1. If $2 of file1 is in the range of $2 and $3 in file1 then the entire line of file2 is outputed. If the range match is not found then that line is skipped. The awk below does run but no output results. Thank you .
File1
File2
desired output tab-delimited
awk
Note that since $1 in file1 is always the same (i.e., chr1), there is only one element in the arrays min["chr1"] and max["chr1"] with their values being reset by each line that is read from file1.
If your input and output files really do have <tab> delimited fields (instead of fields separated by four <space>s as in the sample data you provided), the following slight changes to your code seem to produce the output you want:
I want to filter records in one of my file using AWK command (or anyother command). I am using the below code
awk -F@ '$1=="0003"&&"$2==20100402" print {$0}' $INPUT > $OUTPUT
I want to pass the 0003 and 20100402 values through a variable. How can I do this?
Any help is much... (1 Reply)
Dear all,
I am using awk to filter some data like this:-
awk 'NR==FNR{a;next}($1 in a)' FS=":" filter.dat data.dat >! out.dat where the filter and input data look like this:-
filter.dat...
n_o00j_1900_40_007195350_0:n_o00j_1940_40_007308526... (3 Replies)
I have a main file:
...
17,466971 0,095185 17,562156 id 676
17,466971 0,096694 17,563665 id 677
17,466971 0,09816 17,565131 id 678
17,466971 0,099625 17,566596 id 679
17,466971 0,101091 17,568062 id 680
17,466971 0,016175 17,483146 id... (4 Replies)
Hi,
I would like extract some lines from file csv using awk , below the example:
I have the file test.csv with in content below.
FLUSSO;COD;DATA_LAV;ESITO
ULL;78;17/09/2013;OL
ULL;45;05/09/2013;Apertura
NP;45;13/09/2013;Riallineamento
ULLNP;78;17/09/2013;OL
NPG;14;12/09/2013;AperturaTK... (6 Replies)
I have a file (sorted_unknown) with ~1400 $5 values before the - that are "unknown". What I am trying to do is use the text in $2 of (sort_targets) to update those "unknown" values in the (sorted_unknown).
In $1 of (sort_targets) there are a set of numbers that can be used to update the "unknown"... (8 Replies)
In the below, I am trying to lookup $1 and $2 from file1, in a range search using $1 $2 $3 of file2. If the search key from file1 is found in file2, then the word low is printed in the last field of that line in the updated file1. Only the last section of file1 needs to be searched, but I am not... (6 Replies)
I am trying to update the below awk, kindly provided by @RavinderSingh13, to update each line of file1 with either Low or No Low based on matching $2 of file1 to a range in $2 and $3 of file2. If the $2 value in file1 matches the range in file2 then that line is Low, otherwise it is No Low in the... (3 Replies)
In the below awk if I use the attached file as the input, I get no results for TCF4. However, if I just copy that line from the attached file and use that as input I get results for TCF4.
Basically the gene file is a 1 column list that is used to filter $8 of the attached file. When there is a... (9 Replies)
I have file and process it and provide clean output.
input file
Device Symmetrix Name : 000A4
Device Symmetrix Name : 000A5
Device Symmetrix Name : 000A6
Device Symmetrix Name : 000A7
Device Symmetrix Name : 000A8
Device Symmetrix Name : 000A9
Device Symmetrix Name ... (10 Replies)