awk to remove field and match strings to add text Post: 302974352


Shell Programming and Scripting: Post 302974352 by cmccabe on Saturday 28th of May 2016 07:58:12 PM

05-28-2016


The awk is close, but hopefully the explanation below helps. Both file1 and file2 are tab-delimited. Where $1 and $2 of a file1 line match $1 and $2 of a file2 line, the last 4 strings of that file2 line are copied to the end of the matching file1 line. In this case the last 4 strings are GOOD 399 reads hom, the text at the very end of the file2 data line (in bold in the original post). Thank you very much.


file1

Code:

Chr    Start    End    Ref    Alt    Func.refGene    Gene.refGene    GeneDetail.refGene    ExonicFunc.refGene    AAChange.refGene    PopFreqMax    CLINSIG    CLNDBN    CLNACC    CLNDSDB    CLNDSDBID    common
chr1    949654    949654    A    G    exonic    ISG15    .    synonymous SNV    ISG15:NM_005101:exon2:c.294A>G:p.V98V    0.96    .    .    .    .    .    .

file2

Code:

##.....
##.....
#CHROM    POS    ID    REF    ALT    QUAL    FILTER    INFO    FORMAT    NS12911_BC1
chr1    949654    .    A    G    3825.28    PASS    AF=1;AO=621;DP=624;FAO=399;FDP=399;FR=.;FRO=0;FSAF=225;FSAR=174;FSRF=0;FSRR=0;FWDB=0.00425236;FXX=0.00249994;HRUN=1;LEN=1;MLLD=97.922;OALT=G;OID=.;OMAPALT=G;OPOS=949654;OREF=A;PB=0.5;PBP=1;QD=38.3487;RBI=0.0367904;REFB=0.0353003;REVB=-0.0365438;RO=2;SAF=335;SAR=286;SRF=0;SRR=2;SSEN=0;SSEP=0;SSSB=0.00332809;STB=0.5;STBP=1;TYPE=snp;VARB=-3.42335e-05;ANN=ISG15    GT:GQ:DP:FDP:RO:FRO:AO:FAO:AF:SAR:SAF:SRF:SRR:FSAR:FSAF:FSRF:FSRR    1/1:171:624:399:2:0:621:399:1:286:335:0:2:174:225:0:0 GOOD 399 reads hom

desired output

Code:

Chr    Start    End    Ref    Alt    Func.refGene    Gene.refGene     GeneDetail.refGene    ExonicFunc.refGene    AAChange.refGene     PopFreqMax    CLINSIG    CLNDBN    CLNACC    CLNDSDB    CLNDSDBID     common
chr1    949654    949654    A    G    exonic    ISG15    .     synonymous SNV    ISG15:NM_005101:exon2:c.294A>G:p.V98V    0.96     .    .    .    .    .    . GOOD 399 reads hom
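
One possible way to get the desired output above is sketched here. It is not the awk from the thread, just a minimal illustration under some assumptions: the sample rows are shortened stand-ins for file1 and file2, the array name `tag` is made up, and every data line of file2 is assumed to end in exactly four whitespace-separated annotation strings. Both files are read with awk's default whitespace splitting, which is enough here because only $1 and $2 are used as the key and the file1 line is printed back unchanged as $0.

```shell
#!/bin/sh
# Shortened, hypothetical copies of file1 and file2 from the post.
tmp=$(mktemp -d)
printf 'Chr\tStart\tEnd\tRef\tAlt\n'          >  "$tmp/file1"
printf 'chr1\t949654\t949654\tA\tG\n'         >> "$tmp/file1"
printf '##.....\n'                            >  "$tmp/file2"
printf '#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tNS12911_BC1\n' >> "$tmp/file2"
printf 'chr1\t949654\t.\tA\tG\t3825.28\tPASS\tAF=1\tGT:GQ\t1/1:171 GOOD 399 reads hom\n' >> "$tmp/file2"

result=$(awk '
    NR == FNR {                       # first file on the command line: file2
        if ($0 !~ /^#/ && NF >= 6)    # skip the ## and #CHROM header lines
            tag[$1, $2] = $(NF-3) " " $(NF-2) " " $(NF-1) " " $NF
        next
    }
    ($1, $2) in tag {                 # file1 line whose $1 and $2 were seen in file2
        print $0 " " tag[$1, $2]
        next
    }
    { print }                         # header and unmatched file1 lines pass through
' "$tmp/file2" "$tmp/file1")

printf '%s\n' "$result"
rm -rf "$tmp"
```

With the real files the call would be `awk '...' file2 file1`, reading file2 first so the lookup is ready before file1 is processed. If some file2 lines lack the four trailing strings, this sketch would pick up the wrong tokens, so a stricter check on the line would be needed.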



