awk to remove range of fields

07-07-2016

I am trying to cut a range of fields in awk. The code below seems to work for removing field 50, but what is the correct syntax for removing a range ($50 through $62)? Thank you.

Code:

awk 'BEGIN{FS=OFS="\t"}{$50=""; gsub(/\t\t/,"\t")}1' test.vcf.hg19_multianno.txt > output.csv

Maybe:

Code:

awk 'BEGIN{FS=OFS="\t"}{$50:$62=""; gsub(/\t\t/,"\t")}1' test.vcf.hg19_multianno.txt > output.csv

cmccabe

07-07-2016

Code:

awk 'BEGIN{FS=OFS="\t"} {for (i=50; i<=62; i++) $i = ""; gsub(/\t+/,"\t")}1'

(untested)

RudiC

07-07-2016

works great... thank you

cmccabe

07-07-2016

The code that you posted originally and the code suggested by RudiC will remove any empty fields from your input file in addition to the fields you want to remove. The following will only remove fields 50 through 62, inclusive:

Code:

awk '
BEGIN {	FS = OFS = "\t"
}
{	for(i = 1; i <= NF; i++)
		if(i < 50 || i > 62)
			printf("%s%s", $i, (i == NF) ? ORS : OFS)
}' test.vcf.hg19_multianno.txt > output.csv

The above code should do what you want (assuming that you have at least 63 fields in each input line). If some lines have fewer than 63 input fields, slightly different logic would be needed to ensure that each line is properly terminated and that no unneeded field separators are included in the output (once we have a clear description of whether empty fields should be added to the ends of lines with a short field count or omitted).

As always, if you want to try this on a Solaris/SunOS system, change awk to /usr/xpg4/bin/awk or nawk.
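
A small-scale illustration of the points above about empty fields and short lines, using made-up sample data and a narrower range (fields 2 through 3 instead of 50 through 62). The function name `drop_range` and its arguments are hypothetical, and the sketch assumes that short lines should simply be printed without padding. It tracks the separator itself so that every line is terminated, even one that ends inside the removed range:

```shell
# drop_range FROM TO: print tab-separated stdin without fields FROM..TO.
# (Hypothetical helper name; assumes short lines are passed through unpadded.)
drop_range() {
    awk -v from="$1" -v to="$2" '
    BEGIN { FS = OFS = "\t" }
    {
        sep = ""                      # no separator before the first kept field
        for (i = 1; i <= NF; i++) {
            if (i >= from && i <= to)
                continue              # skip fields inside the range
            printf "%s%s", sep, $i
            sep = OFS
        }
        printf "%s", ORS              # always terminate the line
    }'
}

# Field 4 is empty in the first line; the second line has only 2 fields.
printf 'a\tb\tc\t\te\nx\ty\n' | drop_range 2 3
```

The first line should come out as `a`, an empty field, and `e` (three tab-separated fields), and the second as just `x`. Running the same first line through the `gsub(/\t+/,"\t")` approach would collapse it to `a<TAB>e`, losing the empty field.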

Don Cragun

07-07-2016

This might be a little safer if you have empty fields somewhere on the line or lines with fewer than 62 fields:

Code:

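awk -v F=50 -v T=62 '
BEGIN{FS=OFS="\t"}
NF>=F { b=T+1                        # lines shorter than F are left untouched
  t=T<NF?T:NF
  for(i=F;i<NF-t+F;i++) $i=$(b++)   # shift the fields after T left, starting at F
  NF=--i}                            # truncate the record to the fields kept
1'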
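
The shift-and-truncate idea can be tried on a small scale as follows. This is only a sketch: the helper name `cut_range`, the sample data, and the narrower range (fields 2 through 3) are made up for illustration, and it assumes an awk that rebuilds `$0` when `NF` is assigned (gawk and mawk do, as far as I know). Lines with fewer fields than the start of the range are printed unchanged:

```shell
# cut_range FROM TO: shift later fields left over FROM..TO, then shrink NF.
# (Hypothetical helper name; assumes an awk that rebuilds $0 when NF is assigned.)
cut_range() {
    awk -v F="$1" -v T="$2" '
    BEGIN { FS = OFS = "\t" }
    NF >= F {                         # shorter lines have nothing to remove
        last = (T < NF) ? T : NF      # last field that actually gets removed
        n = last - F + 1              # number of fields removed
        for (i = F; i + n <= NF; i++)
            $i = $(i + n)             # move each later field n places left
        NF -= n                       # drop the now-duplicated tail
    }
    1'
}

# Line 1 has an empty 4th field, line 2 ends inside the range, line 3 is short.
printf 'a\tb\tc\t\te\nx\ty\np\n' | cut_range 2 3
```

Because nothing is ever squeezed with `gsub`, the empty field in the first line survives, and the two short lines are neither padded nor left with a trailing tab.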

Chubler_XL

07-08-2016

Alternative?

Code:

perl -F'\t' -lane '$"="\t"; print "@F[0..48,62..$#F]"' test.vcf.hg19_multianno.txt > output.csv
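
Since the input is tab-separated and only whole fields are being dropped, plain `cut` may also be enough. This is an addition to the thread, not something the posters suggested. `cut` uses a tab as its default delimiter and leaves empty fields alone. A scaled-down sketch on made-up data (for the real file the field list would presumably be `-f1-49,63-`):

```shell
# Keep field 1 and fields 4 onward, i.e. drop fields 2 through 3.
# Field 4 is empty in the first line; the second line has only 2 fields.
printf 'a\tb\tc\t\te\nx\ty\n' | cut -f1,4-
```

Short lines are handled without padding: the second sample line simply comes out as `x`.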

Aia

07-08-2016

Thank you all... works great

cmccabe
