Outputting characters after a given string and reporting the characters in the row below --sed Post: 303028771


Post 303028771 by Xterra on Monday 14th of January 2019 02:55:22 PM

01-14-2019


Don,
I modified your script a bit to output the total count and add some formatting:

Code:

awk -v gene="gene-a gene-b" -v lengths="3 6" -v strings="GCATGAAAACATACA TTTCCAGAAATTGT" '
BEGIN {	nString = split(strings, String)
	split(lengths, OutLen)
	split(gene, Id)
	for(i = 1; i <= nString; i++)
		StringLen[i] = length(String[i])
}
/^@/ {	getline CodonLine
	getline
	getline QualityLine
	for(i = 1; i <= nString; i++)
		if(spot = index(CodonLine, String[i]))
			printf("Gene:\t"Id[i]"\tCodon:\t%s\t\tQuality Score:\t%s\t\n",
			    substr(CodonLine, spot + StringLen[i], OutLen[i]),
			    substr(QualityLine, spot + StringLen[i], OutLen[i]))
}' test.txt | awk '
{ count[$0]++ }
END {
	print "\n\t\t\t\tSummary\n#############################################################################\nCount\t\tGene\t\tCodon\t\t\tQuality Score\n"
	for (gene in count)
		print count[gene] "\t" gene | "sort -k 3"
}'

With the above script I am getting the desired output:

Code:

                                Summary
#############################################################################
Count           Gene            Codon                   Quality Score

1       Gene:   gene-a  Codon:  AAC             Quality Score:  ,ED
2       Gene:   gene-a  Codon:  AAC             Quality Score:  GCC
1       Gene:   gene-a  Codon:  TTT             Quality Score:  +GG
2       Gene:   gene-b  Codon:  TCCAAG          Quality Score:  DGGCGG
1       Gene:   gene-b  Codon:  TCCAAG          Quality Score:  G7DCGG

However, when I tried to include the END step in your awk script, I failed miserably. How can I modify the script so that I don't have to "stitch" the two scripts together as shown above?
Thanks!
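
One possible way to fold both steps into a single awk invocation is sketched below. This is only a sketch under some assumptions, not necessarily how the original script's author would do it: instead of printing each report line, the main rule builds it with sprintf() and counts it in an array, and the END block prints the header and the counts. The input file here is a made-up three-record example (the read names, sequences and quality strings are invented for illustration), and the call to fflush() with no arguments assumes a reasonably recent awk (gawk, mawk and current POSIX awk accept it).

```shell
#!/bin/sh
# Hypothetical FASTQ-style input, invented for illustration only.
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
@read1
CCGCATGAAAACATACAAACGG
+
ABCDEFGHIJKLMNOPQGCCRS
@read2
CCGCATGAAAACATACAAACGG
+
ABCDEFGHIJKLMNOPQGCCRS
@read3
TTTCCAGAAATTGTTCCAAGAA
+
ABCDEFGHIJKLMNDGGCGGOP
EOF

out=$(awk -v gene="gene-a gene-b" -v lengths="3 6" \
          -v strings="GCATGAAAACATACA TTTCCAGAAATTGT" '
BEGIN {	nString = split(strings, String)
	split(lengths, OutLen)
	split(gene, Id)
	for(i = 1; i <= nString; i++)
		StringLen[i] = length(String[i])
}
/^@/ {	getline CodonLine
	getline
	getline QualityLine
	for(i = 1; i <= nString; i++)
		if(spot = index(CodonLine, String[i])) {
			# Build the report line and count it instead of printing it.
			line = sprintf("Gene:\t%s\tCodon:\t%s\t\tQuality Score:\t%s\t",
			    Id[i],
			    substr(CodonLine, spot + StringLen[i], OutLen[i]),
			    substr(QualityLine, spot + StringLen[i], OutLen[i]))
			count[line]++
		}
}
END {
	print "\n\t\t\t\tSummary\n#############################################################################\nCount\t\tGene\t\tCodon\t\t\tQuality Score\n"
	# Flush the header so it appears before the output of sort.
	fflush()
	for (line in count)
		print count[line] "\t" line | "sort -k 3"
	close("sort -k 3")
}' "$tmp")
rm -f "$tmp"

printf '%s\n' "$out"
```

The counting array replaces the second awk process, so the only external command left is sort. If the order of the summary lines does not matter, the pipe to sort (and the close() call) could be dropped as well.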



