Adding a prefix to a column using awk/sed commands Post: 302483891

Sponsored Content

Operating Systems Linux Adding a prefix to a column using awk/sed commands Post 302483891 by R0H0N on Wednesday 29th of December 2010 01:04:03 AM

12-29-2010

R0H0N

Guest

Input

Code:

$ cat file
1	.	miRNA	548816	548893	.	+	.	ACC="MI0002023"; ID="dre-mir-155";
1	.	miRNA	1651461	1651541	.	+	.	ACC="MI0002180"; ID="dre-mir-459";
1	.	miRNA	23269491	23269603	.	-	.	ACC="MI0004786"; ID="dre-mir-740";
1	.	miRNA	27656240	27656327	.	+	.	ACC="MI0002052"; ID="dre-mir-218a-2";
1	.	miRNA	34527751	34527843	.	+	.	ACC="MI0004780"; ID="dre-mir-734";
1	.	miRNA	40174414	40174523	.	+	.	ACC="MI0010857"; ID="dre-mir-2197";
1	.	miRNA	46862496	46862635	.	-	.	ACC="MI0001895"; ID="dre-mir-16b";
str .	miRNA	46862739	46862822	.	-	.	ACC="MI0001891"; ID="dre-mir-15a-1";
1	.	miRNA	55355143	55355233	.	-	.	ACC="MI0004765"; ID="dre-mir-722";
2	.	miRNA	1085488	1085564	.	+	.	ACC="MI0002181"; ID="dre-mir-460";
2	.	miRNA	6031391	6031475	.	+	.	ACC="MI0002000"; ID="dre-mir-137-1";
str .	miRNA	22105590	22105669	.	-	.	ACC="MI0004782"; ID="dre-mir-736";
2	.	miRNA	23568780	23568883	.	-	.	ACC="MI0010841"; ID="dre-mir-2190";
2	.	miRNA	25338635	25338716	.	-	.	ACC="MI0001966"; ID="dre-mir-124-1";
2	.	miRNA	31878456	31878533	.	+	.	ACC="MI0001916"; ID="dre-mir-23a-3";
2	.	miRNA	31880346	31880476	.	+	.	ACC="MI0001928"; ID="dre-mir-27a";
2	.	miRNA	34798348	34798457	.	+	.	ACC="MI0010847"; ID="dre-mir-2198";
2	.	miRNA	44164796	44164904	.	-	.	ACC="MI0001366"; ID="dre-mir-181b-1";
2	.	miRNA	57907954	57908073	.	-	.	ACC="MI0001879"; ID="dre-mir-7a-3";

Command

Code:

sed 's/^\([0-9].*\)/char \1/g' file

Output

Code:

char 1	.	miRNA	548816	548893	.	+	.	ACC="MI0002023"; ID="dre-mir-155";
char 1	.	miRNA	1651461	1651541	.	+	.	ACC="MI0002180"; ID="dre-mir-459";
char 1	.	miRNA	23269491	23269603	.	-	.	ACC="MI0004786"; ID="dre-mir-740";
char 1	.	miRNA	27656240	27656327	.	+	.	ACC="MI0002052"; ID="dre-mir-218a-2";
char 1	.	miRNA	34527751	34527843	.	+	.	ACC="MI0004780"; ID="dre-mir-734";
char 1	.	miRNA	40174414	40174523	.	+	.	ACC="MI0010857"; ID="dre-mir-2197";
char 1	.	miRNA	46862496	46862635	.	-	.	ACC="MI0001895"; ID="dre-mir-16b";
str .	miRNA	46862739	46862822	.	-	.	ACC="MI0001891"; ID="dre-mir-15a-1";
char 1	.	miRNA	55355143	55355233	.	-	.	ACC="MI0004765"; ID="dre-mir-722";
char 2	.	miRNA	1085488	1085564	.	+	.	ACC="MI0002181"; ID="dre-mir-460";
char 2	.	miRNA	6031391	6031475	.	+	.	ACC="MI0002000"; ID="dre-mir-137-1";
str .	miRNA	22105590	22105669	.	-	.	ACC="MI0004782"; ID="dre-mir-736";
char 2	.	miRNA	23568780	23568883	.	-	.	ACC="MI0010841"; ID="dre-mir-2190";
char 2	.	miRNA	25338635	25338716	.	-	.	ACC="MI0001966"; ID="dre-mir-124-1";
char 2	.	miRNA	31878456	31878533	.	+	.	ACC="MI0001916"; ID="dre-mir-23a-3";
char 2	.	miRNA	31880346	31880476	.	+	.	ACC="MI0001928"; ID="dre-mir-27a";
char 2	.	miRNA	34798348	34798457	.	+	.	ACC="MI0010847"; ID="dre-mir-2198";
char 2	.	miRNA	44164796	44164904	.	-	.	ACC="MI0001366"; ID="dre-mir-181b-1";
char 2	.	miRNA	57907954	57908073	.	-	.	ACC="MI0001879"; ID="dre-mir-7a-3";

See in the output, the starting string 'str' doesn't replaced with 'char'

R0H0N

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk-adding a column to a file

Hello Friends, i used awk to sum up total size of files under a directory (with the help of examples, threads here). ls -l | awk '/^-/ {total += $5} END {printf "%15.0f\n",total}' >> total.txt After each execution of the script total result is appended into a text file: 7010 7794 8890 ...

2. Shell Programming and Scripting

sed/awk-adding numeric to a column

I have a txt file as follows Code: Oct 1 file1 4144 Oct 1 file23 5170 Oct 2 file5 3434 Oct 21 file56 2343 I need to add a new column by marking the right log file from current directory. For example populate like this. Please not in the second columt for "1" it has...

3. UNIX for Dummies Questions & Answers

Adding a column with the row number using awk

Is there anyway to use awk to add a first column to my data that automatically goes from 1 to n , where n is the numbers of my rows?:confused:

4. Shell Programming and Scripting

Adding column using awk

Hello everyone, I have a file with the following structure: abc xyz 111 222 agf hjhf 787 799 tht yah 878 898 ... ... ... ... ... ... ... ... ... ... ... ... I want to add a column (with a fixed value of 1000) at the end such that it becomes: abc xyz 111 222 1000 agf hjhf 787...

5. Shell Programming and Scripting

Adding prefix to the values in the script

Hi, test.txt contains below values 1 2 3 4 5 Desired output: 'TT.1', 'TT.2', 'TT.3', 'TT.4', 'TT.5' Last value should not contain the comma after the value. Below is the script which i have tried. I'm using Linux. #!/bin/bash for i in $test.txt

6. UNIX for Dummies Questions & Answers

Adding Filename as column using sed

Hi , Can any one please tell me, how can we add the file name as column using sed. right now we are using the below awk command for adding the file name as column but when we are calling this script from datastage it is deleting the file data..very weird raised a support ticket with datastage....

7. Shell Programming and Scripting

AWK adding prefix/suffix to list of strings

75 103 131 133 138 183 197 221 232 234 248 256 286 342 368 389 463 499 524 538

8. Shell Programming and Scripting

Adding a specified value to a specified column - awk?

Hi everyone! I sometimes need to do some simple arithmetics, like adding a number to a certain column of a file. So I wrote a small function in the .bashrc file, which looks like this shifter() { COL=$1 VAL=$2 FILE=$3 cp $FILE $FILE.shifted awk 'NF==4 {$(( $COL )) = $(( $COL ))...

9. UNIX for Dummies Questions & Answers

awk adding counts together from column

Hello Im new treat me nicely, I have a headache :) I have a script that seemed to work now it doesnt anyway, the last part is adding counts of unique items in a csv file eg 05492U34 38 05492U34 47 two columns, (many different values like this in file) i want...

10. Shell Programming and Scripting

Need to add prefix using sed or awk from cat the file

I need the use sed or AWK using cat the file Node1 TDEV RW 1035788 TDEV RW 1035788 Server1 TDEV RW 69053 Server2 TDEV RW 69053 TDEV RW 103579 Server3 TDEV RW 69053 server4 RDF1+TDEV RW 69053 RDF1+TDEV RW 517894 RDF1+TDEV RW 621473 server6 TDEV RW 34526 TDEV RW 34526

LEARN ABOUT DEBIAN

boulder::unigene

Boulder::Unigene(3pm)					User Contributed Perl Documentation				     Boulder::Unigene(3pm)

NAME

       Boulder::Unigene - Fetch Unigene data records as parsed Boulder Stones

SYNOPSIS

	 # parse a file of Unigene records
	 $ug = new Boulder::Unigene(-accessor=>'File',
				    -param => '/data/unigene/Hs.dat');
	 while (my $s = $ug->get) {
	   print $s->Identifier;
	   print $s->Gene;
	 }

	 # parse flatfile records yourself
	 open (UG,"/data/unigene/Hs.dat");
	 local $/ = "*RECORD*";
	 while (<UG>) {
	    my $s = Boulder::Unigene->parse($_);
	    # etc.
	 }

DESCRIPTION

       Boulder::Unigene provides retrieval and parsing services for UNIGENE records

       Boulder::Unigene provides retrieval and parsing services for NCBI Unigene records.  It returns Unigene entries in Stone format, allowing
       easy access to the various fields and values.  Boulder::Unigene is a descendent of Boulder::Stream, and provides a stream-like interface to
       a series of Stone objects.

       Access to Unigene is provided by one accessors, which give access to  local Unigene database.  When you create a new Boulder::Unigene
       stream, you provide the accessors, along with accessor-specific parameters that control what entries to fetch.  The accessors is:

       File
	 This provides access to local Unigene entries by reading from a flat file (typically Hs.dat file downloadable from NCBI's Ftp site).  The
	 stream will return a Stone corresponding to each of the entries in the file, starting from the top of the file and working downward.  The
	 parameter is the path to the local file.

       It is also possible to parse a single Unigene entry from a text string stored in a scalar variable, returning a Stone object.

   Boulder::Unigene methods
       This section lists the public methods that the Boulder::Unigene class makes available.

       new()
	      # Local fetch via File
	      $ug=new Boulder::Unigene(-accessor  =>  'File',
				       -param	  =>  '/data/unigene/Hs.dat');

	   The new() method creates a new Boulder::Unigene stream on the accessor provided.  The only possible accessors is File.  If successful,
	   the method returns the stream object.  Otherwise it returns undef.

	   new() takes the following arguments:

		   -accessor	   Name of the accessor to use
		   -param	   Parameters to pass to the accessor

	   Specify the accessor to use with the -accessor argument.  If not specified, it defaults to File.

	   -param is an accessor-specific argument.  The possibilities is:

	   For File, the -param argument must point to a string-valued scalar, which will be interpreted as the path to the file to read Unigene
	   entries from.

       get()
	   The get() method is inherited from Boulder::Stream, and simply returns the next parsed Unigene Stone, or undef if there is nothing more
	   to fetch.  It has the same semantics as the parent class, including the ability to restrict access to certain top-level tags.

       put()
	   The put() method is inherited from the parent Boulder::Stream class, and will write the passed Stone to standard output in Boulder
	   format.  This means that it is currently not possible to write a Boulder::Unigene object back into Unigene flatfile form.

OUTPUT TAGS

       The tags returned by the parsing operation are taken from the names shown in the Flat file Hs.dat since no better description of them is
       provided yet by the database source producer.

   Top-Level Tags
       These are tags that appear at the top level of the parsed Unigene entry.

       Identifier
	   The Unigene identifier of this entry.  Identifier is a single-value tag.

	   Example:

		 my $identifierNo = $s->Identifier;

       Title
	   The Unigene title for this entry.

	   Example:
		 my $titledef=$s->Title;

       Gene The Gene associated with   this Unigene entry
	   Example:
		 my $thegene=$s->Gene;

       Cytoband The cytological band position of this entry
	   Example:
		 my $thecytoband=$s->Cytoband;

       Counts The number of EST in this record
	   Example:
		 my $thecounts=$s->Counts;

       LocusLink The id of the LocusLink entry associated with this record
	   Example:
		 my $thelocuslink=$s->LocusLink;

       Chromosome This field contains a list, of the chromosomes numbers in which this entry has been linked
	   Example:
		 my @theChromosome=$s->Chromosome;

   STS Multiple records in the form ^STS     ACC=XXXXXX NAME=YYYYYY
       ACC
       NAME

   TXMAP Multiple records in the form  ^TXMAP  XXXXXXX; MARKER=YYYYY; RHPANEL=ZZZZ
       The TXMAP tag points to a Stone record that contains multiple subtags.  Each subtag is the name of a feature which points, in turn, to a
       Stone that describes the feature's location and other attributes.

       Each feature will contain one or more of the following subtags:

       MARKER
       RHPANEL

   PROTSIM Multiple records in the form ^PROTSIM ORG=XXX; PROTID=DBID:YYY; PCT=ZZZ; ALN=QQQQ Where DBID is PID for indicate presence of GenPept
       identifier, SP to indicate SWISSPROT identifier, PIR to indicate PIR identifier, PRF to indicate ???
       ORG
       PROTID
       PCT
       ALN

   SEQUENCE Multiple records in the form ^SEQUENCE ACC=XXX; NID=YYYY; PID = CLONE= END= LID=
       ACC
       NID
       PID
       CLONE
       END
       LID

SEE ALSO

       Boulder, Boulder::Blast, Boulder::Genbank

AUTHOR

       Lincoln Stein <lstein@cshl.org>.  Luca I.G. Toldo <luca.toldo@merck.de>

       Copyright (c) 1997 Lincoln D. Stein Copyright (c) 1999 Luca I.G. Toldo

       This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.  See DISCLAIMER.txt for
       disclaimers of warranty.

perl v5.10.1							    2000-01-18						     Boulder::Unigene(3pm)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk-adding a column to a file

Discussion started by: EAGL�

2. Shell Programming and Scripting

sed/awk-adding numeric to a column

Discussion started by: gubbu

3. UNIX for Dummies Questions & Answers

Adding a column with the row number using awk

Discussion started by: cosmologist

4. Shell Programming and Scripting

Adding column using awk

Discussion started by: ad23

5. Shell Programming and Scripting

Adding prefix to the values in the script

Discussion started by: venkatesht

6. UNIX for Dummies Questions & Answers

Adding Filename as column using sed

Discussion started by: mora

7. Shell Programming and Scripting

AWK adding prefix/suffix to list of strings

Discussion started by: chrisjorg

8. Shell Programming and Scripting

Adding a specified value to a specified column - awk?

Discussion started by: radudownload

9. UNIX for Dummies Questions & Answers

awk adding counts together from column

Discussion started by: aniquebmx

10. Shell Programming and Scripting

Need to add prefix using sed or awk from cat the file

Discussion started by: ranjancom2000

LEARN ABOUT DEBIAN

boulder::unigene