Sponsored Content
Top Forums Shell Programming and Scripting Extracting and copying text from one file to another Post 302964304 by RavinderSingh13 on Wednesday 13th of January 2016 09:57:18 AM
Old 01-13-2016
Hello 4galaxy7,

Could you please try with following and let me know if this helps you.
Let's say following is the Input_file1 and Input_file2.
Code:
cat Input_file1
>TCONS_00000001 gene=XLOC_000001
AATTGTGGTGAAATGACTTCTGTTAACGGAGACATCGATGATTGTTGTTACTATTTGTTCTCAGGATTCA
TTTGTCCGGTTCATACCCCGGACGGCGCCCCTTGCGGGCTGCTCAATCACCTGACAATGAACTGTATCGT
CACGAAGCATCCGGATCGCAAATTAAAGGCTGCGCTACCAACGGTGCTGGTGGATCTAGGAATGCTTCCG
TTGTCTGTTGCGAATAATTGGAAGGACTCGTACACGGTAATGCTGAATGGTAAAGTGATCGGCCTGATCG
AAGATAATATTGTTGATAAGGTGGCCCGCAAACTAAGGCAGCTGAAGATAATTGGTGAAGAGGTGCCGAA
CACGTTGGAGATCGCGCTGGTGCCGAAGAGGAAGG
>TCONS_00000002 gene=XLOC_000002
CGGATGTATATCGTGCCGTGCTTTGATCGTTTATTTGATGTCCCATTTGCTGTTGGACTTGCGGCGGTAT
TGCCGTTGTTCTCGGCCTTGGTCGTGGCCGTGTGTCTTGCGTGTTTAGGTCCGGGCTGTCTTGAGCACCA
ACTTCCAGTGTCGGTAGTGGAGCTCGTGGTTGCAGGGTTTGCTGCCGAGTTCGTTGGGGCGTTTTGATTG
TTAGGCCTCGTGAACTCGTTTTTTTCGACGCAGATATTGATTTCGAAGGTGTGTGTCTCCTTTCCTGCGG
TTGTTTCGTTTGTTTTGTCGTCGACGGCTCGACGTATTTCGTTGTACTTGAGGTGTCTTTGTTTTGTCGA
TCTTTGTTTCGATCGAGTATATTCCCAACGTTGTGGACGTTGGTCTTCATTCTTCTTATTTCAAATATTA
TATTTTTCCGGCGTTCCTCAAGATATTGGAGGCACCGTTGTTCTCTTTCGCGAAGTCGCGTGAACTCTTC
>TCONS_00000002 gene=XLOC_003874
CGGATGTATATCGTGCCGTGCTTTGATCGTTTATTTGATGTCCCATTTGCTGTTGGACTTGCGGCGGTAT
TGCCGTTGTTCTCGGCCTTGGTCGTGGCCGTGTGTCTTGCGTGTTTAGGTCCGGGCTGTCTTGAGCACCA
ACTTCCAGTGTCGGTAGTGGAGCTCGTGGTTGCAGGGTTTGCTGCCGAGTTCGTTGGGGCGTTTTGATTG
TTAGGCCTCGTGAACTCGTTTTTTTCGACGCAGATATTGATTTCGAAGGTGTGTGTCTCCTTTCCTGCGG
TTGTTTCGTTTGTTTTGTCGTCGACGGCTCGACGTATTTCGTTGTACTTGAGGTGTCTTTGTTTTGTCGA
TCTTTGTTTCGATCGAGTATATTCCCAACGTTGTGGACGTTGGTCTTCATTCTTCTTATTTCAAATATTA
TATTTTTCCGGCGTTCCTCAAGATATTGGAGGCACCGTTGTTCTCTTTCGCGAAGTCGCGTGAACTCTTC
  
  
cat Input_file2
XLOC_002667
XLOC_002676
XLOC_003874
XLOC_000001

Then following code may help you.
Code:
awk 'FNR==NR{A[$0];next} ($2 in A){C=1;print $0;next} {if($0 ~ /^>/){C=""}} {if(C){print}}'  Input_file2 FS="="  Input_file1

Output will be as follows.
Code:
>TCONS_00000001 gene=XLOC_000001
AATTGTGGTGAAATGACTTCTGTTAACGGAGACATCGATGATTGTTGTTACTATTTGTTCTCAGGATTCA
TTTGTCCGGTTCATACCCCGGACGGCGCCCCTTGCGGGCTGCTCAATCACCTGACAATGAACTGTATCGT
CACGAAGCATCCGGATCGCAAATTAAAGGCTGCGCTACCAACGGTGCTGGTGGATCTAGGAATGCTTCCG
TTGTCTGTTGCGAATAATTGGAAGGACTCGTACACGGTAATGCTGAATGGTAAAGTGATCGGCCTGATCG
AAGATAATATTGTTGATAAGGTGGCCCGCAAACTAAGGCAGCTGAAGATAATTGGTGAAGAGGTGCCGAA
CACGTTGGAGATCGCGCTGGTGCCGAAGAGGAAGG
>TCONS_00000002 gene=XLOC_003874
CGGATGTATATCGTGCCGTGCTTTGATCGTTTATTTGATGTCCCATTTGCTGTTGGACTTGCGGCGGTAT
TGCCGTTGTTCTCGGCCTTGGTCGTGGCCGTGTGTCTTGCGTGTTTAGGTCCGGGCTGTCTTGAGCACCA
ACTTCCAGTGTCGGTAGTGGAGCTCGTGGTTGCAGGGTTTGCTGCCGAGTTCGTTGGGGCGTTTTGATTG
TTAGGCCTCGTGAACTCGTTTTTTTCGACGCAGATATTGATTTCGAAGGTGTGTGTCTCCTTTCCTGCGG
TTGTTTCGTTTGTTTTGTCGTCGACGGCTCGACGTATTTCGTTGTACTTGAGGTGTCTTTGTTTTGTCGA
TCTTTGTTTCGATCGAGTATATTCCCAACGTTGTGGACGTTGGTCTTCATTCTTCTTATTTCAAATATTA
TATTTTTCCGGCGTTCCTCAAGATATTGGAGGCACCGTTGTTCTCTTTCGCGAAGTCGCGTGAACTCTTC

Thanks,
R. Singh
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Extracting a line in a text file

If my file looks like this…. 10 20 30 and I want to take each line individually and put it in a variable so it can be read later in it's on individual test statement, how can I do that? I guess what I'm asking is how can I extract each line individually. Thanks (5 Replies)
Discussion started by: terryporter51
5 Replies

2. Shell Programming and Scripting

Extracting specific text from a file

Dear All, I have to extract a a few lines from a log file and I know the starting String and end string(WHich is same ). Is there any simplere way using sed - awk. e.g. from the following file -------------------------------------- Some text Date: 21 Oct 2008 Text to be extracted... (8 Replies)
Discussion started by: rahulkav
8 Replies

3. UNIX for Dummies Questions & Answers

extracting text and reusing the text to rename file

Hi, I have some ps files where I want to ectract/copy a certain number from and use that number to rename the ps file. eg: 'file.ps' contains following text: 14 (09 01 932688 0)t the text can be variable, the only fixed element is the '14 ('. The problem is that the fixed element can appear... (7 Replies)
Discussion started by: JohnDS
7 Replies

4. Shell Programming and Scripting

Need help with searching and copying in a text file

Hi, I need help searching through a large text file. I need to find a certain string within the text, and copy each line until another string appears. The file looks like this: >scf15164843 ATTAAAGGNNNGGAATTTCCCCAA ATTACCGGCTTTAAANNNTTACCC >scf15154847 CCGGGNNNTTTAAACCCGNGNGCC... (2 Replies)
Discussion started by: repiv
2 Replies

5. UNIX for Dummies Questions & Answers

Copying and Pasting columns from one text file to another

I have a tab delimited text file that I want to cut columns 3,4,5 from. Then I want to paste these columns into a space delimited text file between columns 2 and 3. I still want to keep the space delimited format in the final text file. How do I go about doing that? Thanks! (1 Reply)
Discussion started by: evelibertine
1 Replies

6. UNIX for Dummies Questions & Answers

Extracting the last column of a text file

I would like to extract the last column of a text file but different rows of the text file have different numbers of columns. How do I go about doing that? Thanks! (1 Reply)
Discussion started by: evelibertine
1 Replies

7. Shell Programming and Scripting

help extracting text from file

Hello I have a large file with lines beginning with 552, 553, 554, below is a small sample, I need to extract the data you can see below highlighted in bold from this file on the same location on every line and output it to a new file. Thank you in advance for any help 55201KL... (2 Replies)
Discussion started by: firefox2k2
2 Replies

8. UNIX for Dummies Questions & Answers

Extracting lines from a text file based on another text file with line numbers

Hi, I am trying to extract lines from a text file given a text file containing line numbers to be extracted from the first file. How do I go about doing this? Thanks! (1 Reply)
Discussion started by: evelibertine
1 Replies

9. Shell Programming and Scripting

Inserting text in file names while copying them.

I'm trying to find a Bourne shell script that will copy files from one directory using a wild card for the file name (*) and add some more characters in the middle of the file name as it is copied. As an example: /u01/tmp-file1.xml => /u02/tmp-file1-20130620.xml /u01/tmp-file2.xml => ... (6 Replies)
Discussion started by: Tony Keller
6 Replies

10. Shell Programming and Scripting

Copying a file to multiple other files using a text file as input

Hello, I have a file called COMPLIST as follows that contains 4 digit numbers.0002 0003 0010 0013 0015 0016 0022 0023 0024 0025 0027 0030 0031 0032 0033 0035 0038 0041 (3 Replies)
Discussion started by: sph90457
3 Replies
Bio::Map::GeneRelative(3pm)				User Contributed Perl Documentation			       Bio::Map::GeneRelative(3pm)

NAME
Bio::Map::GeneRelative - Represents being relative to named sub-regions of a gene. SYNOPSIS
use Bio::Map::GeneRelative; # say that a somthing will have a position relative to the start of the # gene on map my $rel = Bio::Map::GeneRelative->new(-gene => 0); # or that something will be relative to the third transcript of a gene # on a map $rel = Bio::Map::GeneRelative->new(-transcript => 3); # or to the 5th intron of the default transcript $rel = Bio::Map::GeneRelative->new(-intron => [0, 5]); # use the $rel as normal; see L<Bio::Map::Relative> DESCRIPTION
Be able to say that a given position is relative to some standard part of a gene. FEEDBACK
Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists Support Please direct usage questions or support issues to the mailing list: bioperl-l@bioperl.org rather than to the module maintainer directly. Many experienced and reponsive experts will be able look at the problem and quickly address it. Please include a thorough description of the problem with code and data examples if at all possible. Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://redmine.open-bio.org/projects/bioperl/ AUTHOR - Sendu Bala Email bix@sendu.me.uk APPENDIX
The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ new Title : new Usage : my $relative = Bio::Map::Relative->new(); Function: Build a new Bio::Map::Relative object. Returns : Bio::Map::Relative object Args : -gene => int : coordinates are relative to the int'th base downstream of the Position's map's gene [default is gene => 0, ie. relative to the start of the gene], -transcript => int : or relative to the start of the int'th transcript of the Position's map's gene, -exon => [i, n] : or relative to the start of the n'th transcript's i'th exon, -intron => [i, n] : or intron, -coding => int : or the start of the int'th transcript's coding region. -description => string : Free text description of what this relative describes (To say a Position is relative to something and upstream of it, the Position's start() co-ordinate should be set negative) In all cases, a transcript number of 0 means the active transcript. absolute_conversion Title : absolute_conversion Usage : my $absolute_coord = $relative->absolute_conversion($pos); Function: Convert the start co-ordinate of the supplied position into a number relative to the start of its map. Returns : scalar number Args : Bio::Map::PositionI object type Title : type Usage : my $type = $relative->type(); Function: Get the type of thing we are relative to. The types correspond to a method name, so the value of what we are relative to can subsequently be found by $value = $relative->$type; Note that type is set by the last method that was set, or during new(). Returns : 'gene', 'transcript', 'exon', 'intron' or 'coding' Args : none gene Title : gene Usage : my $int = $relative->gene(); $relative->gene($int); Function: Get/set the distance from the start of the gene that the Position's co-ordiantes are relative to. Returns : int Args : none to get, OR int to set; a value of 0 means relative to the start of the gene. transcript Title : transcript Usage : my $int = $relative->transcript(); $relative->transcript($int); Function: Get/set which transcript of the Position's map's gene the Position's co-ordinates are relative to. Returns : int Args : none to get, OR int to set; a value of 0 means the active (default) transcript. exon Title : exon Usage : my ($exon_number, $transcript_number) = @{$relative->exon()}; $relative->exon($exon_number, $transcript_number); Function: Get/set which exon of which transcript of the Position's map's gene the Position's co-ordinates are relative to. Returns : reference to list with two ints, exon number and transcript number Args : none to get, OR int (exon number) AND int (transcript number) to set. The second int is optional and defaults to 0 (meaning default/active transcript). intron Title : intron Usage : my ($intron_number, $transcript_number) = @{$relative->intron()}; $relative->intron($intron_number, $transcript_number); Function: Get/set which intron of which transcript of the Position's map's gene the Position's co-ordinates are relative to. Returns : reference to list with two ints, intron number and transcript number Args : none to get, OR int (intron number) AND int (transcript number) to set. The second int is optional and defaults to 0 (meaning default/active transcript). coding Title : coding Usage : my $transcript_number = $relative->coding; $relative->coding($transcript_number); Function: Get/set which transcript's coding region of the Position's map's gene the Position's co-ordinates are relative to. Returns : int Args : none to get, OR int to set (the transcript number, see transcript()) perl v5.14.2 2012-03-02 Bio::Map::GeneRelative(3pm)
All times are GMT -4. The time now is 11:31 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy