Modifying headers


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers Modifying headers
# 1  
Old 08-15-2011
Modifying headers

I have a FASTA file with thousands of sequences that looks something like this:
Quote:
>LBHCV-1-C1.seq_Contig_1_(1311)
---------TGGATATGATGATGAACTGGTCACCTACAGCAGCCCTGGTGGTGTCGCAGTTACTCCGGATCCCACAAGCCGTCGTGG-ATATGGTGGCGGGGGCCCACTGGGG-AATCCTGGC-GGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGCCTTGATTGTGATGCTGCTATTCGCCGGCGTTGACGGGGACACCTACACAACGGGAGGA GCGGCGGGCAAGAACGCGCAGGGCCTCACGAGCCTCTTTGCAGTCGGGGCGTCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGCT-GGC-ACATCAAA---------
>LBHCV-1-C10.seq_Contig_1_(1312)
--------TTGGATATGATGATGAACTGGTCACCTACAGCAGCCCTGGTGGTGTCGCAGTTACTCCGGATCCCACAAGCCGTCGTGG-ATATGGTGGCGGGGGCCCACTGGGG-AATCCTGGC-GGGACTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTCTTGATTGTGATGCTACTATTCGCCGGCGTTGACGGGGACACCTACACAACGGGGGGA GCGGCGGGCAAGAACGCGCAGGGCCTCACGAGCCTCTTTGCAGTCGGGGCGTCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGCT-GGC-ACATCAAA---------
>LBHCV-1-C11.seq_Contig_1_(1314)
------CTTTGGATATGATGATGAACTGGTCACCTACAGCAGCCCTGGTGGTGTCGCAGTTACTCCGGATCCCACAAGCCGTCGTGG-ATATGGTGGCGGGGGCCCACTGGGG-AATCCTGGC-GGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTCTTGATTGTGATGCTACTATTCGCCGGCGTTGACGGGGACACCTACACAACGGGGGGA GCGGCGGGCAAGAACGCGCAGGGCCTCACGAGCCTCTTTGCAGTCGGGGCGTCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGCT-GGC-ACATCAAA---------
I need to modfy the header in such way that everything after the dot is remove. Thus, I will end up with something like this:
Quote:
>LBHCV-1-C1
---------TGGATATGATGATGAACTGGTCACCTACAGCAGCCCTGGTGGTGTCGCAGTTACTCCGGATCCCACAAGCCGTCGTGG-ATATGGTGGCGGGGGCCCACTGGGG-AATCCTGGC-GGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGCCTTGATTGTGATGCTGCTATTCGCCGGCGTTGACGGGGACACCTACACAACGGGAGGA GCGGCGGGCAAGAACGCGCAGGGCCTCACGAGCCTCTTTGCAGTCGGGGCGTCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGCT-GGC-ACATCAAA---------
>LBHCV-1-C10
--------TTGGATATGATGATGAACTGGTCACCTACAGCAGCCCTGGTGGTGTCGCAGTTACTCCGGATCCCACAAGCCGTCGTGG-ATATGGTGGCGGGGGCCCACTGGGG-AATCCTGGC-GGGACTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTCTTGATTGTGATGCTACTATTCGCCGGCGTTGACGGGGACACCTACACAACGGGGGGA GCGGCGGGCAAGAACGCGCAGGGCCTCACGAGCCTCTTTGCAGTCGGGGCGTCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGCT-GGC-ACATCAAA---------
>LBHCV-1-C11
------CTTTGGATATGATGATGAACTGGTCACCTACAGCAGCCCTGGTGGTGTCGCAGTTACTCCGGATCCCACAAGCCGTCGTGG-ATATGGTGGCGGGGGCCCACTGGGG-AATCCTGGC-GGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTCTTGATTGTGATGCTACTATTCGCCGGCGTTGACGGGGACACCTACACAACGGGGGGA GCGGCGGGCAAGAACGCGCAGGGCCTCACGAGCCTCTTTGCAGTCGGGGCGTCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGCT-GGC-ACATCAAA---------
Thanks
# 2  
Old 08-15-2011
Try:
Code:
perl -i -pe 's/\..*//' file

 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Email Headers

I'm trying to pick up some Unix SysAdmin skills on my own outside of work through the use of the "Unix and Linux System Administrators Handbook." I've found the exercises to be very beneficial, until I came to this.... "What path did the email take? To Whom was it addressed, and to whom was it... (0 Replies)
Discussion started by: ksmarine1980
0 Replies

2. Shell Programming and Scripting

editing headers

Hi, I have a folder that contains many (multiple) files 1.fasta 2.fasta 3.fasta 4.fasta 5.fasta . . 100's of files Each such file have data in the following format for example: vi 1.fasta 58 390 A GTATACATTATTGATGAAGTCCACATGCTTTCTATGGGTGCCTTCAATGCGCTTTTAAAA (7 Replies)
Discussion started by: Lucky Ali
7 Replies

3. Shell Programming and Scripting

Editing headers

Hi, I have a folder that contains many (multiple) files 1.fasta 2.fasta 3.fasta 4.fasta 5.fasta . . 100's of files Each such file have data in the following format for example: vi 1.fasta >AB_1 200bp MLKKPIIIGVTGGSGGGKTSVSRAILDSFPNARIAMIQHDSYYKDQSHMSFEERVKTNYDHPLAFDTDFM... (4 Replies)
Discussion started by: Lucky Ali
4 Replies

4. Programming

c - problem with headers?

I have a simple program to create a poker deck, shuffle it and deal cards. Here it is: #include <stdio.h> #include <stdlib.h> #include <time.h> struct Card { char *face, *suit; }; void fillDeck (Card *deck, char *face, char *suit); void shuffle (Card *deck); void... (4 Replies)
Discussion started by: Luke Bonham
4 Replies

5. IP Networking

Modifying IPv4 headers

Hi, My query is: I need to set the IPv4 header fields(like TOS, TTL, Precedence and Flags) in the linux kernel using the system calls. I tried setting the fields using the setsockopt() call using sockets but it is not reflecting the actual IPv4 header fields. Is it possible to modify these... (0 Replies)
Discussion started by: kiran_4u
0 Replies

6. UNIX for Dummies Questions & Answers

Grep Headers

Hi! Trying to find string and then put the above Headers of corresponding fist line. After executing a Property command a get this output: SP/CH-CH Span Name Type TG Idle InUse OffHk OnHk Ring -------- ------------------------------ ------ ---- ----- ----- ----- 02/01-24 CARRIERSS7... (6 Replies)
Discussion started by: Joel_john
6 Replies

7. Shell Programming and Scripting

Merging of files with different headers to make combined headers file

Hi , I have a typical situation. I have 4 files and with different headers (number of headers is varible ). I need to make such a merged file which will have headers combined from all files (comman coluns should appear once only). For example - File 1 H1|H2|H3|H4 11|12|13|14 21|22|23|23... (1 Reply)
Discussion started by: marut_ashu
1 Replies

8. Shell Programming and Scripting

Remove text between headers while leaving headers intact

Hi, I'm trying to strip all lines between two headers in a file: ### BEGIN ### Text to remove, contains all kinds of characters ... Antispyware-Downloadserver.com (Germany)=http://www.antispyware-downloadserver.c om/updates/ Antispyware-Downloadserver.com #2... (3 Replies)
Discussion started by: Trones
3 Replies

9. Programming

headers of the query

when we are spooling query o/p to certain txt file,in that file how we can get headers in the query.(through unix shell scripting). for exmple q1="slect * from XXXXXX;"; sqlplus XXX/XXXX@XXXXX spool XXXX.txt $q1 spool off in the text file i want the headers of the query..... ... (0 Replies)
Discussion started by: bhagya.puccha
0 Replies

10. Programming

C Headers

Where can i get C/C++ headers for OS MINIX 2.0.3? (0 Replies)
Discussion started by: biosdos
0 Replies
Login or Register to Ask a Question