Shell Programming and Scripting: parse fasta file to tabular file
Post 302584663 by kato on Saturday 24th of December 2011 10:18:08 AM
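You could try replacing each newline with a tab instead of with nothing. With RS=">" the first record (the text before the first ">") is empty, so NR>1 skips it, and sub() drops each record's trailing newline so that no output line ends in a stray tab:
Code:
awk 'BEGIN{RS=">"} NR>1{sub(/\n$/,""); gsub(/\n/,"\t"); print ">"$0}' file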
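If what you are after is a strict two-column table (header, one tab, then the whole sequence on a single line), a variant of the same RS=">" idea might look like the sketch below. It is only a sketch under assumptions: the file name sample.fasta, its contents, and the function name fasta_to_tab are made up for illustration, and it assumes ">" never appears inside a header or sequence line.

```shell
# Hypothetical sample input (file name and contents are invented for this example).
printf '>seq1 first\nACGT\nTTGA\n>seq2\nGGCC\n' > sample.fasta

# Print each FASTA record as: header <TAB> sequence-joined-into-one-string.
# Assumes ">" occurs only at the start of header lines.
fasta_to_tab() {
    awk 'BEGIN { RS = ">"; FS = "\n"; OFS = "\t" }
         NR > 1 {                      # record 1 is the empty text before the first ">"
             seq = ""
             for (i = 2; i <= NF; i++) # field 1 is the header, the rest are sequence lines
                 seq = seq $i
             print ">" $1, seq
         }' "$1"
}

fasta_to_tab sample.fasta
```

With the sample above this should give two lines, ">seq1 first" followed by a tab and ACGTTTGA, and ">seq2" followed by a tab and GGCC. Setting FS to a newline makes every line of a record its own field, so the header and the sequence lines can be handled separately without any gsub.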

 
