10-26-2019
Thanks, would it be possible to retain the '>' at the header of each sequence?
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hopefully someone here can point me in the correct direction.
I'm working on a username migration and am trying to map my users ols usernames to the new ones.
Right now every user has a username of firstname.lastname i.e. john.doe
I'm trying to create a bash or python script that will take... (3 Replies)
Discussion started by: binary-ninja
3 Replies
2. Shell Programming and Scripting
Hi,
I am having a file of dna sequences in fasta format which look like this:
>admin_1_45
atatagcaga
>admin_1_46
atatagcagaatatatat
with many such thousands of sequences in a single file. I want to the replace the accession Id "admin_1_45" similarly in following sequences to... (5 Replies)
Discussion started by: margarita
5 Replies
3. Shell Programming and Scripting
I have two files. File1 is shown below.
>153L:B|PDBID|CHAIN|SEQUENCE
RTDCYGNVNRIDTTGASCKTAKPEGLSYCGVSASKKIAERDLQAMDRYKTIIKKVGEKLCVEPAVIAGIISRESHAGKVL
KNGWGDRGNGFGLMQVDKRSHKPQGTWNGEVHITQGTTILINFIKTIQKKFPSWTKDQQLKGGISAYNAGAGNVRSYARM
DIGTTHDDYANDVVARAQYYKQHGY
>16VP:A|PDBID|CHAIN|SEQUENCE... (7 Replies)
Discussion started by: nelsonfrans
7 Replies
4. Shell Programming and Scripting
I have a fasta file as follows
>sp|O15090|FABP4_HUMAN Fatty acid-binding protein, adipocyte OS=Homo sapiens GN=FABP4 PE=1 SV=3
MCDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDVITIKSESTFKN
TEISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKSTTIKRKREDDKLVVECVM
KGVTSTRVYERA
>sp|L18484|AP2A2_RAT AP-2... (3 Replies)
Discussion started by: alexypaul
3 Replies
5. UNIX for Dummies Questions & Answers
Hi
How can I extract sequences from a fasta file with respect a certain criteria? The beginning of my file (containing in total more than 1000 sequences) looks like this:
>H8V34IS02I59VP
SDACNDLTIALLQIAREVRVCNPTFSFRWHPQVKDEVMRECFDCIRQGLG
YPSMRNDPILIANCMNWHGHPLEEARQWVHQACMSPCPSTKHGFQPFRMA... (6 Replies)
Discussion started by: Marion MPI
6 Replies
6. UNIX for Dummies Questions & Answers
Hi,
I need some help with modifying fasta headers.
I have a fasta file with thousands of contigs and I need to modify their headers with the information obtained from a second file.
File 1 contains the fasta sequences:
>contig0001 length=11115 numreads=10777
agatgtagatctct... (6 Replies)
Discussion started by: Lokaps
6 Replies
7. Shell Programming and Scripting
Hi,
I have a fasta file with multiple sequences. How can i get only unique sequences from the file.
For example
my_file.fasta
>seq1
TCTCAAAGAAAGCTGTGCTGCATACTGTACAAAACTTTGTCTGGAGAGATGGAGAATCTCATTGACTTTACAGGTGTGGACGGTCTTCAGAGATGGCTCAAGCTAACATTCCCTGACACACCTATAGGGAAAGAGCTAAC
>seq2... (3 Replies)
Discussion started by: Ibk
3 Replies
8. UNIX for Beginners Questions & Answers
I could calculate the length of entire fasta sequences by following command,
awk '/^>/{if (l!="") print l; print; l=0; next}{l+=length($0)}END{print l}' unique.fasta
But, I need to calculate the length of a particular fasta sequence specified/listed in another txt file. The results to to be... (14 Replies)
Discussion started by: dineshkumarsrk
14 Replies
9. Shell Programming and Scripting
I've been struggling with this one for quite a while and cannot seem to find a solution for this find/replace scenario. Perhaps I'm getting rusty.
I have a file that contains a number of metrics (exactly 3 fields per line) from a few appliances that are collected in parallel. To identify the... (3 Replies)
Discussion started by: verdepollo
3 Replies
10. UNIX for Beginners Questions & Answers
Hi,
I have to add 7 bases of specific nucleotide at the beginning and ending of all the fasta sequences of a file. For example, I have a multi fasta file namely test.fasta as given below
test.fasta
>TalAA18_Xoo_CIAT_NZ_CP033194.1:_2936369-2939570:+1... (1 Reply)
Discussion started by: dineshkumarsrk
1 Replies
LEARN ABOUT DEBIAN
pynast
VERSION:(1) User Commands VERSION:(1)
NAME
PyNAST - alignment of short DNA sequences
SYNOPSIS
pynast [options] {-i input_fp -t template_fp}
DESCRIPTION
[] indicates optional input (order unimportant) {} indicates required input (order unimportant)
Example usage:
pynast -i my_input.fasta -t my_template.fasta
OPTIONS
--version
show program's version number and exit
-h, --help
show this help message and exit
-t TEMPLATE_FP, --template_fp=TEMPLATE_FP
path to template alignment file [REQUIRED]
-i INPUT_FP, --input_fp=INPUT_FP
path to input fasta file [REQUIRED]
-v, --verbose
Print status and other information during execution [default: False]
-p MIN_PCT_ID, --min_pct_id=MIN_PCT_ID
minimum percent sequence identity to consider a sequence a match [default: 75.0]
-l MIN_LEN, --min_len=MIN_LEN
minimum sequence length to include in NAST alignment [default: 1000]
-m PAIRWISE_ALIGNMENT_METHOD, --pairwise_alignment_method=PAIRWISE_ALIGNMENT_METHOD
method for performing pairwise alignment [default: uclust]
-a FASTA_OUT_FP, --fasta_out_fp=FASTA_OUT_FP
path to store resulting alignment file [default: derived from input filepath]
-g LOG_FP, --log_fp=LOG_FP
path to store log file [default: derived from input filepath]
-f FAILURE_FP, --failure_fp=FAILURE_FP
path to store file of seqs which fail to align [default: derived from input filepath]
-e MAX_E_VALUE, --max_e_value=MAX_E_VALUE
Depreciated. Will be removed in PyNAST 1.2
-d BLAST_DB, --blast_db=BLAST_DB
Depreciated. Will be removed in PyNAST 1.2
SEE ALSO
http://pynast.sourceforge.net
Version: pynast 1.1 August 2011 VERSION:(1)