Hopefully someone here can point me in the correct direction.
I'm working on a username migration and am trying to map my users ols usernames to the new ones.
Right now every user has a username of firstname.lastname i.e. john.doe
I'm trying to create a bash or python script that will take... (3 Replies)
Hi,
I am having a file of dna sequences in fasta format which look like this:
>admin_1_45
atatagcaga
>admin_1_46
atatagcagaatatatat
with many such thousands of sequences in a single file. I want to the replace the accession Id "admin_1_45" similarly in following sequences to... (5 Replies)
I have two files. File1 is shown below.
>153L:B|PDBID|CHAIN|SEQUENCE
RTDCYGNVNRIDTTGASCKTAKPEGLSYCGVSASKKIAERDLQAMDRYKTIIKKVGEKLCVEPAVIAGIISRESHAGKVL
KNGWGDRGNGFGLMQVDKRSHKPQGTWNGEVHITQGTTILINFIKTIQKKFPSWTKDQQLKGGISAYNAGAGNVRSYARM
DIGTTHDDYANDVVARAQYYKQHGY
>16VP:A|PDBID|CHAIN|SEQUENCE... (7 Replies)
I have a fasta file as follows
>sp|O15090|FABP4_HUMAN Fatty acid-binding protein, adipocyte OS=Homo sapiens GN=FABP4 PE=1 SV=3
MCDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDVITIKSESTFKN
TEISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKSTTIKRKREDDKLVVECVM
KGVTSTRVYERA
>sp|L18484|AP2A2_RAT AP-2... (3 Replies)
Hi
How can I extract sequences from a fasta file with respect a certain criteria? The beginning of my file (containing in total more than 1000 sequences) looks like this:
>H8V34IS02I59VP
SDACNDLTIALLQIAREVRVCNPTFSFRWHPQVKDEVMRECFDCIRQGLG
YPSMRNDPILIANCMNWHGHPLEEARQWVHQACMSPCPSTKHGFQPFRMA... (6 Replies)
Hi,
I need some help with modifying fasta headers.
I have a fasta file with thousands of contigs and I need to modify their headers with the information obtained from a second file.
File 1 contains the fasta sequences:
>contig0001 length=11115 numreads=10777
agatgtagatctct... (6 Replies)
Hi,
I have a fasta file with multiple sequences. How can i get only unique sequences from the file.
For example
my_file.fasta
>seq1
TCTCAAAGAAAGCTGTGCTGCATACTGTACAAAACTTTGTCTGGAGAGATGGAGAATCTCATTGACTTTACAGGTGTGGACGGTCTTCAGAGATGGCTCAAGCTAACATTCCCTGACACACCTATAGGGAAAGAGCTAAC
>seq2... (3 Replies)
I could calculate the length of entire fasta sequences by following command,
awk '/^>/{if (l!="") print l; print; l=0; next}{l+=length($0)}END{print l}' unique.fasta
But, I need to calculate the length of a particular fasta sequence specified/listed in another txt file. The results to to be... (14 Replies)
I've been struggling with this one for quite a while and cannot seem to find a solution for this find/replace scenario. Perhaps I'm getting rusty.
I have a file that contains a number of metrics (exactly 3 fields per line) from a few appliances that are collected in parallel. To identify the... (3 Replies)
Hi,
I have to add 7 bases of specific nucleotide at the beginning and ending of all the fasta sequences of a file. For example, I have a multi fasta file namely test.fasta as given below
test.fasta
>TalAA18_Xoo_CIAT_NZ_CP033194.1:_2936369-2939570:+1... (1 Reply)
Discussion started by: dineshkumarsrk
1 Replies
LEARN ABOUT DEBIAN
reprof
REPROF(1) User Commands REPROF(1)NAME
reprof - predict protein secondary structure and solvent accessibility
SYNOPSIS
reprof -i [query.blastPsiMat] [OPTIONS]
reprof -i [query.fasta] [OPTIONS]
reprof -i [query.blastPsiMat|query.fasta] --mutations [mutations.txt] [OPTIONS]
DESCRIPTION
Predict protein secondary structure and solvent accessibility.
Output Format
The output format is self-explanatory, i.e. the colums of the output are described in the output file itself.
OPTIONS -i, --input=FILE
Input BLAST PSSM matrix file (from Blast -Q option) or input (single) FASTA file.
-o, --out=FILE
Either an output file or a directory. If not provided or a directory, the suffix of the input filename (i.e. .fasta or .blastPsiMat) is
replaced to create an output filename.
--mutations=[all|FILE]
Either the keyword "all" to predict all possible mutations or a file containing mutations one per line such as "C12M" for C is mutated
to M on position 12:
C30Y
R31W
G48D
This mutation code is also attached to the output filename using "_". An additional file ending "_ORI" contains the prediction using
no evolutionary information even if a BLAST PSSM matrix was provided.
--modeldir=DIR
Directory where the model and feature files are stored. Default: /usr/share/reprof.
AUTHOR
Peter Hoenigschmid hoenigschmid@rostlab.org, Burkhard Rost
EXAMPLES
Prediction from BLAST PSSM matrix for best results:
reprof -i /usr/share/doc/reprof/examples/example.Q -o /tmp/example.Q.reprof
Prediction from FASTA file:
reprof -i /usr/share/doc/reprof/examples/example.fasta -o /tmp/example.fasta.reprof
Prediction from BLAST PSSM matrix file using the mutation mode:
reprof -i /usr/share/doc/reprof/examples/example.Q -o /tmp/mutations_example.Q.reprof --mutations /usr/share/doc/reprof/examples/mutations.txt
# Result files for the above call are going to be:
# /tmp/mutations_example.Q.{reprof,reprof_F172P,reprof_M1Q,reprof_N34Y,reprof_ORI} - see --mutations for a description of the extensions.
COPYRIGHT
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see <http://www.gnu.org/licenses/>.
BUGS
https://rostlab.org/bugzilla3/enter_bug.cgi?product=reprof
SEE ALSO blast2(1)
http://rostlab.org/
1.0.1 2012-01-13 REPROF(1)