May I paraphrase your request: You want file2's sequences to prevail and file1's sequences inserted only if there's no equivalent in file2. If so, try
with any length fasta sequences. The trailing ">" can be taken care of by piping through | sed '$d' if need be. Please be aware that your desired output's "Contig_98" line is missing an "AG" at the end.
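To illustrate the rule paraphrased above (file2 prevails, file1 only fills the gaps), here is a minimal awk sketch. It is not the command from the original reply. It assumes that "equivalent" means an identical first word on the ">" header line, and it prints file2's records first, followed by the unmatched records from file1. The function name merge_fasta is made up for this example.

```shell
# Hypothetical helper: merge two FASTA files, letting the second one win.
# $1 = file1 (used only for IDs that file2 lacks), $2 = file2 (takes precedence)
merge_fasta() {
    awk '
        # Header line: remember the ID (first word, including the ">").
        /^>/ {
            id = $1
            if (FNR == NR) seen[id] = 1      # first input (file2): record the ID
            else keep = !(id in seen)        # second input (file1): keep only new IDs
        }
        FNR == NR { print; next }            # print every line of file2 as-is
        keep                                 # print file1 lines only while keep is set
    ' "$2" "$1"
}
```

It could be called as merge_fasta file1 file2 > merged.fasta. Because whole records are tracked through the keep flag, sequences that wrap over several lines are handled as well.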
Hi, I have two files where one contains data and the other contains strings, e.g.
file 1
-0.00000 0.00000 0.00000
0.00000 0.00000 0.80000
0.50000 0.50000 0.60000
0.50000 0.50000 0.20000
-0.00000 0.00000 0.40000
file 2
F F F
F F F
T T T
T T T
T T T
How do I append file2 to file 1 to... (1 Reply)
I would like to extract the sequences larger than 10 bases but shorter than 18 along with the identifier from a FASTA file that looks like this:
> Seq I
ACGACTAGACGATAGACGATAGA
> Seq 2
ACGATGACGTAGCAGT
> Seq 3
ACGATACGAT
I know I can extract the IDs alone with the following code
grep... (3 Replies)
I have a fasta file that looks like this:
>Noname
ACCAAAATAATTCATGATATACTCAGATCCATCTGAGGGTTTCACCACTTGTAGAGCTAT
CAGAAGAATGTCAATCAACTGTCCGAGAAAAAAGAATCCCAGG
>Noname
ACTATAAACCCTATTTCTCTTTCTAAAAATTGAAATATTAAAGAAACTAGCACTAGCCTG
ACCTTTAGCCAGACTTCTCACTCTTAATGCTGCGGACAAACAGA
...
I want to... (2 Replies)
I tried to write a script (not working) to append the first value from mylist to a file called myfirstResult and to another called mysecondResult
awk ' {print $1} >> myfirsResult ' < mylist
awk ' {print $1} >> mysecondResult ' < mylist
$ cat mylist
A 02/16/2012
B 02/19/2012
C... (3 Replies)
Hey,
I've been trying to break a massive fasta formatted file into files containing each gene separately. Could anyone help me? I've tried to use the following code but I've received errors every time:
for i in *.rtf.out
do
awk '/^>/{f=++d".fasta"} {print > $i.out}' $i
done (1 Reply)
Hi All,
I have to append 2 lines at the end of a text file. If those 2 lines are already there, do not append them; otherwise append the 2 lines to the text file.
Eg: I have a text file, file.txt
This text file might look like this,
/home/kp/make.jsp
/home/pk/model.jsp
I have to append... (1 Reply)
Hi friends,
My requirement: I have a zip file named, e.g., test_ABC_UH_ccde2a_awdeaea_20150422.zip.
Within it there are subdirectories; each directory again contains .zip files, and those contain files like mama20150422.gz and so on.
I am in need of a bash script so that it unzips... (0 Replies)
Hi,
Could someone help me to prepend a string to all the filenames inside a directory? It should exclude .zip files and subdirectories.
Eg.
file1: test1.log
file2: test2.log
file3: test.zip
After running the script
file1: string_test1.log
file2: string_test2.log
file3:... (4 Replies)
REPROF(1) User Commands REPROF(1)
NAME
reprof - predict protein secondary structure and solvent accessibility
SYNOPSIS
reprof -i [query.blastPsiMat] [OPTIONS]
reprof -i [query.fasta] [OPTIONS]
reprof -i [query.blastPsiMat|query.fasta] --mutations [mutations.txt] [OPTIONS]
DESCRIPTION
Predict protein secondary structure and solvent accessibility.
Output Format
The output format is self-explanatory, i.e. the columns of the output are described in the output file itself.
OPTIONS
-i, --input=FILE
Input BLAST PSSM matrix file (from Blast -Q option) or input (single) FASTA file.
-o, --out=FILE
Either an output file or a directory. If not provided or a directory, the suffix of the input filename (i.e. .fasta or .blastPsiMat) is
replaced to create an output filename.
--mutations=[all|FILE]
Either the keyword "all" to predict all possible mutations or a file containing mutations one per line such as "C12M" for C is mutated
to M on position 12:
C30Y
R31W
G48D
This mutation code is also attached to the output filename using "_". An additional file ending "_ORI" contains the prediction using
no evolutionary information even if a BLAST PSSM matrix was provided.
--modeldir=DIR
Directory where the model and feature files are stored. Default: /usr/share/reprof.
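As an illustration of the --mutations file format described above (one mutation per line, such as "C12M"), the following sketch checks a file before it is passed to reprof. It is not part of reprof. The function name check_mutations is hypothetical, and the pattern assumes single uppercase letters around a numeric position; reprof's own parser may accept or reject other input.

```shell
# Hypothetical pre-flight check for a mutations file (not shipped with reprof).
# Prints every offending line with its line number.
# Returns 0 if all lines look like <letter><position><letter>, 1 otherwise.
check_mutations() {
    # grep -v selects the lines that do NOT match; it succeeds if any exist.
    if grep -Evn '^[A-Z][0-9]+[A-Z]$' "$1"; then
        return 1
    fi
    return 0
}
```

A possible use is check_mutations mutations.txt && reprof -i query.fasta --mutations mutations.txt, where both file names are placeholders.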
AUTHOR
Peter Hoenigschmid hoenigschmid@rostlab.org, Burkhard Rost
EXAMPLES
Prediction from BLAST PSSM matrix for best results:
reprof -i /usr/share/doc/reprof/examples/example.Q -o /tmp/example.Q.reprof
Prediction from FASTA file:
reprof -i /usr/share/doc/reprof/examples/example.fasta -o /tmp/example.fasta.reprof
Prediction from BLAST PSSM matrix file using the mutation mode:
reprof -i /usr/share/doc/reprof/examples/example.Q -o /tmp/mutations_example.Q.reprof --mutations /usr/share/doc/reprof/examples/mutations.txt
# Result files for the above call are going to be:
# /tmp/mutations_example.Q.{reprof,reprof_F172P,reprof_M1Q,reprof_N34Y,reprof_ORI} - see --mutations for a description of the extensions.
COPYRIGHT
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see <http://www.gnu.org/licenses/>.
BUGS
https://rostlab.org/bugzilla3/enter_bug.cgi?product=reprof
SEE ALSO
blast2(1)
http://rostlab.org/
1.0.1 2012-01-13 REPROF(1)