I have two fasta files as shown below,
Both files are having the same fasta headers but vary in terms of sequences. Hence, I need to replace File:2 sequences in File:1 as shown below,
I tried with
command, but it is concatenating all the sequences instead of replacing the sequences as I mentioned above.
Please help me to do the same.
Thank you in advance.
Last edited by dineshkumarsrk; 06-13-2019 at 01:36 AM..
Reason: correction
Hi, I have two files where 1 contains data and the other contains strings eg
file 1
-0.00000 0.00000 0.00000
0.00000 0.00000 0.80000
0.50000 0.50000 0.60000
0.50000 0.50000 0.20000
-0.00000 0.00000 0.40000
file 2
F F F
F F F
T T T
T T T
T T T
How to I append file2 to file 1 to... (1 Reply)
I would like to extract the sequences larger than 10 bases but shorter than 18 along with the identifier from a FASTA file that looks like this:
> Seq I
ACGACTAGACGATAGACGATAGA
> Seq 2
ACGATGACGTAGCAGT
> Seq 3
ACGATACGAT
I know I can extract the IDs alone with the following code
grep... (3 Replies)
I have a fasta file that looks like this:
>Noname
ACCAAAATAATTCATGATATACTCAGATCCATCTGAGGGTTTCACCACTTGTAGAGCTAT
CAGAAGAATGTCAATCAACTGTCCGAGAAAAAAGAATCCCAGG
>Noname
ACTATAAACCCTATTTCTCTTTCTAAAAATTGAAATATTAAAGAAACTAGCACTAGCCTG
ACCTTTAGCCAGACTTCTCACTCTTAATGCTGCGGACAAACAGA
...
I want to... (2 Replies)
I tried to write a script ( not working) to append first value from mylist to a file called my myfirstResult and to another called mysecondResult
awk ' {print $1} >> myfirsResult ' < mylist
awk ' {print $1} >> mysecondResult ' < mylist
$ cat mylist
A 02/16/2012
B 02/19/2012
C... (3 Replies)
Hey,
I've been trying to break a massive fasta formatted file into files containing each gene separately. Could anyone help me? I've tried to use the following code but i've recieved errors every time:
for i in *.rtf.out
do
awk '/^>/{f=++d".fasta"} {print > $i.out}' $i
done (1 Reply)
Hi All,
I have to append 2 lines at the end of a text file. If those 2 lines are already there then do not append else append the 2 lines to the text file.
Eg: I have a text file, file.txt
This text file might look like this,
/home/kp/make.jsp
/home/pk/model.jsp
I have to append... (1 Reply)
Hi frnds,
My requirement is I have a zip file with name say eg: test_ABC_UH_ccde2a_awdeaea_20150422.zip
within that there are subdirectories on each directory we again have .zip files and in that we have files like mama20150422.gz and so on.
Iam in need of a bash script so that it unzips... (0 Replies)
Hii,
Could someone help me to append string to the starting of all the filenames inside a directory but it should exclude .zip files and subdirectories.
Eg.
file1: test1.log
file2: test2.log
file3 test.zip
After running the script
file1: string_test1.log
file2: string_test2.log
file3:... (4 Replies)
ABACAS(1) User Commands ABACAS(1)NAME
abacas - Algorithm Based Automatic Contiguation of Assembled Sequences
SYNOPSIS
abacas -r ref -q qs -p prog [OPTIONS]
OR
abacas -r ref -q psf -e
ref reference sequence in a single fasta file
qs contigs in multi-fasta format
rog MUMmer program to use: 'nucmer' or 'promer'
psf pseudomolecule/ordered sequence file in fasta format
OPTIONS
-h print usage
-d use default nucmer/promer parameters
-s int minimum length of exact matching word (nucmer default = 12, promer default = 4)
-m print ordered contigs to file in multifasta format
-b print contigs in bin to file
-N print a pseudomolecule without "N"s
-i int mimimum percent identity [default 40]
-v int mimimum contig coverage [default 40]
-V int minimum contig coverage difference [default 1]
-l int minimum contig length [default 1]
-t run tblastx on contigs that are not mapped
-g string (file name) print uncovered regions (gaps) on reference to file name
-a append contigs in bin to the pseudomolecule
-o prefix output files will have this prefix
-P pick primer sets to close gaps
-f int number of flanking bases on either side of a gap for primer design (default 350)
-R int Run mummer [default 1, use -R 0 to avoid running mummer]
-e Escape contig ordering i.e. go to primer design
-c Reference sequence is circular
DESCRIPTION
ABACAS is intended to rapidly contiguate (align, order, orientate), visualize and design primers to close gaps on shotgun assembled contigs
based on a reference sequence.
ABACAS uses MUMmer to find alignment positions and identify syntenies of assembled contigs against the reference. The output is then pro-
cessed to generate a pseudomolecule taking overlapping contigs and gaps in to account. ABACAS generates a comparision file that can be used
to visualize ordered and oriented contigs in ACT. Synteny is represented by red bars where colour intensity decreases with lower values of
percent identity between comparable blocks. Information on contigs such as the orientation, percent identity, coverage and overlap with
other contigs can also be visualized by loading the outputted feature file on ACT.
AUTHOR
ABACAS IS Copyright (C) 2008-10 The Wellcome Trust Sanger Institute, Cambridge, UK.
This manual page was written by Andreas Tille <tille@debian.org>, for the Debian project (and may be used by others).
1.3.1 2011-02-11 ABACAS(1)