08-31-2011
Quote:
Thus, "A" should be read as "T"; "C" should be read as "G"; "T" should be converted into "A"; and "G" must be seen as "C" in the reversed-complemented sequence
Is there some algorithm behind this, or is it a fixed mapping?
I mean, are there only A, C, G and T, which always map to T, G, C and A?
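For what it's worth, the pairs in the quote are standard base pairing (A with T, C with G), so this can be treated as a fixed lookup table followed by reversing the string. Here is a minimal Python sketch of that idea. The function name reverse_complement is only illustrative, and it assumes the input uses just the four bases; any other symbol (an N, for example) is passed through unchanged, which may or may not be what you want.

```python
# Fixed complement table for the four DNA bases, upper and lower case.
# Characters that are not in the table are left unchanged (an assumption).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the whole sequence."""
    return seq.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("AACG"))  # CGTT
```

Applying the function twice gives back the original sequence, which is a quick sanity check that the table is symmetric.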
LEARN ABOUT DEBIAN
glam2-purge
GLAM2-PURGE(1) glam2 Manual GLAM2-PURGE(1)
NAME
glam2-purge - Removes redundant sequences from a FASTA file
SYNOPSIS
glam2-purge file score [options]
DESCRIPTION
glam2-purge is a modified version of Andrew Neuwald's purge program that removes redundant sequences from a FASTA file. This is recommended
in order to prevent highly similar sequences distorting the search for motifs. Purge works with either DNA or protein sequences and creates
an output file such that no two sequences have a (gapless) local alignment score greater than a threshold specified by the user. The output
file is named <file>.<score>. The alignment score is based on the BLOSUM62 matrix for proteins, and on a +5/-1 scoring scheme for DNA.
Purge can also be used to mask tandem repeats. It uses the XNU program for this purpose.
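To illustrate what a gapless local alignment score under the +5/-1 DNA scheme means, the following Python sketch scans every diagonal (every fixed offset between two sequences) and keeps the best-scoring ungapped run. This is only an illustration of the scoring idea under stated assumptions, not the code purge actually uses; the function name best_gapless_score and its parameters are hypothetical.

```python
def best_gapless_score(a: str, b: str, match: int = 5, mismatch: int = -1) -> int:
    """Best score of any ungapped local alignment between a and b.

    Illustrative only: +5 per matching base, -1 per mismatch, no gaps.
    """
    best = 0
    # Each offset fixes one diagonal: position i in a is paired with j = i - offset in b.
    for offset in range(-(len(b) - 1), len(a)):
        i = max(offset, 0)
        j = max(-offset, 0)
        running = 0
        while i < len(a) and j < len(b):
            running += match if a[i] == b[j] else mismatch
            # A local alignment may start anywhere, so never carry a negative score.
            running = max(running, 0)
            best = max(best, running)
            i += 1
            j += 1
    return best


if __name__ == "__main__":
    print(best_gapless_score("ACGT", "ACGT"))  # 20
```

In this picture, a pair of sequences whose best score exceeds the user-supplied threshold would count as redundant.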
OPTIONS
-n
Sequences are DNA (default: protein).
-b
Use blast heuristic method (default for protein).
-e
Use an exhaustive method (default for DNA).
-q
Keep first sequence in the set.
-x
Use xnu to mask protein tandem repeats.
SEE ALSO
glam2(1), glam2format(1), glam2mask(1), glam2scan(1), xnu(1)
The full Hypertext documentation of GLAM2 is available online at http://bioinformatics.org.au/glam2/ or on this computer in
/usr/share/doc/glam2/.
REFERENCES
Purge was written by Andy Neuwald and is described in more detail in Neuwald et al., "Gibbs motif sampling: detection of bacterial outer
membrane protein repeats", Protein Science, 4:1618-1632, 1995. Please cite it if you use Purge.
If you use GLAM2, please cite: MC Frith, NFW Saunders, B Kobe, TL Bailey (2008) Discovering sequence motifs with arbitrary insertions and
deletions, PLoS Computational Biology (in press).
AUTHORS
Andrew Neuwald
Author of purge, renamed glam2-purge in Debian.
Martin Frith
Modified purge to be ANSI standard C and improved the user interface.
Timothy Bailey
Modified purge to be ANSI standard C and improved the user interface.
Charles Plessy <plessy@debian.org>
Formatted this manpage in DocBook XML for the Debian distribution.
COPYRIGHT
The source code and the documentation of Purge and GLAM2 are released in the public domain.
GLAM2 1056 05/19/2008 GLAM2-PURGE(1)