Tricky task with DNA sequences. Post: 302551789

GLAM2-PURGE(1)							   glam2 Manual 						    GLAM2-PURGE(1)

NAME

       glam2-purge - Removes redundant sequences from a FASTA file

SYNOPSIS

       glam2-purge file score [options]

DESCRIPTION

       glam2-purge is a modified version of Andrew Neuwald's purge program that removes redundant sequences from a FASTA file. This is recommended
       in order to prevent highly similar sequences from distorting the search for motifs. Purge works with either DNA or protein sequences and
       creates an output file in which no two sequences have a (gapless) local alignment score greater than a threshold specified by the user.
       The output file is named <file>.<score>. The alignment score is based on the BLOSUM62 matrix for proteins, and on a +5/-1 (match/mismatch)
       scoring scheme for DNA. Purge can also be used to mask tandem repeats. It uses the XNU program for this purpose.
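
       The redundancy criterion described above can be illustrated with a short Python sketch. It is not Purge's actual implementation
       (the real program offers a blast heuristic and an exhaustive method); it is a simplified greedy filter that assumes +5/-1 means
       +5 per match and -1 per mismatch. The function names gapless_local_score and purge are hypothetical.

```python
def gapless_local_score(a, b, match=5, mismatch=-1):
    """Best-scoring ungapped local alignment between a and b.

    Every diagonal (relative offset of the two sequences) is scanned, and
    the best-scoring contiguous run on it is found with a running sum that
    resets to zero whenever it goes negative.
    """
    a, b = a.upper(), b.upper()
    best = 0
    for offset in range(-(len(b) - 1), len(a)):
        i, j = max(offset, 0), max(-offset, 0)
        run = 0
        while i < len(a) and j < len(b):
            run = max(0, run + (match if a[i] == b[j] else mismatch))
            best = max(best, run)
            i += 1
            j += 1
    return best


def purge(records, threshold):
    """Greedy filter: keep a sequence only if it scores at or below the
    threshold against every sequence kept so far (illustrative only)."""
    kept = []
    for name, seq in records:
        if all(gapless_local_score(seq, other) <= threshold for _, other in kept):
            kept.append((name, seq))
    return kept


records = [
    ("seq1", "ACGTACGT"),
    ("seq2", "ACGTACGT"),  # identical to seq1, so it is dropped
    ("seq3", "TTTTTTTT"),
]
kept_names = [name for name, _ in purge(records, threshold=20)]
```

       With a threshold of 20, seq2 scores 40 against seq1 and is removed, while seq3 stays because its best ungapped score against
       seq1 is well below the threshold.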

OPTIONS

       -n
	   Sequences are DNA (default: protein).

       -b
	   Use blast heuristic method (default for protein).

       -e
	   Use an exhaustive method (default for DNA).

       -q
	   Keep first sequence in the set.

       -x
	   Use xnu to mask protein tandem repeats.
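
       As a sketch of how the synopsis and options combine, the following Python helper assembles a command line and the output file
       name that the DESCRIPTION section says will be produced. The helper names build_purge_command and expected_output_path are
       hypothetical, and the example file name is made up; only the flags listed above are used.

```python
def build_purge_command(fasta_path, score, dna=False, keep_first=False, mask_repeats=False):
    """Return an argv list following 'glam2-purge file score [options]'."""
    argv = ["glam2-purge", fasta_path, str(score)]
    if dna:
        argv.append("-n")  # sequences are DNA (default: protein)
    if keep_first:
        argv.append("-q")  # keep first sequence in the set
    if mask_repeats:
        argv.append("-x")  # use xnu to mask protein tandem repeats
    return argv


def expected_output_path(fasta_path, score):
    """Output file name as described in the manual: <file>.<score>."""
    return f"{fasta_path}.{score}"


command = build_purge_command("sequences.fa", 100, dna=True, keep_first=True)
output_path = expected_output_path("sequences.fa", 100)
# To actually run it (requires glam2-purge to be installed):
#   import subprocess; subprocess.run(command, check=True)
```

       Building the argument list separately from running it makes the invocation easy to inspect before any file is written.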

SEE ALSO

       glam2(1), glam2format(1), glam2mask(1), glam2scan(1), xnu(1)

       The full Hypertext documentation of GLAM2 is available online at http://bioinformatics.org.au/glam2/ or on this computer in
       /usr/share/doc/glam2/.

REFERENCES

       Purge was written by Andy Neuwald and is described in more detail in Neuwald et al., "Gibbs motif sampling: detection of bacterial outer
       membrane protein repeats", Protein Science, 4:1618-1632, 1995. Please cite it if you use Purge.

       If you use GLAM2, please cite: MC Frith, NFW Saunders, B Kobe, TL Bailey (2008) Discovering sequence motifs with arbitrary insertions and
       deletions, PLoS Computational Biology (in press).

AUTHORS

       Andrew Neuwald
	   Author of purge, renamed glam2-purge in Debian.

       Martin Frith
	   Modified purge to be ANSI standard C and improved the user interface.

       Timothy Bailey
	   Modified purge to be ANSI standard C and improved the user interface.

       Charles Plessy <plessy@debian.org>
	   Formatted this manpage in DocBook XML for the Debian distribution.

COPYRIGHT

       The source code and the documentation of Purge and GLAM2 are released into the public domain.

GLAM2 1056							    05/19/2008							    GLAM2-PURGE(1)