BP_OLIGO_COUNT(1p) User Contributed Perl Documentation BP_OLIGO_COUNT(1p)
NAME
oligo_count - oligo count and frequency
SYNOPSIS
Usage: oligo_count [-h/--help] [-l/--length OLIGOLENGTH]
[-f/--format SEQFORMAT] [-i/--in/-s/--sequence SEQFILE]
[-o/--out OUTFILE]
DESCRIPTION
This script counts the occurrences and frequency of all oligonucleotides of a given length.
It can be used to determine what primers are useful for frequent priming of nucleic acid for random labeling.
Note that the same counts can also be obtained with the compseq program, which is part of EMBOSS.
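The counting described above can be illustrated with a minimal Python sketch. It is not the script's own code: the function name count_oligos is hypothetical, windows are assumed to overlap, and frequency is assumed to mean the count divided by the total number of windows.

```python
from collections import Counter


def count_oligos(sequence, length):
    """Return {oligo: (count, frequency)} for every overlapping window.

    Assumption: frequency = count / total number of windows of this length.
    """
    if length < 1:
        raise ValueError("length must be at least 1")
    seq = sequence.upper()
    # Slide a window of the requested length along the sequence.
    counts = Counter(seq[i:i + length] for i in range(len(seq) - length + 1))
    total = sum(counts.values())
    return {oligo: (n, n / total) for oligo, n in counts.items()}


if __name__ == "__main__":
    # Print a tab-separated table: oligo, count, frequency.
    for oligo, (n, freq) in sorted(count_oligos("ACGTACGTAC", 2).items()):
        print(f"{oligo}\t{n}\t{freq:.3f}")
```

Oligos with the highest frequency are the candidates for frequent priming; sorting the result by count instead of by name would list them first.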
OPTIONS
The default sequence format is fasta. If no outfile is given, the results will be printed to standard output. All other options can be entered
interactively.
FEEDBACK
Mailing Lists
User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the
Bioperl mailing list. Your participation is much appreciated.
bioperl-l@bioperl.org - General discussion
http://bioperl.org/wiki/Mailing_lists - About the mailing lists
Reporting Bugs
Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the
web:
https://redmine.open-bio.org/projects/bioperl/
AUTHOR - Charles C. Kim
Email cckim@stanford.edu
HISTORY
Written July 2, 2001
Submitted to bioperl scripts project 2001/08/06
>> 100 x speed optimization by Heikki Lehvaslaiho
perl v5.14.2 2012-03-02 BP_OLIGO_COUNT(1p)
BP_GCCALC(1p) User Contributed Perl Documentation BP_GCCALC(1p)
NAME
gccalc - GC content of nucleotide sequences
SYNOPSIS
gccalc [-f/--format FORMAT] [-h/--help] filename
or
gccalc [-f/--format FORMAT] < filename
or
gccalc [-f/--format FORMAT] -i filename
DESCRIPTION
This script prints the GC content of every nucleotide sequence in the input file.
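As a rough illustration of the calculation, the following Python sketch reads FASTA records and reports the GC fraction of each. It is not the script's own code: the names read_fasta and gc_content are hypothetical, and dividing by the full sequence length (including any ambiguous bases such as N) is an assumption about how the value is defined.

```python
import sys


def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            fields = line[1:].split()
            # The record name is taken as the first word of the header line.
            name, chunks = (fields[0] if fields else ""), []
        elif line and name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def gc_content(sequence):
    """Return the fraction of G and C bases (assumed: over the full length)."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Reads FASTA from standard input and prints one line per sequence.
    for name, seq in read_fasta(sys.stdin):
        print(f"{name}\t{gc_content(seq):.4f}")
```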
OPTIONS
The default sequence format is fasta.
The sequence input can be provided using any of the three methods:
unnamed argument
gccalc filename
named argument
gccalc -i filename
standard input
gccalc < filename
FEEDBACK
Mailing Lists
User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the
Bioperl mailing list. Your participation is much appreciated.
bioperl-l@bioperl.org - General discussion
http://bioperl.org/wiki/Mailing_lists - About the mailing lists
Reporting Bugs
Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the
web:
https://redmine.open-bio.org/projects/bioperl/
AUTHOR - Jason Stajich
Email jason@bioperl.org
HISTORY
Based on script code (see bottom) submitted by cckim@stanford.edu
Submitted as part of bioperl script project 2001/08/06
perl v5.14.2 2012-03-02 BP_GCCALC(1p)