Extract length-wise sequences from fastq file

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

To extract data of a particular interval (date-time wise)

I want a shell script that extracts data from a log file containing date- and time-wise data. I need the data for a particular interval of time... what can I do?

2. UNIX for Dummies Questions & Answers

Convert a tab delimited/variable length file to fixed length file

Hi, all. I need to convert a tab-delimited, variable-length file in AIX to a fixed-length file delimited by spaces. This is the input file:

10200002<tab>US$ COM<tab>16/12/2008<tab>2,3775<tab>2,3783
19300978<tab>EURO<tab>16/12/2008<tab>3,28523<tab>3,28657

And this is the expected...

3. Shell Programming and Scripting

Extract sequences based on the list

Hi, I have a file with more than 28000 records, and it looks like this:

>mm10_refflat_ABCD range=chr1:1234567-2345678
tgtgcacactacacatgactagtacatgactagac....so on
>mm10_refflat_BCD range=chr1:3234567-4545678...
tgtgcacactacacatgactagtatgtgcacactacacatgactagta . . . . . so on ...

4. Shell Programming and Scripting

Extract substring at specific position and length from file line

Hi gurus, I am trying to figure out how to extract a substring from each line of a file, at a specified position and with a specified length. Example input (file lines) dhaskjdsa dsadhkjsa dhsakjdsad hsadkjh dsahjdksahdsad sahkjd sahdkjsahd sajkdh adhjsak I want to extract substring on...

5. Shell Programming and Scripting

Extract sequences of bytes from binary for different blocks

Hello to all, I would like to search for sequences of bytes inside a big binary file. The bin file contains blocks of information, and each block is structured as follows: 1- Each block begins with the hex 32 (1 byte) and ends with FF. After the FF of the last block, 33 follows. 2- Next...

6. Shell Programming and Scripting

Extract the part of sequences from a file

I have a text file, input.fasta, that contains some protein sequences. input.fasta is shown below.

>P02649
MKVLWAALLVTFLAGCQAKVEQAVETEPEPELRQQTEWQSGQRWELALGRFWDYLRWVQT
LSEQVQEELLSSQVTQELRALMDETMKELKAYKSELEEQLTPVAEETRARLSKELQAAQA
RLGADMEDVCGRLVQYRGEVQAMLGQSTEELRVRLASHLRKLRKRLLRDADDLQKRLAVY...

7. Shell Programming and Scripting

Extract sequences from a FASTA file based on another file

8. Shell Programming and Scripting

Extract count of string in all files and display date-wise

Hi All, hope you all are doing well! I kindly ask you for shell scripting help. Here is the description: I have a huge number of date-wise files, shown below, which contain different strings (numbers, you could say) including 505001 and 602001. ...

9. Shell Programming and Scripting

Outputting sequences based on length with sed

I have this file:

>ID1
AA
>ID2
TTTTTT
>ID-3
AAAAAAAAA
>ID4
TTTTTTGGAGATCAGTAGCAGATGACAG-GGGGG-TGCACCCC

And I am trying to use this script to output sequences longer than 15 characters:

sed -r '/^>/N;{/^.{,15}$/d}'

The desired output would be this: >ID4...

10. UNIX for Beginners Questions & Answers

How to count the length of fasta sequences?

I could calculate the length of all the fasta sequences with the following command:

awk '/^>/{if (l!="") print l; print; l=0; next}{l+=length($0)}END{print l}' unique.fasta

But I need to calculate the length of a particular fasta sequence specified/listed in another txt file. The results to be...

Bio::Index::Fastq(3pm)					User Contributed Perl Documentation				    Bio::Index::Fastq(3pm)

NAME

       Bio::Index::Fastq - Interface for indexing (multiple) fastq files

SYNOPSIS

	   # Complete code for making an index for several
	   # fastq files
	   use Bio::Index::Fastq;
	   use strict;

	   my $Index_File_Name = shift;
	   my $inx = Bio::Index::Fastq->new(
	       '-filename' => $Index_File_Name,
	       '-write_flag' => 1);
	   $inx->make_index(@ARGV);

	   # Print out several sequences present in the index
	   # in Fastq format
	   use Bio::Index::Fastq;
	   use Bio::SeqIO;   # loaded explicitly for the output stream below
	   use strict;

	   my $Index_File_Name = shift;
	   my $inx = Bio::Index::Fastq->new('-filename' => $Index_File_Name);
	   my $out = Bio::SeqIO->new('-format' => 'Fastq','-fh' => *STDOUT);

	   foreach my $id (@ARGV) {
	       my $seq = $inx->fetch($id); # Returns Bio::Seq::Quality object
	       $out->write_seq($seq);
	   }

	   # or, alternatively
	   my $id;
	   my $seq = $inx->get_Seq_by_id($id); #identical to fetch

DESCRIPTION

       Inherits functions for managing dbm files from Bio::Index::Abstract.pm, and provides the basic functionality for indexing fastq files and
       retrieving the sequences from them. Note: for best results 'use strict'.

       Bio::Index::Fastq supports the Bio::DB::BioSeqI interface, meaning it can be used as a sequence database by other parts of bioperl.
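
       As a rough illustration of what indexing a fastq file involves, the sketch below maps each record ID to the byte offset of its
       record and then seeks back to fetch it. It is written in Python, not Perl, and does not use Bioperl or dbm files: build_index and
       fetch are hypothetical names, the index is a plain in-memory dictionary, and strict four-line records are assumed.

```python
import io


def build_index(handle):
    """Map each record ID to the byte offset where its record starts.

    Assumption: strict 4-line FASTQ records (header, sequence, '+', quality).
    """
    index = {}
    while True:
        offset = handle.tell()
        header = handle.readline()
        if not header:
            break
        # Skip the sequence, '+' separator, and quality lines.
        for _ in range(3):
            handle.readline()
        # The ID is the first whitespace-delimited token after the marker character.
        record_id = header[1:].split()[0].decode("ascii")
        index[record_id] = offset
    return index


def fetch(handle, index, record_id):
    """Seek to the stored offset and return the four lines of the record."""
    handle.seek(index[record_id])
    return [handle.readline().decode("ascii").rstrip("\n") for _ in range(4)]


# Made-up data standing in for a fastq file opened in binary mode.
data = b"@read1 first\nACGT\n+\nIIII\n@read2\nGGCCTA\n+\nIIIIII\n"
fh = io.BytesIO(data)
idx = build_index(fh)
print(idx)                      # {'read1': 0, 'read2': 25}
print(fetch(fh, idx, "read2"))  # ['@read2', 'GGCCTA', '+', 'IIIIII']
```

       Storing offsets rather than the records themselves keeps the index small, and a lookup costs one seek instead of a scan of the
       whole file.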

FEED_BACK
   Mailing Lists
       User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to one
       of the Bioperl mailing lists.  Your participation is much appreciated.

	 bioperl-l@bioperl.org			- General discussion
	 http://bioperl.org/wiki/Mailing_lists	- About the mailing lists

   Support
       Please direct usage questions or support issues to the mailing list:

       bioperl-l@bioperl.org

       rather than to the module maintainer directly. Many experienced and responsive experts will be able to look at the problem and quickly address
       it. Please include a thorough description of the problem with code and data examples if at all possible.

   Reporting Bugs
       Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution.  Bug reports can be submitted via the
       web:

	 https://redmine.open-bio.org/projects/bioperl/

AUTHOR - Tony Cox
       Email - avc@sanger.ac.uk

APPENDIX

       The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _

   _file_format
	Title	: _file_format
	Function: The file format for this package, which is needed
		  by the SeqIO system when reading the sequence.
	Returns : 'Fastq'

   _index_file
	 Title	 : _index_file
	 Usage	 : $index->_index_file( $file_name, $i )
	 Function: Specialist function to index FASTQ format files.
		   Is provided with a filename and an integer
		   by make_index in its SUPER class.
	 Example :
	 Returns :
	 Args	 :

   id_parser
	 Title	 : id_parser
	 Usage	 : $index->id_parser( CODE )
	 Function: Stores or returns the code used by record_id to
		   parse the ID for record from a string.  Useful
		   for (for instance) specifying a different
		   parser for different flavours of FASTQ file.
		   Returns \&default_id_parser (see below) if not
		   set. If you supply your own id_parser
		   subroutine, then it should expect a fastq
		   description line.  An entry will be added to
		   the index for each string in the list returned.
	 Example : $index->id_parser( \&my_id_parser )
	 Returns : ref to CODE if called without arguments
	 Args	 : CODE
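
	 The point that one index entry is added for each string returned can be sketched as follows. This is a Python illustration
	 of the idea only, not the module's code: my_id_parser, the alias= header field, and the dictionary index are all assumptions
	 made for the example.

```python
import re


def my_id_parser(header):
    """Hypothetical parser: return every key a record should be indexed under."""
    # Assumed header layout: '@<id>' optionally followed by 'alias=<other_id>'.
    match = re.match(r"^@\s*(\S+)(?:\s+alias=(\S+))?", header)
    if not match:
        return []
    return [group for group in match.groups() if group is not None]


# Made-up (offset, header) pairs standing in for records found while indexing.
index = {}
for offset, header in [(0, "@read1 alias=r1"), (25, "@read2")]:
    for key in my_id_parser(header):
        index[key] = offset  # one entry per returned string

print(index)  # {'read1': 0, 'r1': 0, 'read2': 25}
```

	 Both read1 and r1 point at the same record, so either ID can be used for the lookup.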

   default_id_parser
	 Title	 : default_id_parser
	 Usage	 : $id = default_id_parser( $header )
	 Function: The default Fastq ID parser for Fastq.pm
		   Returns $1 from applying the regexp /^>\s*(\S+)/
		   to $header.
	 Returns : ID string
	 Args	 : a fastq header line string
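
	 With the regular-expression escapes written out, the pattern above reads ^>\s*(\S+): a leading '>', optional whitespace,
	 then the first run of non-whitespace characters, which is captured as the ID. A small Python sketch of that pattern's
	 behaviour follows. The function here is a re-implementation for illustration, not the Perl subroutine, and the header
	 strings are made up.

```python
import re


def default_id_parser(header):
    """Return the first non-whitespace token after a leading '>', or None."""
    match = re.match(r"^>\s*(\S+)", header)
    return match.group(1) if match else None


print(default_id_parser(">read1 sample description"))  # read1
print(default_id_parser("> read2"))                     # read2
print(default_id_parser("@read3 other marker"))         # None
```

	 The last call returns None because the pattern as documented anchors on '>', so a header string beginning with any other
	 character does not match it.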

perl v5.14.2							    2012-03-02						    Bio::Index::Fastq(3pm)
