Extract sequences of bytes from binary for different blocks Post: 302843499

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Remove first N bytes and last N bytes from a binary file on AIX.

Hi all, can anybody guide me on how to remove the first N bytes and the last N bytes from a binary file? Is there an AWK or SED command, or any other command, that I can use to achieve this? Your help is greatly appreciated!! Best Regards, Naveen.

2. UNIX for Advanced & Expert Users

Deal with binary sequences

Hello, I need to deal with binary sequences and have a few questions. 1- Does any UNIX scripting language provide a tool or command for converting text data to binary sequences? Example of binary sequence: "0x97 0x93 0x85 0x40 0xd5 0xd6 0xd7" 2- If I want...

3. Shell Programming and Scripting

Extract sequence blocks

Hi, I have a one-line file consisting of a sequence of 660 letters. I would like to extract 9-letter blocks iteratively: ASDFGHJKLQWERTYUIOPZXCVBNM first block: ASDFGHJKL 2nd block: SDFGHJKLQ What I have so far only gives me the first block; can anyone please explain why? cat...

4. Shell Programming and Scripting

extract blocks of text from a file

Hi, This is part of a large text file I need to separate out. I'd like some help to build a shell script that will extract the text between sets of dashed lines, write that to a new file using the whole or part of the first text string as the new file name, then move on to the next one and...

5. Linux

Why does ext3 allocate 8 blocks for files that are few bytes long

The title is clear: why does ext3 allocate 8 blocks for files that are only a few bytes long? If I create a file named "test", put a few chars in it, and then run: stat test I get "Blocks: 8". I searched the web and found that ext does that: it allocates 8 blocks even if it doesn't need...

6. UNIX for Dummies Questions & Answers

X bytes of 0, Y bytes of random data, Z bytes of 5, T bytes of 1. ??

Hello guys. I really hope someone will help me with this one. I have to write a script that: - creates a file home/student/vmdisk of 10 MB - formats that file as ext3 - mounts that partition at /mnt/partition - creates a file /mnt/partition/data. In this file, there will...

7. Shell Programming and Scripting

Extract sequences based on the list

Hi, I have a file with more than 28000 records, and it looks like this: >mm10_refflat_ABCD range=chr1:1234567-2345678 tgtgcacactacacatgactagtacatgactagac....so on >mm10_refflat_BCD range=chr1:3234567-4545678... tgtgcacactacacatgactagtatgtgcacactacacatgactagta . . . . . so on ...

8. Shell Programming and Scripting

Extract length wise sequences from fastq file

I have a fastq file from small RNA sequencing with sequence lengths between 15 and 30. I want to keep only the sequences with lengths between 21 and 25 and write them to another fastq file. How can I do that?

9. Shell Programming and Scripting

Extract the part of sequences from a file

I have a text file, input.fasta, that contains some protein sequences. input.fasta is shown below. >P02649 MKVLWAALLVTFLAGCQAKVEQAVETEPEPELRQQTEWQSGQRWELALGRFWDYLRWVQT LSEQVQEELLSSQVTQELRALMDETMKELKAYKSELEEQLTPVAEETRARLSKELQAAQA RLGADMEDVCGRLVQYRGEVQAMLGQSTEELRVRLASHLRKLRKRLLRDADDLQKRLAVY...

10. Shell Programming and Scripting

Blocks of text in a file - extract when matches...

I sat down yesterday to write this script and have just realised that my methodology is broken........ In essence I have..... ----------------------------------------------------------------- (This line really is in the file) Service ID: 12345 ...
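
Two of the questions listed above (removing N bytes from each end of a binary file, and extracting overlapping 9-letter blocks) come down to plain slicing. The following is a minimal Python sketch, not taken from any of the threads; the function names trim_bytes, trim_file and sliding_blocks are made up for illustration, and trim_file assumes the file is small enough to read into memory.

```python
from typing import List


def trim_bytes(data: bytes, n: int) -> bytes:
    """Return data without its first n and last n bytes."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if 2 * n >= len(data):
        # Nothing is left once both ends are removed.
        return b""
    return data[n:len(data) - n]


def trim_file(src: str, dst: str, n: int) -> None:
    """Copy src to dst, dropping n bytes from each end.

    Hypothetical helper: reads the whole file into memory.
    """
    with open(src, "rb") as fin:
        data = fin.read()
    with open(dst, "wb") as fout:
        fout.write(trim_bytes(data, n))


def sliding_blocks(text: str, size: int) -> List[str]:
    """Return every block of `size` characters, advancing one character at a time."""
    if size <= 0:
        raise ValueError("size must be positive")
    return [text[i:i + size] for i in range(len(text) - size + 1)]


blocks = sliding_blocks("ASDFGHJKLQWERTYUIOPZXCVBNM", 9)
print(blocks[0])  # ASDFGHJKL
print(blocks[1])  # SDFGHJKLQ
```

Opening the file in binary mode ("rb"/"wb") matters here: text mode could alter or reject bytes that are not valid text.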


srf2fastq

srf2fastq(1)							   Staden io_lib						      srf2fastq(1)

NAME

       srf2fastq - Converts SRF files to Sanger fastq format

SYNOPSIS

       srf2fastq  [options] srf_archive ...

DESCRIPTION

       srf2fastq extracts sequences and qualities from one or more SRF archives and writes them in Sanger fastq format to stdout.

       Note  that  Illumina also have a fastq format (used in the GERALD directories) which differs slightly in the use of log-odds scores for the
       quality values. The format described here is using the traditional Phred style of quality encoding.
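
The difference between the two quality encodings can be illustrated with a short Python sketch. It is not part of srf2fastq; it uses the commonly documented definitions (Phred Q = -10 log10(p), log-odds Q = -10 log10(p / (1 - p)), and an ASCII offset of 33 for Sanger fastq), and all function names are made up for illustration.

```python
import math
from typing import List

SANGER_OFFSET = 33  # ASCII offset conventionally used by Sanger-style fastq


def phred_from_error(p: float) -> float:
    """Phred-style quality for an error probability p (0 < p <= 1)."""
    return -10.0 * math.log10(p)


def logodds_from_error(p: float) -> float:
    """Log-odds quality for an error probability p (0 < p < 1)."""
    return -10.0 * math.log10(p / (1.0 - p))


def phred_from_logodds(q: float) -> float:
    """Convert a log-odds quality value to the Phred scale."""
    return 10.0 * math.log10(10.0 ** (q / 10.0) + 1.0)


def encode_sanger(quals: List[int]) -> str:
    """Encode integer Phred values as a Sanger fastq quality string."""
    for q in quals:
        if not 0 <= q <= 93:
            raise ValueError(f"quality {q} is outside the printable range 0..93")
    return "".join(chr(q + SANGER_OFFSET) for q in quals)


def decode_sanger(text: str) -> List[int]:
    """Decode a Sanger fastq quality string into integer Phred values."""
    return [ord(c) - SANGER_OFFSET for c in text]


print(encode_sanger([0, 10, 40]))  # !+I
```

The two scales agree closely for high qualities and diverge for low ones, which is why the distinction matters mostly for poor-quality bases.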

OPTIONS

       -c     Outputs calibrated confidence values using the ZTR CNF1 chunk type for a single quality per base. Without this option, the original
              Illumina _prb.txt values, consisting of four quality values per base and stored in the ZTR CNF4 chunks, are used.

       -C     Masks out sequences tagged as bad quality.

       -s root
              Generates files on disk with filenames starting with root, one file per non-explicit element in the SRF/ZTR region (REGN) chunk.
              Typically this results in two files for paired end runs. The filename suffixes come from the names listed in the SRF region chunks.
              This option conflicts with the -S parameter.

       -S     Splits sequences into regions, but sequentially lists each sequence region to stdout instead of splitting to separate files on disk.
	      This option conflicts with the -s parameter.

       -n     When using -s the filename suffixes are simply numbered (starting with 1) instead of using  the  names  listed  in  the  SRF  region
	      chunks.

       -a     Appends the region index to the sequence names, i.e. generates "name/1" and "name/2" for a paired read.

       -e     Includes any explicit sequence (ZTR region chunk of type 'E') in the sequence output. The explicit sequence is also included in the
              quality line. Currently this is utilised by ABI SOLiD to store the last base of the primer.

       -r region list
              Reverse complements the sequence and reverses the quality values for all regions in the region list. This is a comma separated list
              of integer values enumerating the regions, starting from 1. Note that this option only works when either -s or -S is specified.
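
What -r does to a single region can be sketched in Python as follows. This is an illustration of the operation as described above, not srf2fastq's own code; the function names are made up, and how srf2fastq treats characters other than A, C, G, T and N is not stated here, so this sketch simply leaves them unchanged.

```python
from typing import Tuple

# Complement table for the plain bases; any other character maps to itself.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def reverse_region(seq: str, qual: str) -> Tuple[str, str]:
    """Reverse complement a sequence and reverse its quality string to match."""
    if len(seq) != len(qual):
        raise ValueError("sequence and quality strings must have equal length")
    return reverse_complement(seq), qual[::-1]


print(reverse_region("ACGTN", "IIH#!"))  # ('NACGT', '!#HII')
```

The quality string is only reversed, never complemented, so that each quality value stays attached to its base.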

EXAMPLES

       To extract only the good quality sequences from all srf files in the current directory using calibrated confidence values (if available).

	   srf2fastq -c -C *.srf > runX.fastq

       To extract a paired end run into two separate files with sequences named name/1 and name/2.

	   srf2fastq -s runX -a -n runX.srf

       To extract a paired end run as a single file, alternating forward and reverse sequences, with the second read being reverse complemented.

	   srf2fastq -S -r 2 runX.srf > runX.fastq
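
       If the alternating output later needs to be separated again, a small script can do it. The Python sketch below is not part of io_lib. It assumes four-line fastq records and strict forward/reverse alternation, and the function names are made up for illustration.

```python
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str, str, str]  # header, sequence, separator, quality


def read_fastq(lines: Iterable[str]) -> Iterator[Record]:
    """Yield four-line fastq records; assumes no line wrapping inside a record."""
    stripped = [line.rstrip("\n") for line in lines]
    if len(stripped) % 4 != 0:
        raise ValueError("line count is not a multiple of four")
    for i in range(0, len(stripped), 4):
        header, seq, plus, qual = stripped[i:i + 4]
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError(f"malformed record starting at line {i + 1}")
        yield header, seq, plus, qual


def split_pairs(records: Iterable[Record]) -> Tuple[List[Record], List[Record]]:
    """Split strictly alternating records into (forward, reverse) lists."""
    recs = list(records)
    if len(recs) % 2 != 0:
        raise ValueError("odd number of records; reads are not paired")
    return recs[0::2], recs[1::2]
```

For a file, open it in text mode and pass the file object directly, for example split_pairs(read_fastq(open("runX.fastq"))).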

AUTHOR

       James Bonfield, Steven Leonard - Wellcome Trust Sanger Institute

								    December 10 						      srf2fastq(1)
