Quick question,
File A has one line (10 charachters), File B has one line (10 charachters).
how do a concat these two files together to produce a file (File C) that has the contents of Files A + B together onto one line of 20 Charachters
if I use:
cat FileA FileB > FileC
I get the... (5 Replies)
Hi,
I was a typical Windows guy. Like to do things just by clicking my mouse:cool:. I got a new job now...where they are big on unix.
I am trying to wet my fingures now with unix. Haven't taken the dive yet.
I am trying to find a solution for this problem.
Please help me with some... (4 Replies)
i have data files in different directories like /jadat
/cadat etcc... i have list of this directories in a file files.
content of files:
ja
ca
directories:
/jadat
/cadat
file in each of these directories
natt_trans_like.dat
i need to loop though each directory for a... (1 Reply)
Hi all,
I have a directory with many subdirectories each named like so: KOG0001, KOG0002, ...KOG9999.
Each of these subdirectories contain a variable number two kinds of files (nuc and prot) named like so: Capitella_sp_nuc_hits.fasta (nuc) and Capitella_sp_prot_hits.fasta (prot). The... (2 Replies)
Hi, sometimes one wants to edit files while still seeing output of earlier commands in terminal. I've found out that cat test && cat - >> test does the trick for displaying file content and adding lines but I believe I saw a much cooler command that was also able to erase lines from files. I cannot... (6 Replies)
I am concatenating txt-files using cat:
cat *.txt > file.dat
However, the same directory has the installation instructions included, which is also a txt file: install.txt
I currently have the install.txt file renamed to install._txt, but I prefer a solution using regular expressions.
Is there... (5 Replies)
I want to printing into two files under difference situation.
For example, file 1 name.txt
>gma-miR172a Glyma02g28845
>gma-miR1513a-3p Glyma02g15840
>gma-miR166a-5p Glyma02g15840
>gma-miR1530 Glyma02g15130
>gma-miR1507a Glyma02g01841
File 2 a.gff
Glyma01g07930 ... (4 Replies)
I am trying to cat 3 files. They all have the same name MFExtract.txt. But, they are in 3 seperate file names. Any idea how I could cat the 3 together. It's throwing me off figuring out a way because the names are the same. I was thinking just doing a 3 cat commands with the full path names and... (2 Replies)
I am new to Linux and I am trying to cat only N files in a folder. N is dynamically given number at runtime.
If I give N as 2 then cat only 2 files in the folder
and If I give N as 5 then cat only 5 files in the folder.
Is there any way to do that? (6 Replies)
Discussion started by: KMusunuru
6 Replies
LEARN ABOUT DEBIAN
srf2fastq
srf2fastq(1) Staden io_lib srf2fastq(1)NAME
srf2fastq - Converts SRF files to Sanger fastq format
SYNOPSIS
srf2fastq [options] srf_archive ...
DESCRIPTION
srf2fastq extracts sequences and qualities from one or more SRF archives and writes them in Sanger fastq format to stdout.
Note that Illumina also have a fastq format (used in the GERALD directories) which differs slightly in the use of log-odds scores for the
quality values. The format described here is using the traditional Phred style of quality encoding.
OPTIONS -c Outputs calibrated confidence values using the ZTR CNF1 chunk type for a single quality per base. Without this use the original
Illumina _prb.txt files consisting of four quality values per base, stored in the ZTR CNF4 chunks.
-C Masks out sequences tagged as bad quality.
-s root
Generates files on disk with filenames starting root, one file per non-explicit element in the SRF/ZTR region (REGN) chunk. Typi-
cally this results in two files for paired end runs. The filename suffixes come from the names listed in the SRF region chunks.
This option conflicts with the -S parameter.
-S Splits sequences into regions, but sequentially lists each sequence region to stdout instead of splitting to separate files on disk.
This option conflicts with the -s parameter.
-n When using -s the filename suffixes are simply numbered (starting with 1) instead of using the names listed in the SRF region
chunks.
-a Appends region index to the sequence names. Ie generate "name/1" and "name/2" for a paired read.
-e Include any explicit sequence (ZTR region chunk of type 'E') in the sequence output. The explicit sequence is also included in the
quality line too. Currently this is utilised by ABI SOLiD to store the last base of the primer.
-r region list
Reverse complements the sequence and reverses the quality values for all regions in the region list. This is a comma separated list
of integer values enumerating the regions, starting from 1. Note that this option only works when either -s or -S are specified.
EXAMPLES
To extract only the good quality sequences from all srf files in the current directory using calibrated confidence values (if available).
srf2fastq -c -C *.srf > runX.fastq
To extract a paired end run into two separate files with sequences named name/1 and name/2.
srf2fastq -s runX -a -n runX.srf
To extract a paired end run as a single file, alternating forward and reverse sequences, with the second read being reverse complemented.
srf2fastq -S -r 2 runX.srf > runX.fastq
AUTHOR
James Bonfield, Steven Leonard - Wellcome Trust Sanger Institute
December 10 srf2fastq(1)