06-22-2010
trimming sequences
My file looks like this:
Quote:
>GHXCZCC01AJ8CJ
TTGATGTGCCAGCTGCCGTTGGTGTGTATCAGCTGGATTTTCTGGGACGC
CCCGGGGC
>GHXCZCC01APUO5
TGATGTGCCAGCTGCCGTTGGTGTGTATCAGCTGGATTTCTGGGACGCCC
CGGGGCGA
>GHXCZCC01AQSRP
TTGATGTTGCCAGCTGCCGTTGGTGTGTATCAGCTGGATTTTCTGGGACG
CCCCGGGG
But I would like to 'trim' all sequences to the same lenght 32 characters, keeping intact all the identifier (>GHXCZCC01AJ8CJ)
Quote:
>GHXCZCC01AJ8CJ
TTGATGTGCCAGCTGCCGTTGGTGTGTATCAG
CCCGGGGC
>GHXCZCC01APUO5
TGATGTGCCAGCTGCCGTTGGTGTGTATCAGC
CGGGGCGA
>GHXCZCC01AQSRP
TTGATGTTGCCAGCTGCCGTTGGTGTGTATCA
CCCCGGGG
Would it be possible to use awk to perform this task?
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi everyone
I have this script that appends a line to a file to log the running status of an application. I need to write another script to run as a scheduled job in cron to trim the first x number of lines of this file.
Could someone give me an idea how to do this?
Regards (1 Reply)
Discussion started by: alwayslearningunix
1 Replies
2. Shell Programming and Scripting
Hi,
I am trying to find a script command that will let me trim leading and trailing space from a string. I have coded a SQL Select and sending the output to a file. Later I am parsing the file and reading each field. The problem is that each field uses the same size as the DB2 type it was defined... (2 Replies)
Discussion started by: fastgoon
2 Replies
3. UNIX for Advanced & Expert Users
Hi,
I want to trim +with leading zero's with amount fields.I know using awk for trimming leading zeros with +,but I want get the entire row itself.
cat file_name |awk -F " " '{printf "%14.4f%f\n",$4}'
ex:
10 xyz bc +00000234.4500
20 yzx foxic +002456.000
Expexted
10 xyz bc... (3 Replies)
Discussion started by: mohan705
3 Replies
4. UNIX for Advanced & Expert Users
Hi,
How can I remove the unwanted spaces in the line.
123456 789 ABC DEF. - I wanna remove the sapces in this line, I need the output 123456789ABCDEF.
Pls help me...... (3 Replies)
Discussion started by: sharif
3 Replies
5. Shell Programming and Scripting
hi have output as
i have trim of lines before CREATE statement and lins after last ")"
any idea how to achieve it ? (3 Replies)
Discussion started by: crackthehit007
3 Replies
6. Shell Programming and Scripting
I'm trying to parse an output log and I've managed to reduce the output to the lines I need. But I'm having trouble pulling out only the info I'm interested in. The output is 40+ lines and here is a sample
Installing AppFresh 0.8.5.pkg from ./InstallerFiles/CustomPKG/26 (26)
Installing... (2 Replies)
Discussion started by: kaltekar
2 Replies
7. Shell Programming and Scripting
My files look like this
And I need to cut the sequences at the last "A" found in the following 'pattern' -highlighted for easier identification, the pattern is the actual file is not highlighted.
The expected result should look like this
Thus, all the sequences would end with AGCCCTA... (2 Replies)
Discussion started by: Xterra
2 Replies
8. Shell Programming and Scripting
My file looks something like this
Wnat I need is to look for the Reference sequence (">Reference1") and based on the length of that sequence trim all the entries in that file. So, the rersulting file will contain all sequences with the same length, like this
Thus, all sequences will keep... (5 Replies)
Discussion started by: Xterra
5 Replies
9. Shell Programming and Scripting
Hi I need to trim white spaces from strings in a file.
Input file is like this:
1_rrc_CatalogGroups.csv = 607
1_rrc_Sales_TopCatalogGroups.csv = 4
1_rrc_Sales_CatalogEntries_CatalogGroup_Rel.csv = 7
Need to trim space before and after = symbol.
This is my script:
#!/usr/bin/ksh
... (2 Replies)
Discussion started by: sukhdip
2 Replies
10. UNIX for Dummies Questions & Answers
My files look like this
I need to remove the sequence GGGAAA and anything before that
I also need to remove the sequence AGCCCTA and anything after that
So I will end up with something like this
The left side is done but I cannot get the right side correctly. I would like to use... (3 Replies)
Discussion started by: Xterra
3 Replies
LEARN ABOUT XFREE86
ppmtopgm
ppmtopgm(1) General Commands Manual ppmtopgm(1)
NAME
ppmtopgm - convert a portable pixmap into a portable graymap
SYNOPSIS
ppmtopgm [ppmfile]
DESCRIPTION
Reads a portable pixmap as input. Produces a portable graymap as output. The output is a "black and white" rendering of the original
image, as in a black and white photograph. The quantization formula used is .299 r + .587 g + .114 b.
Note that although there is a pgmtoppm program, it is not necessary for simple conversions from pgm to ppm , because any ppm program can
read pgm (and pbm ) files automatically. pgmtoppm is for colorizing a pgm file. Also, see ppmtorgb3 for a different way of converting
color to gray. And ppmdist generates a grayscale image from a color image, but in a way that makes it easy to differentiate the original
colors, not necessarily a way that looks like a black and white photograph.
QUOTE
Cold-hearted orb that rules the night
Removes the colors from our sight
Red is gray, and yellow white
But we decide which is right
And which is a quantization error.
SEE ALSO
pgmtoppm(1),ppmtorgb3(1),rgb3toppm(1),ppmdist(1),ppm(5),pgm(5)
AUTHOR
Copyright (C) 1989 by Jef Poskanzer.
10 April 2000 ppmtopgm(1)