06-18-2015
Trimming ends
My files look like this
Quote:
>GHXCZCC01AJ8CJ
ACGTGCGTGGGAAATTGATGTGCTTGGTGTGTATCATTTCTGGGAAGCCCTACGCCCCGGGGC
>GHXCZCC01APUO5
ACGTGCGTGGGAAAT-ATGTGCCGTTGGTGTGTATCAGCTGGATTTCTGGGACGCAGCCCTACCCGGGGCGA
>GHXCZCC01AQSRP
ACGTGCGTGGGAAATTGATGTTA---AGCTGGATTTTCTGGGACGCCCCGGGGAGCCCTA
>GHXCZCC01AQSRP
TTGTTGCCAGCTAGCTGAGCCCTAGATTTTCTGGGGCCCCGGGG
>GHXCZCC01AQSRP
TTGATGTTGCCCAGCCCTATAGCTGGATTTTCTGGGACGCCCCGGGGTGC
I need to remove the sequence GGGAAA and anything before that
I also need to remove the sequence AGCCCTA and anything after that
So I will end up with something like this
Quote:
>GHXCZCC01AJ8CJ
TTGATGTGCTTGGTGTGTATCATTTCTGGGA
>GHXCZCC01APUO5
T-ATGTGCCGTTGGTGTGTATCAGCTGGATTTCTGGGACGC
>GHXCZCC01AQSRP
TTGATGTTA---AGCTGGATTTTCTGGGACGCCCCGGGG
>GHXCZCC01AQSRP
TTGTTGCCAGCTAGCTG
>GHXCZCC01AQSRP
TTGATGTTGCCC
The left side is done but I cannot get the right side correctly. I would like to use one AWK script to accomplish the whole think instead of multiple steps
Thanks for any help!
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi everyone
I have this script that appends a line to a file to log the running status of an application. I need to write another script to run as a scheduled job in cron to trim the first x number of lines of this file.
Could someone give me an idea how to do this?
Regards (1 Reply)
Discussion started by: alwayslearningunix
1 Replies
2. UNIX for Advanced & Expert Users
Hi
i have question, is this ok in ksh, like
while ]
do
something
read true
done
loop runs but never ends even i pass y/Y.
can any body please resolve it (4 Replies)
Discussion started by: Raom
4 Replies
3. UNIX for Advanced & Expert Users
Hi,
I want to trim +with leading zero's with amount fields.I know using awk for trimming leading zeros with +,but I want get the entire row itself.
cat file_name |awk -F " " '{printf "%14.4f%f\n",$4}'
ex:
10 xyz bc +00000234.4500
20 yzx foxic +002456.000
Expexted
10 xyz bc... (3 Replies)
Discussion started by: mohan705
3 Replies
4. UNIX for Advanced & Expert Users
Hi,
How can I remove the unwanted spaces in the line.
123456 789 ABC DEF. - I wanna remove the sapces in this line, I need the output 123456789ABCDEF.
Pls help me...... (3 Replies)
Discussion started by: sharif
3 Replies
5. Shell Programming and Scripting
hi have output as
i have trim of lines before CREATE statement and lins after last ")"
any idea how to achieve it ? (3 Replies)
Discussion started by: crackthehit007
3 Replies
6. Shell Programming and Scripting
I'm trying to parse an output log and I've managed to reduce the output to the lines I need. But I'm having trouble pulling out only the info I'm interested in. The output is 40+ lines and here is a sample
Installing AppFresh 0.8.5.pkg from ./InstallerFiles/CustomPKG/26 (26)
Installing... (2 Replies)
Discussion started by: kaltekar
2 Replies
7. Shell Programming and Scripting
My file looks like this:
But I would like to 'trim' all sequences to the same lenght 32 characters, keeping intact all the identifier (>GHXCZCC01AJ8CJ)
Would it be possible to use awk to perform this task? (2 Replies)
Discussion started by: Xterra
2 Replies
8. Shell Programming and Scripting
Hi I need to trim white spaces from strings in a file.
Input file is like this:
1_rrc_CatalogGroups.csv = 607
1_rrc_Sales_TopCatalogGroups.csv = 4
1_rrc_Sales_CatalogEntries_CatalogGroup_Rel.csv = 7
Need to trim space before and after = symbol.
This is my script:
#!/usr/bin/ksh
... (2 Replies)
Discussion started by: sukhdip
2 Replies
9. Shell Programming and Scripting
Hi i have a log file
P12345_15728710:DEBUG:Begin
P12345_15728710:DEBUG:Being
P12345_15729310:DEBUG:GetAgen
P12345_15726510:DEBUG:end
i want to trim this file and i want like this
15728710
15728710
15729310
15726510
i tried sed ..but not working..
sed "s/.*P12345__ \(.*\)... (4 Replies)
Discussion started by: navsan420
4 Replies
10. Shell Programming and Scripting
I am new to bash/shell scripting.
I want to find all the files in directory and subdirectories, which are not ends with “.zip” and which are contains in the file name “*.log*” or “*.out*”.
I know below command to get the files which ends with “.log”; but I need which are not ends with this... (4 Replies)
Discussion started by: Mallikgm
4 Replies
LEARN ABOUT LINUX
tzselect
TZSELECT(8) Linux System Administration TZSELECT(8)
NAME
tzselect - select a timezone
SYNOPSIS
tzselect
DESCRIPTION
The tzselect program asks the user for information about the current location, and outputs the resulting timezone description to standard
output. The output is suitable as a value for the TZ environment variable.
All interaction with the user is done via standard input and standard error.
EXIT STATUS
The exit status is zero if a timezone was successfully obtained from the user, nonzero otherwise.
ENVIRONMENT
AWK Name of a Posix-compliant awk program (default: awk).
TZDIR Name of the directory containing timezone data files (default: /usr/share/zoneinfo).
FILES
TZDIR/iso3166.tab
Table of ISO 3166 2-letter country codes and country names.
TZDIR/zone.tab
Table of country codes, latitude and longitude, TZ values, and descriptive comments.
TZDIR/TZ
Time zone data file for timezone TZ.
SEE ALSO
tzfile(5), zdump(8), zic(8)
COLOPHON
This page is part of release 3.55 of the Linux man-pages project. A description of the project, and information about reporting bugs, can
be found at http://www.kernel.org/doc/man-pages/.
2007-05-18 TZSELECT(8)