Sponsored Content
Full Discussion: Help Parsing Sequence File
Top Forums UNIX for Dummies Questions & Answers Help Parsing Sequence File Post 302436858 by Fahmida on Tuesday 13th of July 2010 10:10:43 AM
Old 07-13-2010
Many Thanks for your reply. However, for input parameters "len=100 limit=20 testseq.txt", I get the following output:
HTML Code:
>1630 length 144 cvg_0.0_tip_0
TGTCTTCTTAAATCTTTATTAGCACgAATCCTGTGACGTAA
>1655 length 128 cvg_0.0_tip_0
aTGTGCTGGgTGGGTGCCTGtGTCTGTG
>1662 length 142 cvg_0.0_tip_0
cttcttaaatctttattctcTAGCAATATCCTGTGACGTAC
Length of the extracted sequences are 41, 28, and 41 respectively, which should be 40 for all of them. The expected output, considering 20 characters each taken from the start and end of each sequence is:

HTML Code:
>1630 length 144 cvg_0.0_tip_0
TGTCTTCTTAAATCTTTATTGCACgAATCCTGTGACGTAA
>1655 length 128 cvg_0.0_tip_0
aTGTGCTGGgTGGGTGCCTGtGGGAAGTGAGCtGTCTGTG
>1662 length 142 cvg_0.0_tip_0
cttcttaaatctttattctcAGCAATATCCTGTGACGTAC
All of them are 040 characters long. Thanks.
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Adding a sequence string to a file

I have a pipe delimited file I need to add a sequence number to in the third field. The record fields will be variable length, so I have to parse for the second pipe. Another requirement is that the sequence number must be unique to all records in the file and subsequent files created, so the... (5 Replies)
Discussion started by: MrPeabody
5 Replies

2. Shell Programming and Scripting

Parsing of file for Report Generation (String parsing and splitting)

Hey guys, I have this file generated by me... i want to create some HTML output from it. The problem is that i am really confused about how do I go about reading the file. The file is in the following format: TID1 Name1 ATime=xx AResult=yyy AExpected=yyy BTime=xx BResult=yyy... (8 Replies)
Discussion started by: umar.shaikh
8 Replies

3. Shell Programming and Scripting

Renaming a file use another file as a sequence calling a shl

have this shl that will FTP a file from the a directory in windows to UNIX, It get the name of the file stored in this variable $UpLoadFileName then put in the local directory LocalDir="${MPATH}/xxxxx/dat_files" that part seems to be working, but then I need to take that file and rename, I am using... (3 Replies)
Discussion started by: rechever
3 Replies

4. Shell Programming and Scripting

Adding sequence to the file

How do I add the sequence number to the file? I have a file seperated by commas. appusage,243,jsdgh,798 appusage,876,0989,900 . . appusage,82374,ajfgdh,9284 The output would be as below 1,appusage,243,jsdgh,798 2,appusage,876,0989,900 . . 100,appusage,876,0989,900 (5 Replies)
Discussion started by: smee
5 Replies

5. Shell Programming and Scripting

Parsing a fasta sequence with start and end coordinates

Hi.. I have a seperate chromosome sequences and i wanted to parse some regions of chromosome based on start site and end site.. how can i achieve this? For Example Chr 1 is in following format I need regions from 2 - 10 should give me AATTCCAAA and in a similar way 15- 25 should give... (8 Replies)
Discussion started by: empyrean
8 Replies

6. Shell Programming and Scripting

find common entries and match the number with long sequence and cut that sequence in output

Hi all, I have a file like this ID 3BP5L_HUMAN Reviewed; 393 AA. AC Q7L8J4; Q96FI5; Q9BQH8; Q9C0E3; DT 05-FEB-2008, integrated into UniProtKB/Swiss-Prot. DT 05-JUL-2004, sequence version 1. DT 05-SEP-2012, entry version 71. FT COILED 59 140 ... (1 Reply)
Discussion started by: manigrover
1 Replies

7. Shell Programming and Scripting

Get string of sequence from other file

Hi guys, Does anyone know how to get a string of sequence from other file? Should I use awk? Please see below. Thanks! LIST_FILE: >NAME1 >NAME3 >NAME5 >NAME7 >NAME8 SEQ_FILE: >NAME1 LEN75 100100101001010001010 >NAME2 LEN90 111010101010101101101 >NAME3 LEN27 101000101001010010101... (5 Replies)
Discussion started by: narachaid
5 Replies

8. Shell Programming and Scripting

Identifying Missing File Sequence

Hi, I have a file which contains few columns and the first column has the file names, and I would like to identify the missing file sequence number form the file and would copy to another file. My files has data in below format. APKRISPSIN320131231201319_0983,1,54,125,... (5 Replies)
Discussion started by: rramkrishnas
5 Replies

9. Shell Programming and Scripting

To search duplicate sequence in file

Hi, I want to search only duplicate sequence number in file e.g 4757610 4757610 should display only duplicate sequence number in file. file contain is: 4757610 6zE:EXPNL ORDER_PRIORITY='30600022004757610' ORDER_IDENTIFIER='4257771056' MM_ASK_VOLUME='273' MM_ASK_PRICE='1033.0000' m='GBX'... (5 Replies)
Discussion started by: ashfaque
5 Replies

10. Shell Programming and Scripting

Inserting IDs from a text file into a sequence alignment file

Hi, I have one file with one column and several hundred entries File1: NA1 NA2 NA3And now I need to run a command within a mapping aligner tool to insert these sample names into a sequence alignment file (SAM) such that they look like this @RG ID:Library1 SM:NA1 PL:Illumina ... (7 Replies)
Discussion started by: nans
7 Replies
pdbtxt2html(1)						      General Commands Manual						    pdbtxt2html(1)

NAME
pdbtxt2html - Doc Text to HTML converter for Palm Pilots SYNOPSIS
pdbtxt2html [ -t ] file.txt [ file.html ] pdbtxt2html -v DESCRIPTION
pdbtxt2html converts text converted from a Doc(4) file via txt2pdbdoc(1) to HTML. If no HTML filename is given, the generated HTML is sent to standard output. Document Title The first line of the file is used for the HTML document title. Bookmarks The last line of the file is examined and, if it contains a string enclosed between < and >, that is taken to be the bookmark marker. The entire file is then scanned looking for lines beginning with it (ignoring leading whitespace). These lines are converted to HTML headings. The number of whitespace characters after the first bookmark marker is used for heading level 1. The level of subsequent headings is set to the number of whitespace characters between the bookmark marker and the bookmark text minus the number for the first bookmark plus one. Embedded URLs Valid URLs (according to RFC 1630) embedded in the text are turned into hyperlinks. The ftp, gopher, http, https, mailto, news, telnet, and wais URLs are recognized. OPTIONS
-t Compile a table of contents and insert it between the first heading and the body. -v Print the version number to standard output and exit. EXAMPLE
To convert a Doc file to HTML: txt2pdbdoc alice.pdb alice.txt pdbtxt2html alice.txt alice.html SEE ALSO
html2pdbtxt(1), txt2pdbdoc(1), doc(4), pdb(4) Tim Berners Lee. Universal Resource Identifiers in WWW, Network Working Group of the Internet Engineering Task Force, June 1994. http://info.internet.isi.edu/in-notes/rfc/files/rfc1630.txt AUTHOR
Paul J. Lucas <pauljlucas@mac.com> txt2pdbdoc January 21, 2005 pdbtxt2html(1)
All times are GMT -4. The time now is 05:43 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy