09-22-2018
Thank you Rudic,
The code is almost perfect, but it prints the input FASTA file as a header before the expected output.
I don't want the input fasta file added to the output.
Thanks
------ Post updated 09-22-18 at 04:21 AM ------
Thank you Rudic
Fixed it
bp_mask_by_search
BP_MASK_BY_SEARCH(1p) User Contributed Perl Documentation BP_MASK_BY_SEARCH(1p)
NAME
mask_by_search - mask sequence(s) based on its alignment results
SYNOPSIS
mask_by_search.pl -f blast genomefile blastfile.bls > maskedgenome.fa
DESCRIPTION
Mask a sequence based on significant alignments of another sequence. You need to provide the report file and the entire sequence data that
you want to mask. By default, this assumes you have done a TBLASTN (or TFASTY) search and tries to mask the hit sequence, assuming you've
provided the sequence file for the hit database. If you would like to do the reverse and mask the query sequence, specify the -t/--type
query flag.
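To make the soft-mask versus hard-mask distinction concrete, here is a minimal Python sketch. It is not the script's actual code: the function name `mask_regions` and the use of 1-based, inclusive coordinates are assumptions made for illustration. Only the lowercase default and the N mask character come from the option descriptions on this page.

```python
def mask_regions(seq, regions, hardmask=False, maskchar="N"):
    """Mask (start, end) regions of seq.

    Assumption: coordinates are 1-based and inclusive.
    Soft masking lowercases the region; hard masking overwrites
    every position in it with maskchar.
    """
    chars = list(seq)
    for start, end in regions:
        lo, hi = sorted((start, end))  # tolerate reversed coordinates
        lo = max(lo, 1)                # clamp to the sequence bounds
        hi = min(hi, len(chars))
        for i in range(lo - 1, hi):
            chars[i] = maskchar if hardmask else chars[i].lower()
    return "".join(chars)


genome = "ACGTACGTACGT"
hits = [(3, 6), (10, 11)]  # hypothetical alignment coordinates
print(mask_regions(genome, hits))                 # ACgtacGTAcgT
print(mask_regions(genome, hits, hardmask=True))  # ACNNNNGTANNT
```

Overlapping regions are harmless in this sketch because masking the same position twice gives the same result.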
This reads the whole sequence file into memory, so it may fall over on large genomes. I'm using DB_File to avoid keeping
everything in memory. One solution is to split the genome into pieces (do this BEFORE you run the DB search, though: you want to use the
exact file you BLASTed with as input to this program).
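One way to picture "splitting the genome into pieces" is to read FASTA records one at a time and group them into fixed-size batches. The Python sketch below only illustrates that idea; `read_fasta`, `chunk_records`, and the batch size of 2 are hypothetical and are not part of this script or of BioPerl.

```python
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs one record at a time."""
    header, parts = None, []
    for line in handle:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(parts)
            header, parts = line[1:], []
        elif header is not None:
            parts.append(line.strip())
    if header is not None:
        yield header, "".join(parts)


def chunk_records(records, per_chunk):
    """Group an iterable of records into lists of at most per_chunk items."""
    chunk = []
    for rec in records:
        chunk.append(rec)
        if len(chunk) == per_chunk:
            yield chunk
            chunk = []
    if chunk:
        yield chunk


# Tiny in-memory stand-in for a genome file.
demo = io.StringIO(">chr1\nACGT\nACGT\n>chr2\nGGCC\n>chr3\nTTAA\n")
for n, chunk in enumerate(chunk_records(read_fasta(demo), 2), start=1):
    print(n, [name for name, _ in chunk])
```

Each batch could then be written to its own file and searched separately, so that the file passed to the masking step is exactly the file that was searched.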
The double-dash (--) options below can be written as --format=fasta or --format fasta, or you can just say -f fasta.
Where an option is listed as -f/--format, either form is acceptable. The =s, =n, or =c suffix means the option expects a string, a number, or a character argument, respectively.
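The three spellings of an option can be illustrated with Python's argparse, which accepts the same forms. This is only an analogy: the script itself is Perl and does not use argparse, and the helper `parse_format` and the program name are made up for the demonstration.

```python
import argparse


def parse_format(argv):
    """Parse a hypothetical -f/--format option and return its value."""
    parser = argparse.ArgumentParser(prog="option_forms_demo")
    parser.add_argument("-f", "--format", help="search report format")
    return parser.parse_args(argv).format


# All three spellings yield the same value.
for argv in (["--format=fasta"], ["--format", "fasta"], ["-f", "fasta"]):
    print(argv, "->", parse_format(argv))
```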
Options:
-f/--format=s             Search report format (fasta, blast, axt, hmmer, etc.)
-sf/--sformat=s           Sequence format (fasta, genbank, embl, swissprot)
--hardmask                (boolean) Hard mask the sequence with the maskchar
                          [default is lowercase mask]
--maskchar=c              Character to mask with [default is N]; change
                          to 'X' for protein sequences
-e/--evalue=n             E-value cutoff for HSPs and hits; only mask
                          sequence if the alignment has the specified
                          E-value or better
-o/--out/--outfile=file   Output file to save the masked sequence to
-t/--type=s               Alignment sequence type you want to mask, the
                          'hit' or the 'query' sequence [default is 'hit']
--minlen=n                Minimum length of an HSP for it to be used
                          in masking [default 0]
-h/--help                 See this help information
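The -e/--evalue and --minlen options decide which HSPs are used for masking. A rough Python sketch of that kind of filter follows. The `HSP` tuple, the `select_hsps` name, and measuring length as the coordinate span are all assumptions; the script may measure HSP length differently.

```python
from typing import NamedTuple


class HSP(NamedTuple):
    """Hypothetical stand-in for one high-scoring segment pair."""
    start: int
    end: int
    evalue: float


def select_hsps(hsps, evalue_cutoff=None, minlen=0):
    """Keep HSPs whose E-value is at or below the cutoff and whose
    span is at least minlen. With no cutoff, E-value is ignored."""
    kept = []
    for h in hsps:
        length = abs(h.end - h.start) + 1  # assumed 1-based inclusive span
        if evalue_cutoff is not None and h.evalue > evalue_cutoff:
            continue
        if length < minlen:
            continue
        kept.append(h)
    return kept


hsps = [HSP(3, 6, 1e-10), HSP(10, 11, 1e-3), HSP(20, 80, 5.0)]
print(select_hsps(hsps, evalue_cutoff=1e-5, minlen=3))
print(len(select_hsps(hsps)))  # defaults keep every HSP
```

With a cutoff of 1e-5 and a minimum length of 3, only the first HSP survives; with the defaults, all three are kept.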
AUTHOR - Jason Stajich
Jason Stajich, jason-at-bioperl-dot-org.
perl v5.14.2 2012-03-02 BP_MASK_BY_SEARCH(1p)