The perl parser below works as expected assuming the last digit in the NC_ before the . is a single digit.
out_position.txt
contents of out1.txt --- output is correct
However, I can not seem to adjust it to account for the last digit in NC_ before the . in bold, may not always be 1 digit as in the case above, it could be 2 digits, as n the case below. In this case I would need to parse out 4 zeros, instead of 5. So my question is I am not sure how to make the condition in italics in the perl command adjust based on the NC_ being 1 or 2 digits? Thank you .
So in this case the desired output would be:
It is also possible for the NC_ to be a letter, not a digit, but in that case it is always one letter, NC_00000X.11:g.41747805_41747806delinsTT
to this:
Last edited by cmccabe; 03-07-2017 at 02:14 PM..
Reason: fixed format
...assuming the last digit in the NC_ before the . is a single digit.
...
However, .... the last digit in NC_ before the . in bold, may not always be 1 digit as in the case above, it could be 2 digits, as n the case below. In this case I would need to parse out 4 zeros, instead of 5.
...
...
It is also possible for the NC_ to be a letter, not a digit, but in that case it is always one letter, ...
So the string is one of the following:
(1) NC_ + five zeros + 1 digit + "." character => you want that one digit before before "." character
(2) NC_ + four zeros + 2 digits + "." character => you want those two digits before "." character
(3) NC_ + five zeros + 1 character + "." character => you want that one character before "." character
One way to look at it is:
NC_ + a sequence of more than one zeros + sequence of characters that are not zero + "." character
And you want to capture that sequence of non-zero characters before the "." character.
Here's a sample regex that does that:
This User Gave Thanks to durden_tyler For This Post:
Like I have below string
XX_49154534_491553_201_122023_D
XX_49159042_491738_201_103901_D
and the expected output would be
0154534
0159042
XX and 49 can be dynamic. (1 Reply)
The below perl code imports the data in the attached document. However, I can not seem to update the perl code to include a parser like in the desired tab of that document. Thank you :).
Most of the data for the parse is included in the document except for the gene and RNA which can is... (0 Replies)
Dear Perl Experts,
Could some body help me to find the solution for my problem below:
Input file:
-----------
THE-0 tsjp
THE-32 tsjp
THE-64 tsjp
Output desired:
---------------
THE-0&&-31 tsjp
THE-32&&-63 tsjp
THE-64&&-95 tsjp
Note:
31 = 0+31, (2 Replies)
Q: Where to get a 64 bit Expat.so?
I run a perl script and got this error:
Can't load '/usr/perl5/vendor_perl/5.8.4/i86pc-solaris-64int/auto/XML/Parser/Expat/Expat.so' for module XML:parser::Expat: ld.so.1:myPerl: fatal:... (0 Replies)
Hello,
What's the best way to split a large into multiple files based on the last digit in the first column.
input file:
f
2738483300000x0y03772748378831x1y13478378358383x2y23743878383802x3y33787828282820x4y43748838383881x5y5
Desired Output:
f0
3738483300000x0y03787828282820x4y4
f1... (9 Replies)
Hello.
Can anybody help me with some sub on perl that can parse config like this:
%CFG (
'databases' => {
'db1' => 'db_11', 'db_12', 'db_13',
'db2' => 'db_21', 'db_22', 'db_23'
}
'datafiles' => {
'datadir1' => 'datadir_11', 'datadir_12',
'datadir2' =>... (4 Replies)
Hello
I want to write an xml- parser with perl an i use the libary XML::LibXML.
I have a problem with the command getElementsByTagName.
If there is an empty tag, the getElementsByTagName method returns a NodeList of length zero.
how can i check if this is a nodelist of lenght zero??
i... (1 Reply)
I am very new to XML. Really I have an excel file that I am trying to read w/ Perl on a Linux machine. I don't have a mod for reading excel files so I have to convert the excel file to xml to be able to read it. I can read the file and everything is ok except...the Number style is being dropped... (0 Replies)
hi all i want to read xml file in perl i am using XML::Simple for this. i am not getting how to read following file
removing xml file due to some reason (1 Reply)