In cases where you don't have quoted > characters in tags (and I didn't see any of them in your samples, but didn't do an exhaustive search in your attachment), the following much simpler script might work:
With the sample data you posted in the 1st message in this thread, it produces the output:
I didn't see any problems processing your attached sample either, but due to the length (since this preserves all input lines and just removes tags), I won't post the results here. It would also be easy to get rid of empty lines after removing tags if that is what you want.
This User Gave Thanks to Don Cragun For This Post:
hi all,
i have a html file something similar to this.
<tr class="evenrow">
<td class="data">added</td><td class="data">xyz@abc.com</td>
<td class="data">filename.sql</td><td class="modifications-data">08/25/2009 07:58:40</td><td class="data">Added TK prof script</td>
</tr>
<tr... (1 Reply)
Hi!
I have a bunch of HTML files, which I want to parse to CSV files. Every page has a table in it, and I need to parse each row into a csv record.
With awk and sed, I managed to put every table row in separate lines. So my file looks like this:
<TR> .... </TR>
<TR> .... </TR>
...One... (1 Reply)
hi guys,
i want to parse a file using public function, the file contain raw data in the below format i want to get the output like this to load it to Oracle DB
MARWA1,BSS:26,1,3,0,0,0,0,0.00,22,22,22.00
MARWA2,BSS:26,1,3,0,0,0,0,0.00,22,22,22.00
this the file raw format:
Number of... (6 Replies)
Hello,
I have a html file like this :
<html>
...
...
...
<table>
.......
......
</table>
<table name = "hi">
......
.....
...
</table>
<h1> Welcome </h1>
.......
......
</html> (11 Replies)
Hello,
I want to extract some informations from a html (website, http://www.energiecontracting.de/7-mitglieder/von-A-Z.php?a_z=B&seite=2 ) file and save those in a predefined format (.csv).. However it seems that the code on that website is kinda messy and I can't find a way to handle it... (5 Replies)
Hi all, I have a file that contains a good hundred of these job definitions below:
Job Name Last Start Last End ST Run Pri/Xit
________________________________________________________________ ____________________... (7 Replies)
<DIV><P>Pré-condição aceder ao ecrã Home do MRS.</P></DIV><DIV><P>OK.</P></DIV><DIV><P>Seleccionar Pesquisa de Recepção Directa.</P></DIV><DIV><P>Confirmar que abriu ecrã de Recepção Directa.</P></DIV><DIV> (6 Replies)
I have downloaded source code for 97 files using:
wget -x -i link.txt then run a rename loop:
for file in *
do
mv $file $file.txt
done to keep the html tags but make the file a text that can be parsed.
In each of the 97 txt files the gene # is variable, but the gene is associated... (15 Replies)
I downloaded source code using:
wget -qO- http://fulgentdiagnostics.com/test/clinical-exome/ | cat > flugentsource.txt
Now I am trying to use sed to parse it to confirm a gene count. Basically, output (flugent.txt) all the gene names with a total count after them
I'm not all that... (5 Replies)
Hi,
im trying to read a Temperature value from html code.
So far i have managed to reduce the whole html page down to this single line with the following sed command:sed -n '/Temperature/p' $temp_temperature | tee temp_string
<TD width='350'>Temperature :</td><td>25... (2 Replies)
Discussion started by: naittis
2 Replies
LEARN ABOUT DEBIAN
bio::seqfeature::gene::nc_feature
Bio::SeqFeature::Gene::NC_Feature(3pm) User Contributed Perl Documentation Bio::SeqFeature::Gene::NC_Feature(3pm)NAME
Bio::SeqFeature::Gene::NC_Feature.pm - superclass for non-coding features
SYNOPSIS
Give standard usage here
DESCRIPTION
Describe the object here
FEEDBACK
Mailing Lists
User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the
Bioperl mailing list. Your participation is much appreciated.
bioperl-l@bioperl.org - General discussion
http://bioperl.org/wiki/Mailing_lists - About the mailing lists
Support
Please direct usage questions or support issues to the mailing list:
bioperl-l@bioperl.org
rather than to the module maintainer directly. Many experienced and reponsive experts will be able look at the problem and quickly address
it. Please include a thorough description of the problem with code and data examples if at all possible.
Reporting Bugs
Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the
web:
https://redmine.open-bio.org/projects/bioperl/
AUTHOR - David Block
Email dblock@gnf.org
APPENDIX
The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _
is_coding
Title : is_coding
Usage : if ($feature->is_coding()) {
#do something
}
Function: Whether or not the feature codes for amino acid.
Returns : FALSE
Args : none
cds
Title : cds
Usage : $cds=$feature->cds();
Function: get the coding sequence of this feature
Returns : undef
Args : none
perl v5.14.2 2012-03-02 Bio::SeqFeature::Gene::NC_Feature(3pm)