Here is a possible solution. The script SS_WebPageToCSV ( http://www.biterscripting.com/SS_WebPageToCSV.html ) does exactly what you need. It takes a URL and a table number, and extracts the data in that table as CSV. By default the output is written to the screen, but you can redirect it to a CSV file. Here are a couple of example commands.
Or,
The first command shows the output on screen. The second command creates the CSV file "Output.CSV" (in the current directory) with the data from the table.
The number of the table you want to extract (an HTML document may have more than one table) is supplied through the number() argument to the script, and the URL through the page() argument. The script can extract tables from many document types: .html, .php, .asp, etc.
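For readers without biterscripting, the same idea (pick the n-th table in a page and emit its cells as CSV) can be sketched with the Python standard library alone. This is not the SS_WebPageToCSV script; the names TableExtractor and table_to_csv are hypothetical, the page is assumed to be already loaded into a string, and nested tables are not handled.

```python
import csv
import io
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the cell text of the n-th <table> (counting from 1)."""

    def __init__(self, number):
        super().__init__()
        self.number = number   # which table to keep
        self.seen = 0          # how many <table> tags have been opened
        self.active = False    # True while inside the wanted table
        self.rows = []         # completed rows of the wanted table
        self.row = None        # row currently being built
        self.cell = None       # text fragments of the current cell

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.seen += 1
            self.active = self.seen == self.number
        elif self.active and tag == "tr":
            self.row, self.cell = [], None
        elif self.active and tag in ("td", "th") and self.row is not None:
            self.cell = []

    def handle_endtag(self, tag):
        if tag == "table":
            self.active = False
            self.row = self.cell = None
        elif tag in ("td", "th") and self.cell is not None and self.row is not None:
            # Join the fragments and collapse runs of whitespace.
            self.row.append(" ".join("".join(self.cell).split()))
            self.cell = None
        elif tag == "tr" and self.row is not None:
            self.rows.append(self.row)
            self.row = self.cell = None

    def handle_data(self, data):
        if self.cell is not None:
            self.cell.append(data)


def table_to_csv(html, number=1):
    """Return table `number` of `html` as CSV text ('' if there is no such table)."""
    parser = TableExtractor(number)
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()


# Made-up sample page with two tables; the second one is extracted.
sample = """
<table><tr><td>skip</td></tr></table>
<table>
  <tr><th>Name</th><th>City</th></tr>
  <tr><td>Ann</td><td>Austin, TX</td></tr>
</table>
"""
print(table_to_csv(sample, number=2), end="")
```

To work on a live page, the HTML string could be fetched first (for example with urllib.request.urlopen), and redirecting the printed output to a file gives the same effect as the "Output.CSV" example above.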
I have a html file called myfile. If I simply put "cat myfile.html" in UNIX, it shows all the html tags like <a href=r/26><img src="http://www>. But I want to extract only text part.
Same problem happens in "type" command in MS-DOS.
I know you can do it by opening it in Internet Explorer,... (4 Replies)
Hi everyone:
I want to extract string which is in between certain html tag.
e.g.
I tried grep, cut, and awk but could not find the exact syntax for this one.
PS>Sorry about bad english. (8 Replies)
Hi,
I have the following code in my xml file:
<aaaRule loginIdPattern=".*"
orgIdPattern=".*" deny="false" />
<aaaRuleGroup name="dpaas">
<aaaRule loginIdPattern=".*" orgIdPattern=".*"
deny="false" />
I want to retrieve orgIdPattern and loginIdPattern parameter value based on... (2 Replies)
I have an XML tag like this:
<property name="agent" value="/var/tmp/root/eclipse" />
Is there a way, using awk, to get the value from the above tag? The output should be:
/var/tmp/root/eclipse
Help will be appreciated.
Regards,
Adi (6 Replies)
I have an XML file in which I need to parse only a particular tag and print the output in a shell script.
Here is the tag info in the xml file
<dp:file> This is dp file output </dp:file>
Output should be printed as
This is dp file output.
Please help.Thank you. (5 Replies)
Hi
I am new to string extraction in shell scripts... I am trying to extract a string such as #1753 from an HTML tag that looks like the one below.
<a class="model-link tl-tr" href="lastSuccessfulBuild/">Last successful build (#1753), 40 min ago</a>
and want the value as
1753
Could someone help me to... (3 Replies)
I want to print everything from the <fruits> tag to the </fruits> tag for blocks whose <fruit> is mango. I also want both <fruits> and </fruits> in the output. Please help.
eg.
<fruits>
<fruit id="111">mango<fruit>
.
another 20 lines
.
</fruits> (3 Replies)
Hi Guys
Here is my Input :
<?xml version="1.0" encoding="UTF-8"?>
<xn:MeContext id="01736">
<xn:VsDataContainer id="01736">
<xn:attributes>
<xn:vsDataType>vsDataMeContext</xn:vsDataType>
... (12 Replies)
Hello,
I want to parse the contents of a multiline html tag
ex:
<html>
<body>
<p>some other text</p>
<div>
<p class="margin-bottom-0">
text1
<br>
text2
<br>
<br>
text3
</p>
</div>
</body> (15 Replies)
Discussion started by: SorcRR
LEARN ABOUT DEBIAN
anydata::format::csv
AnyData::Format::CSV(3pm) User Contributed Perl Documentation AnyData::Format::CSV(3pm)
NAME
AnyData::Format::CSV - tiedhash & DBI/SQL access to CSV data
SYNOPSIS
use AnyData;
my $table = adTable( 'CSV', $filename,'r',$flags );
while (my $row = each %$table) {
print $row->{name},"\n" if $row->{country} =~ /us|mx|ca/;
}
# ... other tied hash operations
OR
use DBI;
my $dbh = DBI->connect('dbi:AnyData:');
$dbh->func('table1','CSV', $filename,$flags,'ad_catalog');
my $hits = $dbh->selectall_arrayref( qq{
SELECT name FROM table1 WHERE country = 'us'
});
# ... other DBI/SQL operations
DESCRIPTION
This is a plug-in format parser for the AnyData and DBD::AnyData modules. It will read column names from the first row of the file, or
accept names passed by the user. In addition to column names, the user may set other options as follows:
col_names : a comma separated list of column names
eol : the end of record mark, \n by default
quote_char : the character used to quote fields, " by default
escape_char : the character used to escape the quote char, " by default
If you are using this with DBD::AnyData, put ad_ in front of the flags, e.g. ad_eol.
Please refer to the documentation for AnyData.pm and DBD::AnyData.pm for further details.
AUTHOR & COPYRIGHT
copyright 2000, Jeff Zucker <jeff@vpservices.com> all rights reserved
perl v5.10.1 2004-08-17 AnyData::Format::CSV(3pm)