debian man page for html::treebuilder::libxml

HTML::TreeBuilder::LibXML(3pm)				User Contributed Perl Documentation			    HTML::TreeBuilder::LibXML(3pm)

NAME
HTML::TreeBuilder::LibXML - HTML::TreeBuilder and XPath compatible interface with libxml
SYNOPSIS
    use HTML::TreeBuilder::LibXML;

    my $tree = HTML::TreeBuilder::LibXML->new;
    $tree->parse($html);
    $tree->eof;

    # $tree and $node are compatible with HTML::Element
    my @nodes = $tree->findnodes($xpath);   # findvalue() would return a string, not nodes
    for my $node (@nodes) {
        print $node->tag;
        my %attr = $node->all_external_attr;
    }

    HTML::TreeBuilder::LibXML->replace_original(); # replace HTML::TreeBuilder::XPath->new
DESCRIPTION
HTML::TreeBuilder::LibXML is a libxml-based interface compatible with HTML::TreeBuilder, which can be slow for large documents. It is a drop-in replacement for HTML::TreeBuilder::XPath. This module does not implement all of the HTML::TreeBuilder and HTML::Element APIs, but it defines enough methods for modules such as Web::Scraper to work.
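The drop-in behaviour described above can be sketched as follows. This is a minimal illustration, not code from the distribution: the helper name paragraph_texts and the sample HTML are assumptions made for the example, and it assumes HTML::TreeBuilder::XPath is installed alongside this module.

```perl
use strict;
use warnings;
use HTML::TreeBuilder::XPath;
use HTML::TreeBuilder::LibXML;

# Redirect HTML::TreeBuilder::XPath->new to the libxml-backed class.
# Called in void context, so the replacement stays in effect.
HTML::TreeBuilder::LibXML->replace_original();

# Hypothetical helper (not part of the module): returns the text of
# every <p> element found in $html.
sub paragraph_texts {
    my ($html) = @_;

    # Existing code keeps calling HTML::TreeBuilder::XPath->new unchanged.
    my $tree = HTML::TreeBuilder::XPath->new;
    $tree->parse($html);
    $tree->eof;

    return map { $_->as_text } $tree->findnodes('//p');
}

my $html = '<html><body><p class="intro">Hello</p><p>World</p></body></html>';
print "$_\n" for paragraph_texts($html);
```

Because the replacement happens at the constructor, callers such as Web::Scraper that build their trees through HTML::TreeBuilder::XPath->new should pick up the libxml backend without being modified.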
BENCHMARK
This is a benchmark result by tools/benchmark.pl

    Web::Scraper: 0.26
    HTML::TreeBuilder::XPath: 0.09
    HTML::TreeBuilder::LibXML: 0.01_01

                 Rate  no_libxml use_libxml
    no_libxml  5.45/s         --       -94%
    use_libxml 94.3/s      1632%         --
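A comparison of this kind could be produced with the core Benchmark module. The sketch below is not the contents of tools/benchmark.pl, which is not reproduced on this page; the synthetic document, the helper count_paragraphs, and the choice to query the trees directly rather than through Web::Scraper are all assumptions, so the rates it prints will differ from the figures above.

```perl
use strict;
use warnings;
use Benchmark qw(cmpthese);
use HTML::TreeBuilder::XPath;
use HTML::TreeBuilder::LibXML;

# Synthetic input (an assumption): a document with 200 paragraphs.
my $html = '<html><body>'
         . join('', map { "<p>item $_</p>" } 1 .. 200)
         . '</body></html>';

# Hypothetical helper: parse $html with the given class and count <p> nodes.
sub count_paragraphs {
    my ($class) = @_;
    my $tree = $class->new;
    $tree->parse($html);
    $tree->eof;
    my @nodes = $tree->findnodes('//p');
    # HTML::TreeBuilder trees should be freed explicitly.
    $tree->delete if $tree->can('delete');
    return scalar @nodes;
}

# Run each variant for at least one CPU second and print a comparison table.
cmpthese(-1, {
    no_libxml  => sub { count_paragraphs('HTML::TreeBuilder::XPath') },
    use_libxml => sub { count_paragraphs('HTML::TreeBuilder::LibXML') },
});
```

The labels no_libxml and use_libxml are chosen to match the table above; replace_original is deliberately not called here so that the two classes remain distinct.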
AUTHOR
Tokuhiro Matsuno <tokuhirom slkjfd gmail.com>
Tatsuhiko Miyagawa <miyagawa@cpan.org>
Masahiro Chiba
THANKS TO
woremacx++ http://d.hatena.ne.jp/woremacx/20080202/1201927162
id:dailyflower
SEE ALSO
HTML::TreeBuilder, HTML::TreeBuilder::XPath
LICENSE
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

perl v5.14.2				2012-04-02			    HTML::TreeBuilder::LibXML(3pm)