html::autopagerize(3pm) debian man page | unix.com

Man Page: html::autopagerize

Operating Environment: debian

Section: 3pm

HTML::AutoPagerize(3pm) 				User Contributed Perl Documentation				   HTML::AutoPagerize(3pm)

NAME
HTML::AutoPagerize - Utility to load AutoPagerize SITEINFO stuff
SYNOPSIS
use HTML::AutoPagerize; my $autopager = HTML::AutoPagerize->new; $autopager->add_site( url => 'http://.+.tumblr.com/', nextLink => '//div[@id="content" or @id="container"]/div[last()]/a[last()]', pageElement => '//div[@id="content" or @id="container"]/div[@class!="footer" or @class!="navigation"]', ); my $uri = 'http://otsune.tumblr.com/'; my $html = LWP::Simple::get($uri); my $res = $autopager->handle($uri, $html); if ($res) { my $next_link = $res->{next_link}; # URI object my $content = $res->{page_element}; # XML::XPathEngine::NodeSet object. may be empty }
DESCRIPTION
HTML::AutoPagerize is an utility module to load SITEINFO defined in AutoPagerize. AutoPagerize is an userscript to automatically figure out the "next link" of the current page, then fetch the content and insert the content by extracting the "page element".
AUTHOR
Tatsuhiko Miyagawa <miyagawa@bulknews.net>
LICENSE
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
SEE ALSO
WWW::Mechanize::AutoPager, <http://swdyh.infogami.com/autopagerize> perl v5.12.4 2011-09-26 HTML::AutoPagerize(3pm)
Related Man Pages
html::autopagerize(3pm) - debian
html::mason::psgihandler(3pm) - debian
html::selector::xpath(3pm) - debian
text::worddiff::html(3pm) - debian
web::scraper(3pm) - debian
Similar Topics in the Unix Linux Community
pytumblr 1.0 (Default branch)
HTML Extractor 1.0 (Default branch)
Bricolage 1.10.6 (Stable branch)
Dynamic naming based on file content
Replace Block