extract_url.pl 1.4.1 (Default branch)


 
Thread Tools Search this Thread
Special Forums News, Links, Events and Announcements Software Releases - RSS News extract_url.pl 1.4.1 (Default branch)
# 1  
Old 10-24-2008
extract_url.pl 1.4.1 (Default branch)

extract_url.pl is a Perl script that extracts URLsfrom either correctly-encoded MIME email messagesor from plain text. It can be used either as apre-parser for a URL selector (such as urlview),or as a standalone URL selector. It is veryconfigurable, can handle URLs that have beenbroken over several lines in format=floweddelsp=yes email messages, and eliminates duplicateURLs. It was designed primarily for use with Mutt,but can be used by for any similar purpose. Theidea is that if you want to access a URL in anemail, you pipe the email to a URL extractor (likethis one), which then lets you select a URL toview in some third program (such as Firefox).License: BSD License (original)Changes:
This release has better contextual text handling (it uses word boundaries instead of explicit string lengths), and pulls URLs out of HTML text in addition to HTML tags. This may be somewhat sensitive to formatting issues (unexpected line breaks, etc).Image

Image

More...
Login or Register to Ask a Question

Previous Thread | Next Thread
Login or Register to Ask a Question
URLSCAN(1)						      General Commands Manual							URLSCAN(1)

NAME
urlscan - browse the URLs in an email message from a terminal SYNOPSIS
urlscan [options] < message urlscan [options] message DESCRIPTION
urlscan accepts a single email message on standard input, then displays a terminal-based list of the URLs in the given message. Selecting a URL will invoke sensible-browser(1) on it (and hence any browser specified in the BROWSER environment variable). urlscan is primarily intended to be used with the mutt (1) mailreader, but it should work well with any terminal-based mail program. urlscan is similar to urlview(1), but has the following additional features: 1. Support for more message encodings, such as quoted-printable and base64. 2. Extraction and display of the context surrounding each URL. OPTIONS
-b, --background Run the Web browser in the background, so you can select another URL without closing it (this will not work with terminal-based Web browsers such as lynx, links, or w3m). -c, --compact Display a simple list of the extracted URLs, instead of showing the context of each URL. MUTT INTEGRATION
To integrate urlscan with mutt, include the following two commands in ~/.muttrc: macro index,pager cb "<pipe-message> urlscan<Enter>" "call urlscan to extract URLs out of a message" macro attach,compose cb "<pipe-entry> urlscan<Enter>" "call urlscan to extract URLs out of a message" Once these lines are in your mutt configuration file, pressing Control-b will allow you to browse and open the URLs in the currently selected message. SEE ALSO
/usr/share/doc/urlscan/README, sensible-browser(1), urlview(1), mutt(1) AUTHOR
This manual page was written by Daniel Burrows <dburrows@debian.org>. December 10, 2006 URLSCAN(1)