extract_url.pl 1.3.3 (Default branch)


 
09-23-2008

extract_url.pl is a Perl script that extracts URLs from either correctly-encoded MIME email messages or from plain text. It can be used either as a pre-parser for a URL selector (such as urlview), or as a standalone URL selector. It is very configurable, can handle URLs that have been broken over several lines in format=flowed delsp=yes email messages, and eliminates duplicate URLs. It was designed primarily for use with Mutt, but can be used for any similar purpose. The idea is that if you want to access a URL in an email, you pipe the email to a URL extractor (like this one), which then lets you select a URL to view in some third program (such as Firefox).

License: BSD License (original)

Changes: Sometimes, multipart/alternative parts don't actually have an alternative, which could confuse the script. They're now handled correctly, and MIME parsing has been made more robust.
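
To make the description above concrete, here is a minimal Python sketch of the same idea: walk the MIME parts of a message, rejoin lines that format=flowed delsp=yes has soft-wrapped, pull out URLs, and drop duplicates. It is not taken from extract_url.pl (which is written in Perl). The names `URL_RE`, `unflow_delsp` and `extract_urls` are hypothetical, the URL pattern is deliberately simplistic, and the unwrapping ignores quoting and space-stuffing.

```python
import re
from email import message_from_string
from email.policy import default

# Assumption: a deliberately simple URL pattern, for illustration only.
URL_RE = re.compile(r'https?://[^\s<>"]+')


def unflow_delsp(body):
    """Rejoin soft-wrapped lines of a format=flowed, delsp=yes body.

    A line ending in a space is a soft break; with delsp=yes that space is
    dropped before joining. Simplified: quoting and space-stuffing are ignored.
    """
    paragraphs, pending = [], ""
    for line in body.splitlines():
        if line.endswith(" ") and line != "-- ":  # "-- " is the signature separator
            pending += line[:-1]
        else:
            paragraphs.append(pending + line)
            pending = ""
    if pending:
        paragraphs.append(pending)
    return "\n".join(paragraphs)


def extract_urls(raw_message):
    """Return the unique URLs found in the text/plain parts, in order."""
    msg = message_from_string(raw_message, policy=default)
    seen, urls = set(), []
    for part in msg.walk():
        if part.get_content_type() != "text/plain":
            continue
        text = part.get_content()
        is_flowed = (part.get_param("format") or "").lower() == "flowed"
        is_delsp = (part.get_param("delsp") or "").lower() == "yes"
        if is_flowed and is_delsp:
            text = unflow_delsp(text)
        for url in URL_RE.findall(text):
            if url not in seen:  # eliminate duplicates, keep first occurrence
                seen.add(url)
                urls.append(url)
    return urls
```

Walking every part and filtering on the content type, rather than assuming a fixed layout, means a multipart/alternative container with only one child is handled the same way as any other message.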

urlview(1)                  General Commands Manual                  urlview(1)

NAME
       urlview - URL extractor/launcher

SYNOPSIS
       urlview filename [ filename ... ]

DESCRIPTION
       urlview is a screen oriented program for extracting URLs from text
       files and displaying a menu from which you may launch a command to
       view a specific item.

CONFIGURATION
       urlview attempts to read ~/.urlview upon startup. If this file doesn't
       exist, it will try to read a system wide file in
       /etc/urlview/system.urlview. There are three configuration commands
       (order does not matter):

       REGEXP regexp
              urlview uses a regular expression to extract URLs from the
              specified text files. \r, \t, \n and \f are all converted to
              their normal printf(3) meanings. The default REGEXP is:

              (((http|https|ftp|gopher)|mailto):(//)?[^ <>"\t]*|(www|ftp)[0-9]?\.[-a-z0-9.]+)[^ .,;\t\n\r<">\):]?[^, <>"\t]*[^ .,;\t\n\r<">\):]

       COMMAND command
              If the specified command contains a %s, it will be substituted
              with the URL that was requested, otherwise the URL is appended
              to the COMMAND string. The default COMMAND is:

              /etc/urlview/url_handler.sh

              Note: You should never put single quotes around the %s. urlview
              does this for you, and also makes sure that single quotes
              eventually showing up inside the URL are handled properly.
              (Note that this shouldn't happen with the default regular
              expression, which explicitly excludes single quotes.)

       WRAP choice
              Enable or disable URL wrapping. Valid values for choice are:
              yes, no (case insensitive). If this option is not supplied, the
              default behaviour is to disable wrapping.

FILES
       /etc/urlview/system.urlview
              system-wide urlview configuration file

       ~/.urlview
              urlview configuration file

ENVIRONMENT
       If the environment variable BROWSER is set to a browser command, or a
       colon-delimited list of commands to try, then the specified browser is
       used. %s is replaced with the quoted url to view. If %s is not part of
       a command, the url is appended to the command.

       The BROWSER environment variable is honored only if the rc-file
       doesn't contain the COMMAND option. The rc-file provided by the Debian
       package contains a COMMAND option.

SEE ALSO
       printf(3), regcomp(3), regex(7), environ(7)

AUTHOR
       Michael Elkins <me@cs.hmc.edu>.
       Modified for Debian by Luis Francisco Gonzalez <luisgh@debian.org> and
       Emanuele Rocca <ema@debian.org>.
       Modified for SuSE by Dr. Werner Fink <werner@suse.de> and Stepan Kasal
       <kasal@suse.cz>.
       Changes put together by Thomas Roessler <roessler@does-not-exist.org>.
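
The REGEXP, COMMAND and BROWSER rules in the man page above can be sketched in Python roughly as follows. This is an illustration of the documented behaviour, not urlview's own code (urlview is not written in Python). The function names are hypothetical, and several details are assumptions: that matching is case-insensitive, that the tab, newline and carriage-return escapes belong in the default pattern as written here, that a space separates an appended URL from the command, and that Python's `re` treats the pattern closely enough to regcomp(3) for typical input.

```python
import re
import shlex

# Default REGEXP as read from the man page, with the \t, \n, \r escapes
# written out. Assumption: urlview matches case-insensitively.
DEFAULT_REGEXP = re.compile(
    r'''(((http|https|ftp|gopher)|mailto):(//)?[^ <>"\t]*'''
    r'''|(www|ftp)[0-9]?\.[-a-z0-9.]+)'''
    r'''[^ .,;\t\n\r<">\):]?[^, <>"\t]*[^ .,;\t\n\r<">\):]''',
    re.IGNORECASE,
)

# Default COMMAND named in the man page.
DEFAULT_COMMAND = "/etc/urlview/url_handler.sh"


def find_urls(text):
    """Return every substring of text matched by the default REGEXP."""
    return [m.group(0) for m in DEFAULT_REGEXP.finditer(text)]


def build_command(command, url):
    """Substitute the quoted URL for %s, or append it if there is no %s.

    The caller never quotes %s; quoting is done here, as the man page
    says urlview does for you.
    """
    quoted = shlex.quote(url)
    if "%s" in command:
        return command.replace("%s", quoted, 1)
    return command + " " + quoted  # assumption: separated by one space


def candidate_commands(rc_command, environ):
    """List the commands to try, in order.

    BROWSER is honored only when the rc-file has no COMMAND option.
    """
    if rc_command is not None:
        return [rc_command]
    browsers = [c for c in environ.get("BROWSER", "").split(":") if c]
    return browsers or [DEFAULT_COMMAND]
```

For example, `candidate_commands(None, os.environ)` (after `import os`) would yield the BROWSER entries for the current user, and each can be passed to `build_command` together with a URL returned by `find_urls`.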