extract_url.pl 1.4.1 (Default branch)


 
Thread Tools Search this Thread
Special Forums News, Links, Events and Announcements Software Releases - RSS News extract_url.pl 1.4.1 (Default branch)
# 1  
Old 10-24-2008
extract_url.pl 1.4.1 (Default branch)

extract_url.pl is a Perl script that extracts URLsfrom either correctly-encoded MIME email messagesor from plain text. It can be used either as apre-parser for a URL selector (such as urlview),or as a standalone URL selector. It is veryconfigurable, can handle URLs that have beenbroken over several lines in format=floweddelsp=yes email messages, and eliminates duplicateURLs. It was designed primarily for use with Mutt,but can be used by for any similar purpose. Theidea is that if you want to access a URL in anemail, you pipe the email to a URL extractor (likethis one), which then lets you select a URL toview in some third program (such as Firefox).License: BSD License (original)Changes:
This release has better contextual text handling (it uses word boundaries instead of explicit string lengths), and pulls URLs out of HTML text in addition to HTML tags. This may be somewhat sensitive to formatting issues (unexpected line breaks, etc).Image

Image

More...
Login or Register to Ask a Question

Previous Thread | Next Thread
Login or Register to Ask a Question
curl_unescape(3)						  libcurl Manual						  curl_unescape(3)

NAME
curl_unescape - URL decodes the given string SYNOPSIS
#include <curl/curl.h> char *curl_unescape( char *url, int length ); DESCRIPTION
Obsolete function. Use curl_easy_unescape(3) instead! This function will convert the given URL encoded input string to a "plain string" and return that as a new allocated string. All input characters that are URL encoded (%XX where XX is a two-digit hexadecimal number) will be converted to their plain text versions. If the 'length' argument is set to 0, curl_unescape() will use strlen() on the input 'url' string to find out the size. You must curl_free() the returned string when you're done with it. AVAILABILITY
Since 7.15.4, curl_easy_unescape(3) should be used. This function will be removed in a future release. RETURN VALUE
A pointer to a zero terminated string or NULL if it failed. SEE ALSO
curl_easy_escape(3), curl_easy_unescape(3), curl_free(3), RFC 2396 libcurl 7.7 22 March 2001 curl_unescape(3)