Extract URL from RSS Feed in AWK

Feed::Find(3pm) 					User Contributed Perl Documentation					   Feed::Find(3pm)

NAME

       Feed::Find - Syndication feed auto-discovery

SYNOPSIS

	   use Feed::Find;
	   my @feeds = Feed::Find->find('http://example.com/');

DESCRIPTION

       Feed::Find implements feed auto-discovery for finding syndication feeds, given a URI. It (currently) passes all of the auto-discovery tests
       at http://diveintomark.org/tests/client/autodiscovery/.

       Feed::Find will discover the following feed formats:

       o   RSS 0.91

       o   RSS 1.0

       o   RSS 2.0

       o   Atom

USAGE

   Feed::Find->find($uri)
       Given a URI $uri, use a variety of techniques to find the feeds associated with that page. If $uri itself points to a feed (i.e., if the
       Content-Type of the response is a recognized feed type), returns $uri.

       Returns a list of feed URIs.

       The following techniques are used:

       1. <link> tag auto-discovery
	   If the page contains any <link> tags in the <head> section, these tags are examined for recognized feed content types. The following
	   content types are treated as feeds: application/x.atom+xml, application/atom+xml, application/xml, text/xml, application/rss+xml, and
	   application/rdf+xml.

       2. Scanning <a> tags
	   If the page contains no recognized feed <link> tags, its <a> tags are scanned for links to URIs with certain file extensions.
	   The following extensions are treated as feeds: .rss, .xml, and .rdf.

	   Note that this technique is employed only if the first technique returns no results.
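
       The two techniques and their fallback order can be sketched in plain Perl as follows. This is only an illustration of the logic
       described above, not Feed::Find's actual implementation: the helper names tag_attrs and sketch_find_feeds are hypothetical, tags are
       matched with simple regular expressions rather than a real HTML parser, the <head> restriction is ignored, and relative links are
       returned as-is.

```perl
use strict;
use warnings;

# Content types and file extensions taken from the lists above.
my %FEED_TYPE = map { $_ => 1 } qw(
    application/x.atom+xml application/atom+xml application/xml
    text/xml application/rss+xml application/rdf+xml
);
my $FEED_EXT = qr/\.(?:rss|xml|rdf)$/i;

# Hypothetical helper: collect the quoted attributes of one start tag.
sub tag_attrs {
    my ($tag) = @_;
    my %attr;
    while ($tag =~ /([\w-]+)\s*=\s*(["'])(.*?)\2/g) {
        $attr{lc $1} = $3;
    }
    return %attr;
}

# Hypothetical function: return candidate feed URIs found in an HTML string.
sub sketch_find_feeds {
    my ($html) = @_;
    my @feeds;

    # Technique 1: <link> tags whose type is a recognized feed content type.
    while ($html =~ /(<link\b[^>]*>)/gi) {
        my %a = tag_attrs($1);
        push @feeds, $a{href}
            if defined $a{href} && $FEED_TYPE{ lc($a{type} // '') };
    }
    return @feeds if @feeds;

    # Technique 2: used only when technique 1 found nothing.
    while ($html =~ /(<a\b[^>]*>)/gi) {
        my %a = tag_attrs($1);
        push @feeds, $a{href}
            if defined $a{href} && $a{href} =~ $FEED_EXT;
    }
    return @feeds;
}

# Example page; the paths are placeholders.
my $page = <<'HTML';
<html><head>
<link rel="alternate" type="application/rss+xml" href="/index.rss">
</head><body><a href="/other.xml">XML</a></body></html>
HTML

print "$_\n" for sketch_find_feeds($page);
```

       Because the <link> tag in the example page has a recognized content type, the <a> link to /other.xml is never examined.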

   Feed::Find->find_in_html($html [, $base_uri ])
       Given a reference to a string $html containing an HTML page, uses the same techniques described above for find to locate the feeds
       associated with that page.

       If you know the URI of the page, you should provide it in $base_uri, so that relative links can be properly made absolute. Feed::Find will
       attempt to determine the correct base URI, but unless that URI is specified in the HTML itself (in a "<meta>" tag), you'll need to supply
       it yourself.

       Returns a list of feed URIs.
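
       A minimal usage sketch for find_in_html, assuming Feed::Find is installed. The HTML page and the base URI http://example.com/blog/
       are made-up placeholders; the load is guarded so the script still runs where the module is missing.

```perl
use strict;
use warnings;

# Example page with a relative feed link; the URLs are placeholders.
my $html = <<'HTML';
<html>
<head>
  <title>Example</title>
  <link rel="alternate" type="application/rss+xml" href="index.rss">
</head>
<body></body>
</html>
HTML

my @feeds;

# Guard the load so the script does not die where Feed::Find is absent.
if (eval { require Feed::Find; 1 }) {
    # First argument: a reference to the HTML string.
    # Second argument: the page's own URI, so the relative href can be
    # made absolute.
    @feeds = Feed::Find->find_in_html(\$html, 'http://example.com/blog/');
    print "$_\n" for @feeds;
}
else {
    print "Feed::Find is not installed\n";
}
```

       Passing the string by reference avoids copying a large page, and supplying the base URI matters here because the example HTML does
       not declare one itself.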

LICENSE

       Feed::Find is free software; you may redistribute it and/or modify it under the same terms as Perl itself.

AUTHOR & COPYRIGHT
       Except where otherwise noted, Feed::Find is Copyright 2004 Benjamin Trott, ben+cpan@stupidfool.org. All rights reserved.

perl v5.10.1							    2011-01-28							   Feed::Find(3pm)