Extract URL from RSS Feed in AWK Post: 302449051

XML::RSS::Headline::UsePerlJournals(3pm)		User Contributed Perl Documentation		  XML::RSS::Headline::UsePerlJournals(3pm)

NAME

       XML::RSS::Headline::UsePerlJournals - XML::RSS::Headline Example Subclass

VERSION

       2.2

SYNOPSIS

       You can also subclass XML::RSS::Headline to tweak the RSS content to your liking.  In this example, I change the headline to remove the
       date/time and add the Use Perl Journal author's ID.  Also, in this use Perl; RSS feed you get the actual link to the journal entry rather
       than the link to the user's journal (that is, the journal URLs contain the entry's ID).

	   use XML::RSS::Feed;
	   use XML::RSS::Headline::UsePerlJournals;
	   use LWP::Simple qw(get);

	   my $feed = XML::RSS::Feed->new(
	       name  => "useperljournals",
	       hlobj => "XML::RSS::Headline::UsePerlJournals",
	       delay => 60,
	       url   => "http://use.perl.org/search.pl?tid=&query=&"
			. "author=&op=journals&content_type=rss",
	   );

	   while(1) {
	       $feed->parse(get($feed->url));
	       print $_->headline . "\n" for $feed->late_breaking_news;
	       sleep($feed->delay);
	   }

       Here is the output from rssbot on irc.perl.org in channel #news (which uses these modules):

	   <rssbot>  + [pudge] New Cool Journal RSS Feeds at use Perl;
	   <rssbot>    http://use.perl.org/~pudge/journal/21884

MUTATED METHOD

       $headline->item( $item )

       Init the object for a parsed RSS item returned by XML::RSS.
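
       The headline rewrite described in the SYNOPSIS can be illustrated with a small standalone sketch.  This is not the module's actual
       implementation: journal_headline is a hypothetical helper, and both the "~author" URL layout and the trailing "(date time)" stamp on
       the raw title are assumptions made for illustration (the timestamp in the sample input is made up).

```perl
use strict;
use warnings;

# Hypothetical helper, not part of XML::RSS::Feed: build a headline of the
# form "[author] title" from a journal entry's raw title and link.
sub journal_headline {
    my ( $title, $url ) = @_;

    # Assumption: the author's ID follows "~" in the entry URL,
    # e.g. http://use.perl.org/~pudge/journal/21884 gives "pudge".
    my ($author) = $url =~ m{/~([^/]+)/};

    # Assumption: the raw title ends with a parenthesised date/time stamp
    # such as "(2004.01.01 00:00)". Strip it if present.
    ( my $clean = $title ) =~ s/\s*\(\d{4}\.\d{2}\.\d{2}[^)]*\)\s*$//;

    # Fall back to the cleaned title when no author ID can be found.
    return defined $author ? "[$author] $clean" : $clean;
}

# Sample input; the timestamp is invented for the example.
print journal_headline(
    'New Cool Journal RSS Feeds at use Perl; (2004.01.01 00:00)',
    'http://use.perl.org/~pudge/journal/21884',
), "\n";
```

       In a real subclass, logic along these lines would presumably run inside the overridden item method after the parent class has
       populated the headline and URL.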

AUTHOR

       Jeff Bisbee, "<jbisbee at cpan.org>"

BUGS

       Please report any bugs or feature requests to "bug-xml-rss-feed at rt.cpan.org", or through the web interface at
       <http://rt.cpan.org/NoAuth/ReportBug.html?Queue=XML-RSS-Feed>.  I will be notified, and then you'll automatically be notified of progress
       on your bug as I make changes.

SUPPORT

       You can find documentation for this module with the perldoc command.

	   perldoc XML::RSS::Headline::UsePerlJournals

       You can also look for information at:

       * AnnoCPAN: Annotated CPAN documentation
	   <http://annocpan.org/dist/XML-RSS-Feed>

       * CPAN Ratings
	   <http://cpanratings.perl.org/d/XML-RSS-Feed>

       * RT: CPAN's request tracker
	   <http://rt.cpan.org/NoAuth/Bugs.html?Dist=XML-RSS-Feed>

       * Search CPAN
	   <http://search.cpan.org/dist/XML-RSS-Feed>

ACKNOWLEDGEMENTS

       Special thanks to Rocco Caputo, Martijn van Beers, Sean Burke, Prakash Kailasa and Randal Schwartz for their help, guidance, patience, and
       bug reports. Guys, thanks for actually taking the time to use the code and give good, honest feedback.

COPYRIGHT & LICENSE

       Copyright 2006 Jeff Bisbee, all rights reserved.

       This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

SEE ALSO

       XML::RSS::Feed, XML::RSS::Headline, XML::RSS::Headline::PerlJobs, XML::RSS::Headline::Fark, POE::Component::RSSAggregator

perl v5.8.8							    2006-07-17				  XML::RSS::Headline::UsePerlJournals(3pm)