Sponsored Content
Top Forums Shell Programming and Scripting Pattern matching extracting urls from rss, shell scripts Post 302312107 by BremboloIV on Thursday 30th of April 2009 11:24:47 AM
Old 04-30-2009
Pattern matching extracting urls from rss, shell scripts

Hi all, how could i do ?

I have a Rss file, i want to extract only the Urls (many) matching http://www.xxx.com/trailers/ from that file and copy into another file.


like

"
<pubDate>Wed, 29 Apr 2009 00:00:00 PST</pubDate>
<content:encoded><![CDATA[<table><tr valign="top"><td width="67"><a href="Apple - Movie Trailers - The Hangover"><img src="http://images.apple.com/moviesxml/s/wb/posters/thehangover_m200904271417.jpg" width="65" height="97" border="0"></a></td><td> &nbsp; </td><td><a href="Apple - Movie Trailers - The Hangover/"><span style="font-size: 16px; font-weight: 900; text-decoration: underline;">The Hangover - Trailer 2</span></a><br /><span style="font-size: 12px;">Two days before his wedding, Doug and his three friends drive to Las Vegas for a blow-out bachelor party they&#x2019;ll never forget. But, in fact, when the three groomsmen wake up the ustin Bartha</span></td></tr></table>]]></content:encoded> .....

all made with bash script file.

thanks 4 help !

Smilie
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

shell pattern matching

Hello Members I am facing a problem regarding pattern matching.please guide me to solve the issue.My requirement is like: There is table in oracle database, in that table contain columns ,inside the column so many files are there. my requirement is that to search a pattern for example: pattern... (5 Replies)
Discussion started by: rakeshforum
5 Replies

2. Shell Programming and Scripting

Problem extracting just a part of a matching pattern

Hello everyone, this is my first post so please give me a hand. I apologize for my English, I'll try to be clear with my request. I need to write a script (Bash) which finds all the variables defined in the file .h of the folder and then writes the name of the files .c where these variables are... (1 Reply)
Discussion started by: paxilpaz
1 Replies

3. Shell Programming and Scripting

help extracting a matching pattern and next lines of match

Hi there, i'm having some problems just making an awk script (i've tried this way, but other way can be posible for sure), for the next file file.txt <register> <createProfile> <result>0</result> <description><!]></description> <msisdn>34661461174</msisdn> <inputOmvID>1</inputOmvID>... (6 Replies)
Discussion started by: vicious
6 Replies

4. Shell Programming and Scripting

shell script pattern matching

Hi, I need to create a shell script through which i need to populate email addresses in email columns of database table in mysql. Let say if email contains yahoo, hotmail, gtalk than email addresses need to move in their respective columns. # !/bin/sh yim="example@yahoo.com"... (3 Replies)
Discussion started by: mirfan
3 Replies

5. Shell Programming and Scripting

Extracting the strings matching a pattern from a word

Hi All , I need to extract the strings that are matching with the pattern : CUST.<AnyStringOfAnyLength>.<AnyStringOfAnyLength> from a file and then write all these string into another file. e.g. If a file SOURCE contains following lines : IF(CUST.ABCD.EFGH==1) THEN CUST.ABCD.EFGH =... (7 Replies)
Discussion started by: swapnil.nawale
7 Replies

6. Shell Programming and Scripting

Extracting a string matching a pattern from a line

Hi All, I am pretty new to pattern matching and extraction using shell scripting. Could anyone please help me in extracting the word matching a pattern from a line in bash. Input Sample (can vary between any of the 3 samples below): 1) Adaptec SCSI RAID 5445 2) Adaptec SCSI 5445S RAID 3)... (8 Replies)
Discussion started by: jharish
8 Replies

7. Shell Programming and Scripting

Korn Shell for pattern matching and extracting

Guys, i'm new to shell scripting. Here's what i need. I need a shell script which would read a file containing only 1 line which never changes. File containts - SQL_Mgd_Svc_ELONMCL54496 |EMEA\brookkev, EMEA\fieldgra, EMEA\tidmamar, EMEA\attfiste, EMEA\baldogar, EMEA\clarkia2, EMEA\conwasha,... (9 Replies)
Discussion started by: butterfly20
9 Replies

8. Shell Programming and Scripting

Pattern Matching and extracting the required fields in Perl

Hi All, I am writing the following Perl Scrip and need your help in Pattern matching : I have the following Shell Script that would read line by line from the file (file_svn) and would inturn calls the Perl Script: #!/bin/bash perl_path="/home/dev/filter"... (2 Replies)
Discussion started by: filter
2 Replies

9. UNIX for Dummies Questions & Answers

Extracting sub-string matching the pattern.

Hi, I have a string looks like the following: USERS 32767.9844 UNDOTBS1 32767.9844 SYSAUX 32767.9844 SYSTEM 32767.9844 EMS 8192 EMS 8192 EMS_INDEXES 4096 EMS_INDEXES 4096 8 rows selected. How do I extract a sub-string to get the expected output as following: EMS 8192 EMS_INDEXES 4096 ... (3 Replies)
Discussion started by: NetBear
3 Replies

10. Shell Programming and Scripting

Extracting urls from curl output

Hello. I use curl to fetch a website, then, I want to extract the URLs from this curls output. I tried both sed and grep, but couldnt figure it out. Ive tried : sed -n 's/href="\(*\).*/\1/p' results.txt and grep -o grep -o '<a href="http://*.*.*/*">' results.txt. What pattern... (6 Replies)
Discussion started by: jozo95
6 Replies
MIRRORTOOL(1)							OMT documentation.						     MIRRORTOOL(1)

NAME
mirrortool.pl - OpaL Mirror Tool (OMT) DESCRIPTION
Creates a mirror of a webpage. It has a number of features such as link rewriting and more. (See the options below). USAGE
mirrortool.pl [options] [url] [options] [url] [...] OPTIONS
--images : Include <img src=xxx>:s in the download. (default) --noimages : Do not include <img src=xxx>:s in the download. --depth n : Maximum recursion depth. (default 1) --store "regexp" : Files matching regexp are actually stored locally. : It is possible to | separate (with or). --rewrite "from=>to" : Urls are rewritten using this rules. : It is possible to | separate (with or). : Do not rewrite the dir, because that it will affect : later lookup. Have to fix this sometime. --what "regexp" : Files matching regexp are downloaded and traversed. : It is possible to | separate (with or). --dir basedir : Where to store local files. --nohostcheck : Do not check if url points to other host. --notreecheck : Do not check if url points to other dirtree. --force : Overwrite all files. --debug : Print debug-messages. --retry n : Number of times an url will be retried (default 1) --auth user:pass : use Basic Authentication --proxy url : Use a proxy server (like http://u:p@localhost/). --help : Print this text. AUTHOR
Ola Lundqvist <opal@lysator.liu.se> SEE ALSO
mirrortool.pl(1) perl v5.8.8 2002-04-15 MIRRORTOOL(1)
All times are GMT -4. The time now is 04:42 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy