02-17-2011
Make your question easier by displaying part of the input data and desired output.
10 More Discussions You Might Find Interesting
1. Linux
Hi All,
I have following example file
i want to remove all html tags only,
Input File:
<html>
<head>
<title>Software Solutions Inc., </title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
</head>
<body bgcolor=white leftmargin="0" topmargin="0"... (2 Replies)
Discussion started by: btech_raju
2 Replies
2. Solaris
Hi All,
I have an external scsi harddrive (HD) connected directly to the workstation. I understand when the external HD is connected and turned on, and type in "devfsadm" command. Unix will detect it but not mount the drive.
So by typing in "format" command it will display the following:
#... (6 Replies)
Discussion started by: tlee
6 Replies
3. Shell Programming and Scripting
I'm going to have a text file formatted something like this:
some_name http://www.someurl.com/
another_name http://www.anotherurl.com/
third_name http://www.thirdurl.com/
I need to write a script that can rsync from a file path I'll set, to each URL in the list.
Any ideas? (8 Replies)
Discussion started by: ibsen
8 Replies
4. Shell Programming and Scripting
Hello,
i try to extract urls from google-search-results, but i have problem with sed filtering of html-code.
what i wont is just list of urls thay apears between ........<p><a href=" and next following " in html code.
here is my code, i use wget and pipelines to filtering. wget works, but... (13 Replies)
Discussion started by: L0rd
13 Replies
5. Web Development
Hi, I have problems with mod rewrite. I will try to describe...
I want clean urls but fail to make it work propperly. Maybe I have problems, because the content displayed is fetched from my other site...
There is a lot of stuff I already red about this, but somehow I can not find a solution... (2 Replies)
Discussion started by: lowmaster
2 Replies
6. Shell Programming and Scripting
Hi,
I need to basically get a list of all the tarballs located at uri
I am currently doing a wget on urito get the index.html page
Now this index page contains the list of uris that I want to use in my bash script.
can someone please guide me ,.
I am new to Linux and shell scripting.
... (5 Replies)
Discussion started by: mnanavati
5 Replies
7. Shell Programming and Scripting
Does anybody know how to remove all urls from html files?
all urls are links with anchor texts in the form of
<a href="http://www.anydomain.com">ANCHOR</a>
they may start with www or not.
Goal is to delete all urls and keep the ANCHOR text and if possible to change tags around anchor to... (2 Replies)
Discussion started by: georgi58
2 Replies
8. Shell Programming and Scripting
I am trying to remove a multiline HTML tag and its contents from a few HTML files following the same basic pattern. So far using regex and sed have been unsuccessful. The HTML has a basic structure like this (with the normal HTML stuff around it):
<div id="div1">
<div class="div2">
<other... (4 Replies)
Discussion started by: threesixtyfive
4 Replies
9. Shell Programming and Scripting
I am working on a web-concordance of Old Avestan and my concordance has produced a HTML file
The sort deployed by the HTML file is not something which we normally use. I have tried my best to force a sort within the concordance itself, but the sort order does not work.
I am giving below the sort... (6 Replies)
Discussion started by: gimley
6 Replies
10. UNIX for Beginners Questions & Answers
Hi All,
We have a HTML source which will be processed using a informatica workflow. In between these two we have a Unix script which transforms the file.
We are getting an error from past week in the informatica saying invalid format, because the file has unused html reference (0-8,14-31 etc)... (2 Replies)
Discussion started by: karthik adiga
2 Replies
LEARN ABOUT SUSE
html::assubs
HTML::AsSubs(3) User Contributed Perl Documentation HTML::AsSubs(3)
NAME
HTML::AsSubs - functions that construct a HTML syntax tree
SYNOPSIS
use HTML::AsSubs;
$h = body(
h1("This is the heading"),
p("This is the first paragraph which contains a ",
a({href=>'link.html'}, "link"),
" and an ",
img({src=>'img.gif', alt=>'image'}),
"."
),
);
print $h->as_HTML;
DESCRIPTION
This module exports functions that can be used to construct various HTML elements. The functions are named after the tags of the
correponding HTML element and are all written in lower case. If the first argument is a hash reference then it will be used to initialize
the attributes of this element. The remaining arguments are regarded as content.
For a similar idea (i.e., it's another case where the syntax tree of the Perl source mirrors the syntax tree of the HTML produced), see
HTML::Element's "new_from_lol" method.
For what I now think is a cleaner implementation of this same idea, see the excellent module "XML::Generator", which is what I suggest for
actual real-life use. (I suggest this over "HTML::AsSubs" and over "CGI.pm"'s HTML-making functions.)
ACKNOWLEDGEMENT
This module was inspired by the following message:
Date: Tue, 4 Oct 1994 16:11:30 +0100
Subject: Wow! I have a large lightbulb above my head!
Take a moment to consider these lines:
%OVERLOAD=( '""' => sub { join("", @{$_[0]}) } );
sub html { my($type)=shift; bless ["<$type>", @_, "</$type>"]; }
:-) I *love* Perl 5! Thankyou Larry and Ilya.
Regards,
Tim Bunce.
p.s. If you didn't get it, think about recursive data types: html(html())
p.p.s. I'll turn this into a much more practical example in a day or two.
p.p.p.s. It's a pity that overloads are not inherited. Is this a bug?
BUGS
The exported link() function overrides the builtin link() function. The exported tr() function must be called using &tr(...) syntax
because it clashes with the builtin tr/../../ operator.
SEE ALSO
HTML::Element, XML::Generator
Private Functions
_elem()
The _elem() function is wrapped by all the html 'tag' functions. It takes a tag-name, optional hashref of attributes and a list of content
as parameters.
perl v5.12.1 2006-08-04 HTML::AsSubs(3)