03-01-2012
[solved]extracting Line between HTML tag[/solved]
Hi everyone:
I want to extract string which is in between certain html tag.
e.g.
Quote:
<tag>I_want_extract_this_line.com</tag>
I tried with grep,cut, awk but could not find exact syntax for this one.
PS>Sorry about bad english.
Last edited by newlook2011; 03-01-2012 at 11:08 PM..
Reason: solved
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
I have a html file called myfile. If I simply put "cat myfile.html" in UNIX, it shows all the html tags like <a href=r/26><img src="http://www>. But I want to extract only text part.
Same problem happens in "type" command in MS-DOS.
I know you can do it by opening it in Internet Explorer,... (4 Replies)
Discussion started by: los111
4 Replies
2. Shell Programming and Scripting
Hai friends
I have a small doubt..
how can we use html tag in shell scripting
code :
echo "<html>"
echo "<body>"
echo " welcome to peace world "
echo "</body>"
echo "</html>"
output displayed like this:
<html>
<body>
welcome to peace world
</body>
</html> (5 Replies)
Discussion started by: jrex1983
5 Replies
3. Shell Programming and Scripting
Input:
<table class="pixelBorderTable faqTable" width="100%" border="1" cellpadding="3" cellspacing="0">
<tbody><tr>
<td class="pixelBorderTableHeaderTd" valign="top" width="20%" bgcolor="#666666"><p> </p></td>
<td class="pixelBorderTableHeaderTd" valign="top"... (1 Reply)
Discussion started by: cola
1 Replies
4. Shell Programming and Scripting
Guys,
I have a little script that I got of the internet and that I use in Squid to block ads.
I used that script with linux but now i have moved my servers to freebsd. I have a step learning curve there but it is fun: Back to the script issue.
The script used to work i with linux but... (15 Replies)
Discussion started by: zongo
15 Replies
5. Shell Programming and Scripting
Hi All,
Find the following code:
<Universal>D38x82j1JJ
</Universal>
I want to retrieve the value of <Universal> tag as below:
Please help me. (3 Replies)
Discussion started by: mjavalkar
3 Replies
6. Shell Programming and Scripting
Hi,
i have 30 html files and i want to add the html tag first (<html>) and end of the line </html> tag..How to do it in script.
Thanks, (7 Replies)
Discussion started by: bmk
7 Replies
7. Shell Programming and Scripting
Hi
I am new to string extractions in shell script... I am trying to extract a string such as #1753 from html tag looks like below.
<a class="model-link tl-tr" href="lastSuccessfulBuild/">Last successful build (#1753), 40 min ago</a>
and want the value as
1753
Could someone help me to... (3 Replies)
Discussion started by: hicharbo
3 Replies
8. Shell Programming and Scripting
I want to print from <fruits> to </fruits> tag which have <fruit> as mango. Also i want both <fruits> and </fruits> in output. Please help
eg.
<fruits>
<fruit id="111">mango<fruit>
.
another 20 lines
.
</fruits> (3 Replies)
Discussion started by: Ashik409
3 Replies
9. Shell Programming and Scripting
Hi,
I have a html line as below :-... (6 Replies)
Discussion started by: satishmallidi
6 Replies
10. Shell Programming and Scripting
In a huge log file (43MB, 43k lines) I am trying to extract data between two tag pairs on same line and export it to a file so I can pull it into Excel for a report.
One Pair is <Text>data I need</Text>
Other pair follows on same line and is <TimeStamp>more data I need</TimeStamp>
I would need... (2 Replies)
Discussion started by: NanookArctic
2 Replies
LEARN ABOUT DEBIAN
html::quoted
HTML::Quoted(3pm) User Contributed Perl Documentation HTML::Quoted(3pm)
NAME
HTML::Quoted - extract structure of quoted HTML mail message
SYNOPSIS
use HTML::Quoted;
my $html = '...';
my $struct = HTML::Quoted->extract( $html );
DESCRIPTION
Parses and extracts quotation structure out of a HTML message. Purpose and returned structures are very similar to Text::Quoted.
SUPPORTED FORMATS
Variouse MUAs use quite different approaches for quoting in mails.
Some use blockquote tag and it's quite easy to parse.
Some wrap text into p tags and add '>' in the beginning of the paragraphs.
Things gettign messier when it's an HTML reply on plain text mail thread.
If you found format that is not supported then file a bug report via rt.cpan.org with as short as possible example. Test file is even
better. Test file with patch is the best. Not obviouse patches without tests suck.
METHODS
extract
my $struct = HTML::Quoted->extract( $html );
Takes a string with HTML and returns array reference. Each element in the array either array or hash. For example:
[
{ 'raw' => 'Hi,' },
{ 'raw' => '<div><br><div>On date X wrote:<br>' },
[
{ 'raw' => '<blockquote>' },
{ 'raw' => 'Hello,' },
{ 'raw' => '<div>How are you?</div>' },
{ 'raw' => '</blockquote>' }
],
...
]
Hashes represent a part of the html. The following keys are meaningful at the moment:
o raw - raw HTML
o quoter_raw, quoter - raw and decoded (entities are converted) quoter if block is prefixed with quoting characters
AUTHOR
Ruslan.Zakirov <ruz@bestpractical.com>
LICENSE
Under the same terms as perl itself.
perl v5.10.1 2011-01-09 HTML::Quoted(3pm)