10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I'm extracting text between table tags in HTML
<th><a href="/wiki/Buick_LeSabre" title="Buick LeSabre">Buick LeSabre</a></th>
using this:
awk -F "</*th>" '/<\/*th>/ {print $2}' auto2 > auto3
then this (text between a href):
sed -e 's/\(<*>\)//g' auto3 > auto4
How to shorten this into one... (8 Replies)
Discussion started by: p1ne
8 Replies
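One possible way to fold the two steps into a single command, offered as a sketch rather than the thread's accepted answer. Note that `<*>` in the second command matches only a run of `<` followed by `>`, so the sketch uses `<[^>]*>` to match a whole tag. The function name `th_text` is hypothetical, and the approach assumes each `<th>` cell sits on one line, as in the sample.

```shell
# Hypothetical helper: print the visible text of every line that has a <th> cell.
th_text() {
  # On lines containing <th> or </th>, delete every tag, then print what is left.
  sed -n '/<\/*th>/{s/<[^>]*>//g;p;}' "$@"
}

# Usage with the sample line from the post:
printf '%s\n' '<th><a href="/wiki/Buick_LeSabre" title="Buick LeSabre">Buick LeSabre</a></th>' | th_text
# prints: Buick LeSabre
```

With a file, this would be called as `th_text auto2 > auto4`, skipping the intermediate `auto3`.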
2. Shell Programming and Scripting
Hi Expert,
Is there any other way to print the content between two HTML tags and write it back to the same file?
Here is the sample:
cat file.html
<div id="outline">
hello world<br>
</div>
<div id="container_faq">
test1<br>
</div>
<div class="widget_quick">
thead test<br>
</div>
... (3 Replies)
Discussion started by: lxdorney
3 Replies
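The excerpt is cut off, so which block is wanted is an assumption here. As a minimal sketch, the hypothetical function `div_content` prints the lines inside one `<div id="...">` block, assuming the opening and closing tags each sit on their own line as in the sample. Redirecting straight to the input file would truncate it before it is read, so the result goes to a temporary file first.

```shell
# Hypothetical helper: print the lines inside <div id="ID"> ... </div>.
# $1 = value of the id attribute, $2 = input file.
div_content() {
  awk -v start="<div id=\"$1\">" '
    $0 == start              { inside = 1; next }  # opening tag: start copying
    inside && $0 == "</div>" { inside = 0; next }  # closing tag: stop copying
    inside                   { print }
  ' "$2"
}

# Build a small copy of the sample in a scratch directory.
tmpdir=$(mktemp -d)
cat > "$tmpdir/file.html" <<'EOF'
<div id="outline">
hello world<br>
</div>
<div id="container_faq">
test1<br>
</div>
EOF

# Write to a temporary file, then replace the original with it.
div_content outline "$tmpdir/file.html" > "$tmpdir/file.html.tmp" &&
  mv "$tmpdir/file.html.tmp" "$tmpdir/file.html"
cat "$tmpdir/file.html"
```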
3. UNIX for Dummies Questions & Answers
Ok, so this is stupid simple, and I know I am going to feel like an idiot when I get help.
I am altering an HTML report that has contraband in it so that the links to said contraband and the images are not shown.
The link/img pairs are in the form of:
<a... (5 Replies)
Discussion started by: twjolson
5 Replies
4. Shell Programming and Scripting
Hi, I'm trying to get some data from an HTML file, but the problem is that several patterns have to be matched before the information can be extracted.
https://www.unix.com/shell-programming-scripting/150711-extract-data-awk-html-files.html
Is a similar problem. The only... (5 Replies)
Discussion started by: counfhou
5 Replies
5. Shell Programming and Scripting
I am attempting to extract weather data from the following website, but for the Victoria area only:
Text Forecasts - Environment Canada
I use this:
sed -n "/Greater Victoria./,/Fraser Valley./p"
But that sometimes does not get it all, and I think perhaps the website has more... (2 Replies)
Discussion started by: lagagnon
2 Replies
6. Shell Programming and Scripting
I have pasted the contents of a log file (swmbackup.wrkstn.1262071383.sales2a) below:
Workstation: sales2a<BR
Vault sales2a-hogwarts will be initialized.<BR
<font color="red"There was a problem mounting /mnt/sales2a/desktop$ </FONT<BR
<font color="red"There was a problem mounting... (4 Replies)
Discussion started by: bigtonydallas
4 Replies
7. Shell Programming and Scripting
Hello,
I'm trying to extract URLs from Google search results, but I have a problem with the sed filtering of the HTML code.
What I want is just a list of the URLs that appear between ........<p><a href=" and the next following " in the HTML code.
Here is my code; I use wget and pipelines for the filtering. wget works, but... (13 Replies)
Discussion started by: L0rd
13 Replies
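A minimal sketch of one way to pull out those URLs, not the code from the thread. It assumes a grep that supports `-o` (GNU and BSD grep do; the option is not in POSIX) and that each `<p><a href="` and its closing quote are on the same line. The function name `extract_urls` is hypothetical.

```shell
# Hypothetical helper: read HTML on stdin, print one URL per line.
extract_urls() {
  # -o prints each match on its own line; the match runs from <p><a href="
  # up to the next double quote. sed then trims the prefix and the quote.
  grep -o '<p><a href="[^"]*"' |
    sed -e 's/^<p><a href="//' -e 's/"$//'
}

# Usage with a made-up line of HTML:
printf '%s\n' '<p><a href="http://example.com/a">A</a> <p><a href="http://example.org/b">B</a>' | extract_urls
```

In a pipeline this would sit after the download step, for example `wget -qO- "$url" | extract_urls`, where `$url` stands for whatever page is being fetched.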
8. Shell Programming and Scripting
Hi All,
I'm trying to extract some floating point numbers from within some HTML code like this:
<TR><TD class='awrc'>Parse CPU to Parse Elapsd %:</TD><TD ALIGN='right' class='awrc'> 64.50</TD><TD class='awrc'>% Non-Parse CPU:</TD><TD ALIGN='right' class='awrc'> ... (2 Replies)
Discussion started by: pondlife
2 Replies
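A rough sketch of one approach, assuming the numbers of interest are the only content of their table cell and that grep supports `-o` (GNU and BSD grep do). The function name `cell_numbers` is hypothetical. Anchoring the match between `>` and `<` keeps digits inside attributes from being picked up.

```shell
# Hypothetical helper: read HTML on stdin, print each number that fills a cell.
cell_numbers() {
  # Match ">", optional spaces, a number with an optional fraction,
  # optional spaces, "<"; then strip everything except digits and dots.
  grep -oE '>[[:space:]]*[0-9]+(\.[0-9]+)?[[:space:]]*<' |
    sed 's/[^0-9.]//g'
}

# Usage with part of the sample row from the post:
printf '%s\n' "<TD class='awrc'>Parse CPU to Parse Elapsd %:</TD><TD ALIGN='right' class='awrc'> 64.50</TD>" | cell_numbers
# prints: 64.50
```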
9. UNIX for Advanced & Expert Users
Hiya,
I am trying to extract a news article from a web page. The sed I have written brings back a lot of JavaScript code and sometimes advertisements too. Can anyone please help with this one? I need to fix this sed so it picks up the article ONLY (don't worry about the title or date .. i got... (2 Replies)
Discussion started by: stargazerr
2 Replies
10. Shell Programming and Scripting
I am cleaning up HTML with sed. With the regexp
<a name="+"></a><h>*<span class="mw-headline" >+</span></h>
I can find the tags I need. But when I place them in a sed command, sed fails. So I started building up from a smaller command. This is where I am now:
sed -r -e s/"<a... (3 Replies)
Discussion started by: DocBrewer
3 Replies