06-21-2012
Removing all except a couple of HTML tags from an HTML file
I tried to find an elegant (or at least simple) way to remove all but a couple of HTML tags from an HTML file, but all the examples I found dealt with removing all the tags.
The logic of the script would be:
- if there is <li> or <ul> on the line, do nothing (= write the same line to the output)
- if there is font class="titleA" on the line, substitute it with <h2>
- otherwise, if there is an HTML tag, remove it (= write the line to the output without the tags, just the content)
Could someone please tell me how to approach this problem? I know some Perl, but my skills are rusty (it has been years since I last used it).
Last edited by juubuntu; 06-21-2012 at 09:12 AM..
Reason: typo
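One possible way to approach the logic described above is a line-by-line filter built on regular expressions. The sketch below is in Python rather than Perl, but the same patterns should carry over to Perl's s/// almost unchanged. It is only a sketch under several assumptions that the post does not settle: the name filter_line is made up for illustration, a line containing a closing </li> or </ul> is also kept as-is, the tag to rewrite is taken to be <font class="titleA"> (possibly with other attributes), tags never span more than one line, and no attribute value contains a ">" character. The closing </font> is removed like any other tag, because the post does not say what should replace it, so the resulting <h2> is left unclosed.

```python
import re
import sys

# Lines containing <li> or <ul> are written unchanged.
# Treating the closing forms </li> and </ul> the same way is an assumption.
KEEP_LINE = re.compile(r"</?(?:li|ul)\b[^>]*>", re.IGNORECASE)

# The opening tag that becomes <h2>. Allowing either quote style and
# extra attributes around class="titleA" is an assumption.
TITLE_OPEN = re.compile(r"<font\b[^>]*\bclass\s*=\s*[\"']titleA[\"'][^>]*>")

# Any tag at all. Assumes tags do not span lines and that attribute
# values never contain '>'.
ANY_TAG = re.compile(r"<[^>]+>")


def replace_tag(match):
    """Return '<h2>' for the titleA font tag and '' for every other tag."""
    if TITLE_OPEN.fullmatch(match.group(0)):
        return "<h2>"
    return ""


def filter_line(line):
    """Apply the three rules from the post to a single line (hypothetical helper)."""
    if KEEP_LINE.search(line):
        return line
    return ANY_TAG.sub(replace_tag, line)


if __name__ == "__main__":
    # Small made-up sample, not taken from the original post.
    sample = [
        "<ul>\n",
        '<li><a href="a.html">kept as is</a></li>\n',
        "</ul>\n",
        '<font class="titleA">Title</font>\n',
        "<p>Some <b>bold</b> text</p>\n",
    ]
    sys.stdout.writelines(filter_line(line) for line in sample)
```

To run this over a real file, the sample list could be replaced with sys.stdin and the script invoked with the input and output redirected. For HTML that is messier than the assumptions above allow (tags split across lines, comments, scripts), a real HTML parser is likely a safer choice than regular expressions.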