04-25-2008
How to supplement HTML tags with SED
I am cleaning up HTML with sed. With the regexp
<a name="[A-Za-z0-9 ?_.]+"></a><h[123]>[ ]*<span class="mw-headline" >[A-Za-z0-9 ?_.]+</span></h[123]>
I can find the tags I need. But when I place them in a sed command, sed fails. So I started building up from a smaller command. This is where I am now:
sed -r -e s/"<a name=\"/replacement/ <in >out
This works. But when I enter:
sed -r -e s/"<a name=\"[A-Za-z0-9 ?_.]+"/replacement/ <in >out
it fails with:
sed: can't read <in: Invalid argument
sed: can't read >out: Invalid argument
But the in file is really there. How can I get the regexp in the sed command? I have tried escaping/not escaping chars, but sed does not seem to accept it.
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi,
I am trying to strip html tags of a string for example
<TD>no problem</TD>
the sesult should be
no problem
but could never get rid off all the tags
sed 's/<..D>//g'
Please help, I am new (3 Replies)
Discussion started by: zap
3 Replies
2. Shell Programming and Scripting
Hello,
I am using sed as follows -
sed 's/CONTACT SYSTEMS! Some payments have been rejected/<B><font color="red" size="5.0pt"CONTACT SYSTEMS! Some payments have been rejected</font></B>/' $REPORT_FILE
But while executing this, I am getting the error as -
sed: command garbled
&... (5 Replies)
Discussion started by: The Observer
5 Replies
3. Shell Programming and Scripting
Hi, I am working on transforming html code text into the .vert text format. I want to use linux utility sed. I have this regexp which should do the work: s/ \(?!*>\)/\n/g. I use it like this with sed: echo "you <we try> there" | sed 's/ \(?!*>\)/\n/g' ... The demanded output should be:
you
<we... (5 Replies)
Discussion started by: matt1311
5 Replies
4. Shell Programming and Scripting
How to use sed to remove html tags including text between them?
Example: User <b> rolvak </b> is stupid. It does not using <b>OOP</b>!
and should output: User is stupid. It does not using !
Thank you.. (2 Replies)
Discussion started by: alphagon
2 Replies
5. Shell Programming and Scripting
I have pasted the contents of a log file (swmbackup.wrkstn.1262071383.sales2a) below:
Workstation: sales2a<BR
Vault sales2a-hogwarts will be initialized.<BR
<font color="red"There was a problem mounting /mnt/sales2a/desktop$ </FONT<BR
<font color="red"There was a problem mounting... (4 Replies)
Discussion started by: bigtonydallas
4 Replies
6. Shell Programming and Scripting
Hi
I've searched for it for few hours now and i can't seem to find anything working like i want. I've got webpage, saved in file par with form like this:
<html><body><form name='sendme' action='http://example.com/' method='POST'>
<textarea name='1st'>abc123def678</textarea>
<textarea... (9 Replies)
Discussion started by: seb001
9 Replies
7. Shell Programming and Scripting
I tried to find elegant (or at least simple) way to remove all but couple of html tags from html file, but all examples I found dealt with removing all the tags.
The logic of the script would be:
- if there is <li> or <ul> on the line, do nothing (=write same line to output)
- if there is:... (0 Replies)
Discussion started by: juubuntu
0 Replies
8. UNIX for Dummies Questions & Answers
Ok, so this is stupid simple, and I know I am going to feel like an idiot when I get help.
I am altering a HTML report that has contraband in it so that the links to said contraband and the images are not shown.
The link/img pairs are in the form of :
<a... (5 Replies)
Discussion started by: twjolson
5 Replies
9. Shell Programming and Scripting
I need all the end tags of </font> to be replaced with new line yet enclosing tag to be retained </font>. Please help me in this regard.
Input:
<font>abc</font>def<font>ghi</font>
Output:
<font>abc</font>
def
<font>ghi</font> (3 Replies)
Discussion started by: Badhrish
3 Replies
10. UNIX for Beginners Questions & Answers
Hi,
im trying to read a Temperature value from html code.
So far i have managed to reduce the whole html page down to this single line with the following sed command:sed -n '/Temperature/p' $temp_temperature | tee temp_string
<TD width='350'>Temperature :</td><td>25... (2 Replies)
Discussion started by: naittis
2 Replies
LEARN ABOUT REDHAT
nwbpset
NWBPSET(1) nwbpset NWBPSET(1)
NAME
nwbpset - Create a bindery property or set its value
SYNOPSIS
nwbpset [ -h ] [ -S server ] [ -U user name ] [ -P password | -n ] [ -C ]
DESCRIPTION
nwbpset Reads a property specification from the standard input and creates and sets the corresponding property. The format is determined by
the output of 'nwbpvalues -c'. nwbpset will hopefully become an important part of the bindery management suite of ncpfs, together with
As another example, look at the following command line:
nwbpvalues -t 1 -o supervisor -p user_defaults -c |
sed '2s/.*/ME/'|
sed '3s/.*/LOGIN_CONTROL/'|
nwbpset
With this command, the property user_defaults of the user object 'supervisor' is copied into the property login_control of the user object
'me'.
nwbpvalues -t 1 -o me -p login_control -c |
sed '9s/.*/ff/'|
nwbpset
This command disables the user object me.
Feel free to contribute other examples!
nwbpset looks up the file $HOME/.nwclient to find a file server, a user name and possibly a password. See nwclient(5) for more information.
Please note that the access permissions of $HOME/.nwclient MUST be 600 for security reasons.
OPTIONS
-h
-h is used to print out a short help text.
-S server
server is the name of the server you want to use.
-U user
user is the user name to use for login.
-P password
password is the password to use for login. If neither -n nor -P are given, and the user has no open connection to the server, nwbpset
prompts for a password.
-n
-n should be given if no password is required for the login.
-C
By default, passwords are converted to uppercase before they are sent to the server, because most servers require this. You can turn off
this conversion by -C.
AUTHORS
nwbpset was written by Volker Lendecke. See the Changes file of ncpfs for other contributors.
nwbpset 8/7/1996 NWBPSET(1)