Removing unwanted tags from xml file Post: 302569572

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Removing leading and trailing spaces of data between the tags in xml.

I am having xml document as below. <transactionid> 00 </transactionid> <tracknumber> 0 </tracknumber> <key> N/A </key> But the data contains leading and trailing spaces between the tags. Please let me know how can i remove these leading and trailing spaces between the tags....

2. Shell Programming and Scripting

Remove unwanted XML Tags

I have set of sources and the respective resolution. Please advice how to resolve the same using Unix shell scripting. Source 1: ======= <ext:ContactInfo xmlns:ext="urn:AOL.FLOWS.Extensions"> <ext:InternetEmailAddress>AOL@AOL.COM</ext:InternetEmailAddress> </ext:ContactInfo> Resoultion...

3. UNIX for Dummies Questions & Answers

Removing spaces between XML tags<XX XX> -> <XXXX>

hey guys, i have an XML like this: <documents> <document> <Object ID>100114699999</Object ID> <Object Create Date Time>2008-04-07T00:00:00</Object Create Date Time> </document> <documents> I need all my tags within the XML to not include any spaces. i.e. everything between <t a g> in...

4. Shell Programming and Scripting

Help in removing xml tags

Hi, I have a input xml file like this <postalAddress:>379 PROSPECT ST </postalAddress:> <street:>STE B </street:> <l:>TORRINGTON </l:> <st:>CT</st:> <postalCode:>067905238</postalCode:>...

5. Shell Programming and Scripting

removing unwanted characters from a file

i have a file like this 1111_2222#$#$dudgfdk 11111111_343434#$#$334 1111_22222#43445667 i want to remove all those charachetrs from # how can i do this Thank in advance Saravanan

6. Shell Programming and Scripting

Filter a .kml file (xml) to remove unwanted entries

Ok, i have a .kml file that that i want to trim down and get rid of the rubbish from. its formatted like so: <Placemark> <name><!]></name> <description><! Frequency: 2437 Timestamp: 1304892397000 Date: 2011-05-08...

7. Shell Programming and Scripting

How to add the multiple lines of xml tags before a particular xml tag in a file

Hi All, I'm stuck with adding multiple lines(irrespective of line number) to a file before a particular xml tag. Please help me. <A>testing_Location</A> <value>LA</value> <zone>US</zone> Region <value>Russia</value> <zone>Washington</zone> <C>Country</C>...

8. Shell Programming and Scripting

Removing extra unwanted spaces

hi, i need to remove the extra spaces in the filed. Sample: abc~bd ~bkd123 .. 1space abc~badf ~bakdsf123 .. 2space abc~bqed ~bakuowe .. 3space output: abc~bd ~bkd123 .. 1space abc~badf~bakdsf123 .. 2space abc~bqed~bakuowe .. 3space i used the following command,

9. Shell Programming and Scripting

Removing extra unwanted spaces

10. UNIX for Beginners Questions & Answers

Removing unwanted symbols with sed

I would like produce blue, green, red, yellowfrom"blue:,*green:,*red:,*yellowI can remove the colon with echo "blue:,*green:,*red:,*yellow" | sed 's/://g'which givesblue,*green,*red,*yellowbut when I try echo "blue:,*green:,*red:,*yellow" | sed 's/://g'; 's/*//g'I get bash: s/*//g: No such...

LEARN ABOUT DEBIAN

mkdoc::xml::stripper

MKDoc::XML::Stripper(3pm)				User Contributed Perl Documentation				 MKDoc::XML::Stripper(3pm)

NAME

       MKDoc::XML::Stripper - Remove unwanted XML / XHTML tags and attributes

SYNOPSIS

	 use MKDoc::XML::Stripper;

	 my $stripper = new MKDoc::XML::Stripper;
	 $stripper->allow (qw /p class id/);

	 my $ugly = '<p class="para" style="color:red">Hello, <strong>World</strong>!</p>';
	 my $neat = $stripper->process_data ($ugly);
	 print $neat;

       Should print:

	 <p class="para">Hello, World!</p>

SUMMARY

       MKDoc::XML::Stripper is a class which lets you specify a set of tags and attributes which you want to allow, and then cheekily strip any
       XML of unwanted tags and attributes.

       In MKDoc, this is used so that editors use structural XHTML rather than presentational tags, i.e. strip anything which looks like a <font>
       tag, a 'style' attribute or other tags which would break separation of structure from content.

DISCLAIMER

       This module does low level XML manipulation. It will somehow parse even broken XML and try to do something with it. Do not use it unless
       you know what you're doing.

API

   my $stripper = MKDoc::XML::Stripper->new()
       Instantiates a new MKDoc::XML::Stripper object.

   $stripper->load_def ($def_name);
       Loads a definition located somewhere in @INC under MKDoc/XML/Stripper.

       Available definitions are:

       xhtml10frameset
       xhtml10strict
       xhtml10transitional
       mkdoc16 - MKDoc 1.6. XHTML structural markup

       You can also load your own definition file, for instance:

	 $stripper->load_def ('my_def.txt');

       Definitions are simple text files as follows:

	 # allow p with 'class' and id
	 p class
	 p id

	 # allow more stuff
	 td class
	 td id
	 td style

	 # etc...

   $stripper->allow ($tag, @attributes)
       Allows "<$tag>" to appear in the stripped XML. Additionally, allows @attributes to appear as attributes of <$tag>, so for instance:

	 $stripper->allow ('p', 'class', 'id');

       Will allow the following:

	 <p>
	 <p class="foo">
	 <p id="bar">
	 <p class="foo" id="bar">

       However any extra attributes will be stripped, i.e.

	 <p class="foo" id="bar" style="font-color: red">

       Will be rewritten as

	 <p class="foo" id="bar">

   $stripper->disallow ($tag)
       Explicitly disallows a tag and all its associated attributes.  By default everything is disallowed.

   $stripper->process_data ($some_xml);
       Strips $some_xml according to the rules that were given with the allow() and disallow() methods and returns the result. Does not modify
       $some_xml in place.

   $stripper->process_file ('/an/xml/file.xml');
       Strips '/an/xml/file.xml' according to the rules that were given with the allow() and disallow() methods and returns the result. Does not
       modify '/an/xml/file.xml' in place.

NOTES

       MKDoc::XML::Stripper does not really parse the XML file you're giving to it nor does it care if the XML is well-formed or not. It uses
       MKDoc::XML::Tokenizer to turn the XML / XHTML file into a series of MKDoc::XML::Token objects and strictly operates on a list of tokens.

       For this same reason MKDoc::XML::Stripper does not support namespaces.

AUTHOR

       Copyright 2003 - MKDoc Holdings Ltd.

       Author: Jean-Michel Hiver

       This module is free software and is distributed under the same license as Perl itself. Use it at your own risk.

SEE ALSO

       MKDoc::XML::Tokenizer MKDoc::XML::Token

perl v5.10.1							    2004-10-06						 MKDoc::XML::Stripper(3pm)

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Removing leading and trailing spaces of data between the tags in xml.

Discussion started by: jhmr7

2. Shell Programming and Scripting

Remove unwanted XML Tags

Discussion started by: ambals123

3. UNIX for Dummies Questions & Answers

Removing spaces between XML tags<XX XX> -> <XXXX>

Discussion started by: sharoff

4. Shell Programming and Scripting

Help in removing xml tags

Discussion started by: pintoo

5. Shell Programming and Scripting

removing unwanted characters from a file

Discussion started by: saravanan71184

6. Shell Programming and Scripting

Filter a .kml file (xml) to remove unwanted entries

Discussion started by: Phear46

7. Shell Programming and Scripting

How to add the multiple lines of xml tags before a particular xml tag in a file

Discussion started by: mjavalkar

8. Shell Programming and Scripting

Removing extra unwanted spaces

Discussion started by: anshaa

9. Shell Programming and Scripting

Removing extra unwanted spaces

Discussion started by: anshaa

10. UNIX for Beginners Questions & Answers

Removing unwanted symbols with sed

Discussion started by: Xubuntu56

LEARN ABOUT DEBIAN

mkdoc::xml::stripper