Help in parsing xml file (sed/nawk) Post: 302546423

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

can i do XML parsing usind sed

Hi all... I want to parse a xml filein unix .. Can i use SED or unix script to parse the xml file .. If so can anyone show a sample script that will parse the xml file .. Thanks in advance, Arun ,,,,

2. UNIX for Advanced & Expert Users

Parsing xml file using Sed

Hi All, I have this(.xml) file as:  <instance name='ins_C2Londondev' user='' group='' fullname='B2%20-%20London%20(dev)' > <property> </property> </instance> I want output as:  <instance...

3. Shell Programming and Scripting

parsing xml with awk/sed

Hi people!, I need extract from the file (test-file.txt) the values between <context> and </context> tag's , the total are 7 lines,but i can only get 5 or 2 lines!!:confused: Please look my code: #awk '/context/{flag=1} /\/context/{flag=0} !/context/{ if (flag==1) p rint $0; }'...

4. Shell Programming and Scripting

parsing(xml) using nawk/awk

Hi , I have an xml format as shown below: <Info> <last name="sean" first name="john"/> <period="5" time="11"/> <test value="1",test2 value="2",test3 value="3",test4 value="5"> <old> <value1>1</value1> <value2>2</value2> </old> <new> <value1>4</value1> <value2>3</value2> </new>...

5. Shell Programming and Scripting

how to parse the file in xml format using awk/nawk

Hi All, I have an xml file with the below format. <a>111</a><b>222</b><c>333<c><d><e>123</e><f>234</f><d><e>456</e><f>789</f> output needed is 111,222,333,123,234 111,222,333,456,789 nawk 'BEGIN{FS="<|>"} {print a,b,c,e,f a="" ...

6. Shell Programming and Scripting

Parsing xml file

hi guys, great help to the original question, can i expand please? i have large files filled with blocks like this <Placemark> network type: hot line1 line2 line3 <styleUrl>red.png</styleUrl> </Placemark> <Placemark> network type: cold line1 line2 line3...

7. Shell Programming and Scripting

Need help parsing data with sed and/or nawk

Good day all. I have the following entries of data in a file in a column, however, I need this data written on a single line with several parameters in a different order. Current format: Treatment ,parmeter1=value ,parmeter2=value ,parmeter3=value ,parmeter4=value...

8. Shell Programming and Scripting

XML parsing using nawk help needed

i need one help, below is one more xml file with diff pattern i tried it but dint get it , iam sure its a peice of cake for you guys. <xn:MeContext id="LSVLKY001"> <xn:ManagedElement id="1"> <un:RncFunction id="1"> <un:UtranCell...

9. Shell Programming and Scripting

XML: parsing of the Google contacts XML file

I am trying to parse the XML Google contact file using tools like xmllint and I even dived into the XSL Style Sheets using xsltproc but I get nowhere. I can not supply any sample file as it contains private data but you can download your own contacts using this script: #!/bin/sh # imports...

10. UNIX for Dummies Questions & Answers

Parsing XML file

I want to parse xml file sample file....... <name locale="en">my_name<>/name><lastChanged>somedate</lastChanged><some more code here> <name locale="en">tablename1<>/name><lastChanged>somedate</lastChanged> <definition><dbquery><sources><sql type="cognos">select * from...

LEARN ABOUT DEBIAN

xml_pp

XML_PP(1p)						User Contributed Perl Documentation						XML_PP(1p)

NAME

       xml_pp - xml pretty-printer

SYNOPSYS

       xml_pp [options] [<files>]

DESCRIPTION

       XML pretty printer using XML::Twig

OPTIONS

       -i[<extension>]
	   edits the file(s) in place, if an extension is provided (no space between "-i" and the extension) then the original file is backed-up
	   with that extension

	   The rules for the extension are the same as Perl's (see perldoc perlrun): if the extension includes no "*" then it is appended to the
	   original file name, If the extension does contain one or more "*" characters, then each "*" is replaced with the current filename.

       -s <style>
	   the style to use for pretty printing: none, nsgmls, nice, indented, record, or record_c (see XML::Twig docs for the exact description
	   of those styles), 'indented' by default

       -p <tag(s)>
	   preserves white spaces in tags. You can use several "-p" options or quote the tags if you need more than one

       -e <encoding>
	   use XML::Twig output_encoding (based on Text::Iconv or Unicode::Map8 and Unicode::String) to set the output encoding. By default the
	   original encoding is preserved.

	   If this option is used the XML declaration is updated (and created if there was none).

	   Make sure that the encoding is supported by the parser you use if you want to be able to process the pretty_printed file (XML::Parser
	   does not support 'latin1' for example, you have to use 'iso-8859-1')

       -l  loads the documents in memory instead of outputing them as they are being parsed.

	   This prevents a bug (see BUGS) but uses more memory

       -f <file>
	   read the list of files to process from <file>, one per line

       -v  verbose (list the current file being processed)

       --  stop argument processing (to process files that start with -)

       -h  display help

EXAMPLES

	 xml_pp foo.xml > foo_pp.xml	       # pretty print foo.xml
	 xml_pp < foo.xml > foo_pp.xml	       # pretty print from standard input

	 xml_pp -v -i.bak *.xml 	       # pretty print .xml files, with backups
	 xml_pp -v -i'orig_*' *.xml	       # backups are named orig_<filename>

	 xml_pp -i -p pre foo.xhtml	       # preserve spaces in pre tags

	 xml_pp -i.bak -p 'pre code' foo.xml   # preserve spaces in pre and code tags
	 xml_pp -i.bak -p pre -p code foo.xml  # same

	 xml_pp -i -s record mydb_export.xml   # pretty print using the record style

	 xml_pp -e utf8 -i foo.xml	       # output will be in utf8
	 xml_pp -e iso-8859-1 -i foo.xml       # output will be in iso-8859-1

	 xml_pp -v -i.bak -f lof	       # pretty print in place files from lof

	 xml_pp -- -i.xml		       # pretty print the -i.xml file

	 xml_pp -l foo.xml		       # loads the entire file in memory
					       # before pretty printing it

	 xml_pp -h			       # display help

BUGS

       Elements with mixed content that start with an embedded element get an extra 


	 <elt><b>b</b>toto<b>bold</b></elt>

       will be output as

	 <elt>
	   <b>b</b>toto<b>bold</b></elt>

       Using the "-l" option solves this bug (but uses more memory)

TODO

       update XML::Twig to use Encode with perl 5.8.0

AUTHOR

       Michel Rodriguez <mirod@xmltwig.com>

perl v5.12.4							    2011-05-18								XML_PP(1p)