Shell script to extract data in repeating tags from xml Post: 302698127

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

extract data from xml- shell script using awk

Hi, This is the xml file that i have. - <front-servlet platform="WAS4.0" request-retriever="SiteMinder-aware" configuration-rescan-interval="60000"> <concurrency-throttle maximum-concurrency="50" redirect-page="/jsp/defaulterror.jsp" /> - <loggers> <instrumentation...

2. Shell Programming and Scripting

Help with shell script to extract data from XML file

Hello Scripting Gurus, I need help with extracting data from the XML file using shell script. The data is in a large XML and I need to extract the id values of all completedworkflows. Here is a sample of it. Input and output data is also in the attached text files. <wfregistry>...

3. Shell Programming and Scripting

Perl script for extract data from xml files

Hi All, Prepare a perl script for extracting data from xml file. The xml data look like as AC StartTime="1227858839" ID="88" ETime="1227858837" DSTFlag="false" Type="2" Duration="303" /> <AS StartTime="1227858849" SigPairs="119 40 98 15 100 32 128 18 131 23 70 39 123 20 120 27 100 17 136 12...

4. UNIX for Dummies Questions & Answers

Extract repeating data from file

I want to extract the last rows of a data file, similar to that one below: C1 xxx C2 rrr C3 ttt .... Cn-1 hhh Cn bbb C1 yyy C2 sss C3 uuu ... Cn-1 iii Cn ccc ... I just want to extract the final rows between C1 and Cn at each data file. n is not a constant,...

5. Shell Programming and Scripting

awk and or sed command to sum the value in repeating tags in a XML

I have a XML in which <Amt Ccy="EUR">3.1</Amt> tag repeats. This is under another tag <Main>. I need to sum all the values of <Amt Ccy=""> (Ccy may vary) coming under <Main> using awk and or sed command. can some help? Sample looks like below <root> <Main> ...

6. UNIX for Advanced & Expert Users

Shell Script to read XML tags and the data within that tag

Hi unix Gurus, I am really new to Unix Scripting. Please help me to create a shell script which reads the xml file and from that i need to fetch a particular information. For example <SOURCE BUSINESSNAME ="" DATABASETYPE ="Teradata" DBDNAME ="DWPROD3" DESCRIPTION ="" NAME...

7. Shell Programming and Scripting

How to extract data from xml file using shell scripting?

Hi evry1, This is my 1st post in this forum.Pls help me I want to extract some data froma xml file which has 2000 lines using shell scripting. Actually my xml file has some "audio and video codes" which i need to arrange in a column wise format after extracting it using shell scripting.I...

8. Shell Programming and Scripting

How to extract data from XML file using shell scripting?

Hi , I have input file as XML. following are input data #complex.xml Code: <?xml version="1.0" encoding="UTF-8"?><TEST_doc xmlns="http://www.w3.org/2001/XMLSchema-instance"> <ENTRY uid="123456"> <protein> <name>PROT001</name> <organism>Human</organism> ...

9. Shell Programming and Scripting

Parse xml in shell script and extract records with specific condition

Hi I have xml file with multiple records and would like to extract records from xml with specific condition if specific tag is present extract entire row otherwise skip . <logentry revision="21510"> <author>mantest</author> <date>2015-02-27</date> <QC_ID>334566</QC_ID>...

10. UNIX for Beginners Questions & Answers

Extract XML block when value is matched (Shell script)

Hi everyone, So i'm struggling with an xml (log file) where we get information about some devices, so the logfile is filled with multiple "blocks" like that. Based on the <devId> i want to extract this part of the xml file. If possible I want it to have an script for this, cause we'll use...

LEARN ABOUT DEBIAN

xml_pp

XML_PP(1p)						User Contributed Perl Documentation						XML_PP(1p)

NAME

       xml_pp - xml pretty-printer

SYNOPSYS

       xml_pp [options] [<files>]

DESCRIPTION

       XML pretty printer using XML::Twig

OPTIONS

       -i[<extension>]
	   edits the file(s) in place, if an extension is provided (no space between "-i" and the extension) then the original file is backed-up
	   with that extension

	   The rules for the extension are the same as Perl's (see perldoc perlrun): if the extension includes no "*" then it is appended to the
	   original file name, If the extension does contain one or more "*" characters, then each "*" is replaced with the current filename.

       -s <style>
	   the style to use for pretty printing: none, nsgmls, nice, indented, record, or record_c (see XML::Twig docs for the exact description
	   of those styles), 'indented' by default

       -p <tag(s)>
	   preserves white spaces in tags. You can use several "-p" options or quote the tags if you need more than one

       -e <encoding>
	   use XML::Twig output_encoding (based on Text::Iconv or Unicode::Map8 and Unicode::String) to set the output encoding. By default the
	   original encoding is preserved.

	   If this option is used the XML declaration is updated (and created if there was none).

	   Make sure that the encoding is supported by the parser you use if you want to be able to process the pretty_printed file (XML::Parser
	   does not support 'latin1' for example, you have to use 'iso-8859-1')

       -l  loads the documents in memory instead of outputing them as they are being parsed.

	   This prevents a bug (see BUGS) but uses more memory

       -f <file>
	   read the list of files to process from <file>, one per line

       -v  verbose (list the current file being processed)

       --  stop argument processing (to process files that start with -)

       -h  display help

EXAMPLES

	 xml_pp foo.xml > foo_pp.xml	       # pretty print foo.xml
	 xml_pp < foo.xml > foo_pp.xml	       # pretty print from standard input

	 xml_pp -v -i.bak *.xml 	       # pretty print .xml files, with backups
	 xml_pp -v -i'orig_*' *.xml	       # backups are named orig_<filename>

	 xml_pp -i -p pre foo.xhtml	       # preserve spaces in pre tags

	 xml_pp -i.bak -p 'pre code' foo.xml   # preserve spaces in pre and code tags
	 xml_pp -i.bak -p pre -p code foo.xml  # same

	 xml_pp -i -s record mydb_export.xml   # pretty print using the record style

	 xml_pp -e utf8 -i foo.xml	       # output will be in utf8
	 xml_pp -e iso-8859-1 -i foo.xml       # output will be in iso-8859-1

	 xml_pp -v -i.bak -f lof	       # pretty print in place files from lof

	 xml_pp -- -i.xml		       # pretty print the -i.xml file

	 xml_pp -l foo.xml		       # loads the entire file in memory
					       # before pretty printing it

	 xml_pp -h			       # display help

BUGS

       Elements with mixed content that start with an embedded element get an extra 


	 <elt><b>b</b>toto<b>bold</b></elt>

       will be output as

	 <elt>
	   <b>b</b>toto<b>bold</b></elt>

       Using the "-l" option solves this bug (but uses more memory)

TODO

       update XML::Twig to use Encode with perl 5.8.0

AUTHOR

       Michel Rodriguez <mirod@xmltwig.com>

perl v5.12.4							    2011-05-18								XML_PP(1p)