Sponsored Content
Top Forums Shell Programming and Scripting Help with shell script to extract data from XML file Post 302232088 by fpmurphy on Wednesday 3rd of September 2008 09:50:34 PM
Old 09-03-2008
Rather than using grep, sed or awk to transform the XML into the required output, it is better to use an XSLT processor.

If you have (Gnome) libxsl/libxslt installed, it comes with xsltproc a command line interface to a XSLT v1.0 compliant processor.

Here is a stylesheet which performs the required transformation using xsltproc:
Code:
$ cat file.xsl
<?xml version="1.0"?>
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

<xsl:output method="text" omit-xml-declaration="yes" />

<xsl:template match="id">
    <xsl:value-of select="@v"/><xsl:text>&#xa;</xsl:text>
</xsl:template>

<xsl:template match="/">
  <xsl:apply-templates select="/wfregistry/completedworkflows/id" />
</xsl:template>

</xsl:stylesheet>

$ xsltproc file.xsl file.xml
3381
3399
3415
3431
3447
3463
3479
3495
3511
3527
3543
3559
3575
3591
3607
$

 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

extract data from xml- shell script using awk

Hi, This is the xml file that i have. - <front-servlet platform="WAS4.0" request-retriever="SiteMinder-aware" configuration-rescan-interval="60000"> <concurrency-throttle maximum-concurrency="50" redirect-page="/jsp/defaulterror.jsp" /> - <loggers> <instrumentation... (5 Replies)
Discussion started by: nishana
5 Replies

2. Shell Programming and Scripting

Convert XML to Data File in Shell Script

Hi All, I will be getting a huge XML file with a lot of records in it. I need to convert it into multiple data files. SAMPLE XML FILE <ABSProductCatalog xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> - <ProductSalesHierachy> - <Portfolios> - <Portfolio productCode="P1"> ... (8 Replies)
Discussion started by: ragha81
8 Replies

3. Shell Programming and Scripting

need a shell script to extract data from a log file.

If I have a log like : Mon Jul 19 05:07:34 2010; TCP; eth3; 52 bytes; from abc to def Mon Jul 19 05:07:35 2010; UDP; eth3; 46 bytes; from aaa to bbb Mon Jul 19 05:07:35 2010; TCP; eth3; 52 bytes; from def to ghi I will need an output like this : Time abc to def... (1 Reply)
Discussion started by: hitha87
1 Replies

4. Shell Programming and Scripting

Shell script to extract data in repeating tags from xml

Hi, I am new to shell scripting. I need to extract data between repeating tags from an xml file and store the data in an array to process it further. <ns1:root xmlns:ns1="http://example.com/config"> <ns1:interface>in1</ns1:interface> <ns1:operation attribute1="true" attribute2="abd"... (2 Replies)
Discussion started by: sailendra
2 Replies

5. Shell Programming and Scripting

How to extract data from xml file using shell scripting?

Hi evry1, This is my 1st post in this forum.Pls help me I want to extract some data froma xml file which has 2000 lines using shell scripting. Actually my xml file has some "audio and video codes" which i need to arrange in a column wise format after extracting it using shell scripting.I... (4 Replies)
Discussion started by: arun_kohan
4 Replies

6. Shell Programming and Scripting

How to extract data from XML file using shell scripting?

Hi , I have input file as XML. following are input data #complex.xml Code: <?xml version="1.0" encoding="UTF-8"?><TEST_doc xmlns="http://www.w3.org/2001/XMLSchema-instance"> <ENTRY uid="123456"> <protein> <name>PROT001</name> <organism>Human</organism> ... (1 Reply)
Discussion started by: arun_kohan
1 Replies

7. UNIX for Dummies Questions & Answers

Shell script to extract data from csv file

Hi Guys, I am new to shell script.I need your help to write a shell script. I need to write a shell script to extract data from a .csv file where columns are ',' separated. The file has 7 columns having values say column 1,column 2.....column 7 as below along with their values. Name, Address,... (7 Replies)
Discussion started by: Vivekit82
7 Replies

8. Shell Programming and Scripting

Shell script to extract data from csv file

Hi everyone, I have a csv file which has data with different heading and column names as below. Static Data Ingested ,,,,,,,,,,,,Known Explained Rejections Column_1,column_2,Column_3,Column_4,,Column_6,Column_7,,% Column_8,,Column_9 ,Column_10 ,... (14 Replies)
Discussion started by: Vivekit82
14 Replies

9. UNIX for Beginners Questions & Answers

Shell script to extract data in a file

I have this 5GB file, and i want to extract from the file particulars pattern. this is my script: // count=`grep -wc "MSISDN" file_name` k=1 >OUTPUT >OUTPUT_Final while do cat file_name | awk -F":" -v var="$k" '$1=="MSISDN" {m++}m==var{print; exit}' >> OUTPUT cat file_name |awk -F":"... (33 Replies)
Discussion started by: gillesi
33 Replies

10. Shell Programming and Scripting

Need get data from XML file through shell script..

hi all, here is the sample log file and these errors are repeated in log file.. i need all the repeated time stamp ,severity and message tags needs to print in output file.. through shell script <log-message> <timestamp>2019-03-13T04:52:49.648-05:00</timestamp> <severity>ERROR</severity>... (17 Replies)
Discussion started by: ravi
17 Replies
xsltproc(1)						      General Commands Manual						       xsltproc(1)

NAME
xsltproc - command line xslt processor SYNOPSIS
xsltproc [-V | -v | -o file | --timing | --repeat | --debug | --novalid | --noout | --maxdepth val | --html | --docbook | --param name value | --stringparam name value | --nonet | --warnnet | --catalogs | --xinclude | --profile] [stylesheet] [ file1 ] [ file2 ] [ .... ] INTRODUCTION
xsltproc is a command line tool for applying XSLT stylesheets to XML documents. It is part of libxslt, the XSLT C library for GNOME. While it was developed as part of the GNOME project, it can operate independently of the GNOME desktop. xsltproc is invoked from the command line with the name of the stylesheet to be used followed by the name of the file or files to which the stylesheet is to be applied. If a stylesheet is included in an XML document with a Stylesheet Processing Instruction, no stylesheet need be named at the command line. xsltproc will automatically detect the included stylesheet and use it. By default, output is to stdout. You can specify a file for output using the -o option. COMMAND LINE OPTIONS
-V or --version Show the version of libxml and libxslt used. -v or --verbose Output each step taken by xsltproc in processing the stylesheet and the document. -o or --output file Direct output to the file named file. For multiple outputs, also known as "chunking", -o directory/ directs the output files to a specified directory. The directory must already exist. --timing Display the time used for parsing the stylesheet, parsing the document and applying the stylesheet and saving the result. Displayed in milliseconds. --repeat Run the transformation 20 times. Used for timing tests. --debug Output an XML tree of the transformed document for debugging purposes. --novalid Skip loading the document's DTD. --noout Do not output the result. --maxdepth value Adjust the maximum depth of the template stack before libxslt concludes it is in an infinite loop. The default is 500. --html The input document is an HTML file. --docbook The input document is DocBook SGML. --param name value Pass a parameter of name name and value value to the stylesheet. You may pass multiple name/value pairs up to a maximum of 32. If the value being passed is a string rather than a node identifier, use --stringparam instead. --stringparam name value Pass a paramenter of name name and value value where value is a string rather than a node identifier. --nonet Do not use the Internet to fetch DTD's or entities. --warnnet Output notification when DTD's or entities are fetched over the Internet. --catalogs Use catalogs to resolve the location of external entities. This speeds DTD resolution. By having a catalog file point to a local version of the DTD, xsltproc does not have to use the Internet to fetch the DTD. xsltproc uses the catalog identified by the envi- ronmental variable SGML_CATALOG_FILES. --xinclude Process the input document using the Xinclude specification. More details on this can be found in the Xinclude specification: http://www.w3.org/TR/xinclude/. --profile or --norman Output profiling information detailing the amount of time spent in each part of the stylesheet. This is useful in optimizing stylesheet performance. RETURN VALUES
xsltproc's return codes provide information that can be used when calling it from scripts. 0: normal 1: no argument 2: too many parameters 3: unknown option 4: failed to parse the stylesheet 5: error in the stylesheet 6: error in one of the documents 7: unsupported xsl:output method 8: string parameter contains both quote and double-quotes MORE INFORMATION
libxml web page: http://www.xmlsoft.org/ W3C XSLT page: http://www.w3.org/TR/xslt AUTHOR
Copyright 2001 by John Fleck <jfleck@inkstain.net>. This is release 0.2 of the xsltproc Manual. NOTES
Source for libxslt is available in the SUNWlxslS package. Documentation for libxslt is available on-line at http://xmlsoft.org/XSLT/ 2002 Jun 27 xsltproc(1)
All times are GMT -4. The time now is 05:23 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy