Sponsored Content
Top Forums Shell Programming and Scripting Extract TAG name and XPATH from XML file via shellscript Post 302689213 by BithunC on Tuesday 21st of August 2012 01:31:42 AM
Old 08-21-2012
Java Extract TAG name and XPATH from XML file via shellscript

Hi,
Here is a sample xml file and expected output.
I need to extract the element/tag name (not value) and xpath (sample output.txt).
But the main problem is I put here one simple xml file where I can clearly see the number of elements, but in real time I have a xml file which have over 500 elements, so is there any option to find the elements automatically and retrive the xpath.

sample.XML:

Code:
 
<?xml version = '1.0'?>
<ROWSET>
<ROW num="1">
<EMPNO>7369</EMPNO>
<ENAME>SMITH</ENAME>
<JOB>CLERK</JOB>
<MGR>7902</MGR>
<HIREDATE>12/17/1980 0:0:0</HIREDATE>
<SAL>800</SAL>
<DEPTNO>20</DEPTNO>
</ROW>
<ROW num="2">
<EMPNO>7499</EMPNO>
<ENAME>ALLEN</ENAME>
<JOB>SALESMAN</JOB>
<MGR>7698</MGR>
<HIREDATE>2/20/1981 0:0:0</HIREDATE>
<SAL>1600</SAL>
<COMM>300</COMM>
<DEPTNO>30</DEPTNO>
</ROW>
</ROWSET>

Output.txt :
Code:
 
ROW_NUM /ROWSET/ROW/@num
EMPNO /ROWSET/ROW/EMPNO
ENAME /ROWSET/ROW/ENAME
JOB /ROWSET/ROW/JOB
MGR /ROWSET/ROW/MGR
HIREDATE /ROWSET/ROW/HIREDATE
SAL /ROWSET/ROW/SAL
COMM /ROWSET/ROW/COMM
DEPTNO /ROWSET/ROW/DEPTNO

Note: If there is a attribute like "num" and the value is changing like 1, 2 or more than that.. is there any changes in output file?


Thnx,
Bithun

Last edited by BithunC; 08-23-2012 at 07:20 AM.. Reason: code tag
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Unable to extract a tag from a very long XML message

Hi I have a log file which contain XML message. I want to extract the value between the tag : <businessEventId>13201330</businessEventId> i.e., 13201330. I tried the following commands but as the message is very long, unable to do it. Attached is the log file. Please provide inputs. --... (3 Replies)
Discussion started by: Sapna_Sai
3 Replies

2. Shell Programming and Scripting

how to extract the info in the tag from a xml file

Hi All, Do anyone of you have any idea how to extract each<info> tag to each different file. I have 1000 raw files, which come in every 15 mins.( I am using bash) I have tried my script as below, but it took hours to finish, which is inefficiency. perl -n -e '/^<info>/ and open FH,">file".$n++;... (2 Replies)
Discussion started by: natalie23
2 Replies

3. Shell Programming and Scripting

Bash XML Parsing using Perl XPath

I have a bash script that needs to read input from an XML file, which includes varying numbers of a certain type of child node. I want to be able to iterate through all the child nodes of a given parent. I installed the Perl XML-XPath package from search.cpan.org. Once it's installed, from bash,... (4 Replies)
Discussion started by: jfmorales
4 Replies

4. Shell Programming and Scripting

extract xml tag based on condition

Hi All, I have a large xml file of invoices. The file looks like below: <INVOICES> <INVOICE> <NAME>Customer A</NAME> <INVOICE_NO>1234</INVOICE_NO> </INVOICE> <INVOICE> <NAME>Customer A</NAME> <INVOICE_NO>2345</INVOICE_NO> </INVOICE> <INVOICE> <NAME>Customer A</NAME>... (9 Replies)
Discussion started by: angshuman
9 Replies

5. Shell Programming and Scripting

XML tag replacement from different XML file

We have 2 XML file 1. ORIGINAL.xml file and 2. ATTRIBUTE.xml files, In the ORIGINAL.xml we need some modification as <resourceCode>431048</resourceCode>under <item type="Manufactured"> tag - we need to grab the 431048 value from tag and pass it to database table in unix shell script to find the... (0 Replies)
Discussion started by: balrajg
0 Replies

6. Shell Programming and Scripting

Extract multiple xml tag value into CSV format

Hi All, Need your assistance on another xml tag related issue. I have a xml file as below: <INVOICES> <INVOICE> <BILL> <BILL_NO>1234</BILL_NO> <BILL_DATE>01 JAN 2011</BILL_DATE> </BILL> <NAMEINFO> <NAME>ABC</NAME> </NAMEINFO> </INVOICE> <INVOICE> <BILL> <BILL_NO>5678</BILL_NO>... (12 Replies)
Discussion started by: angshuman
12 Replies

7. Shell Programming and Scripting

How to add the multiple lines of xml tags before a particular xml tag in a file

Hi All, I'm stuck with adding multiple lines(irrespective of line number) to a file before a particular xml tag. Please help me. <A>testing_Location</A> <value>LA</value> <zone>US</zone> <B>Region</B> <value>Russia</value> <zone>Washington</zone> <C>Country</C>... (0 Replies)
Discussion started by: mjavalkar
0 Replies

8. Shell Programming and Scripting

Extract XML tag value from file

Hello, Hope you are doing fine. I have an log file which looks like as follows: Some junk text1 Date: Thu Mar 15 13:38:46 CDT 2012 DATA SENT SUCCESSFULL: Some jun text 2 Date: Thu Mar 15 13:38:46 CDT 2012 DATA SENT SUCCESSFULL: ... (3 Replies)
Discussion started by: srattani
3 Replies

9. Shell Programming and Scripting

Moving XML tag/contents after specific XML tag within same file

Hi Forum. I have an XML file with the following requirement to move the <AdditionalAccountHolders> tag and its content right after the <accountHolderName> tag within the same file but I'm not sure how to accomplish this through a Unix script. Any feedback will be greatly appreciated. ... (19 Replies)
Discussion started by: pchang
19 Replies

10. UNIX for Beginners Questions & Answers

Grepping multiple XML tag results from XML file.

I want to write a one line script that outputs the result of multiple xml tags from a XML file. For example I have a XML file which has below XML tags in the file: <EMAIL>***</EMAIL> <CUSTOMER_ID>****</CUSTOMER_ID> <BRANDID>***</BRANDID> Now I want to grep the values of all these specified... (1 Reply)
Discussion started by: shubh752
1 Replies
XML_RDB(3pm)						User Contributed Perl Documentation					      XML_RDB(3pm)

NAME
DBIx::XML_RDB /- Perl extension for creating XML from existing DBI datasources SYNOPSIS
use DBIx::XML_RDB; my $xmlout = DBIx::XML_RDB/->new($datasource, "ODBC", $userid, $password, $dbname) || die "Failed to make new xmlout"; $xmlout/->DoSql("select * from MyTable"); print $xmlout/->GetData; DESCRIPTION
This module is a simple creator of XML data from DBI datasources. It allows you to easily extract data from a database, and manipulate later using XML::Parser. One use of this module might be (and will be soon from me) to extract data on the web server, and send the raw data (in XML format) to a client's browser, and then use either XML::Parser from PerlScript, or MSXML from VBScript/JavaScript on the client's machine to generate HTML (obviously this relies upon using MS IE for their Active Scripting Engine, and MSXML comes with IE5beta). Another use is a simple database extraction tool, which is included, called sql2xml. This tool simply dumps a table in a database to an XML file. This can be used in conjunction with xml2sql (part of the XML::DBI(?) package) to transfer databases from one platform or data- base server to another. Binary data is encoded using UTF-8. This is automatically decoded when parsing with XML::Parser. Included with the distribution is a "Scriptlet" /- this is basically a Win32 OLE wrapper around this class, allowing you to call this mod- ule from any application that supports OLE. To install it, first install the scriptlets download from microsoft at http://msdn.micro- soft.com/scripting. Then right-click on XMLDB.sct in explorer and select "Register". Create your object as an instance of "XMLDB.Script- let". FUNCTIONS
new new ( $datasource, $dbidriver, $userid, $password [, $dbname] ) See the DBI documentation for what each of these means, except for $dbname which is for support of Sybase and MSSQL server database names (using "use $dbname"). DoSql DoSql ( $sql ) Takes a simple Sql command string (either a select statement or on some DBMS's can be a stored procedure call that returns a result set /- Sybase and MSSql support this, I don't know about others). This doesn't do any checking if the sql is valid, if it fails, the procedure will "die", so if you care about that, wrap it in an eval{} block. The result set will be appended to the output. Subsequent calls to DoSql don't overwrite the output, rather they append to it. This allows you to call DoSql multiple times before getting the output (via GetData()). GetData Simply returns the XML generated from this SQL call. Unfortunately it doesn't stream out as yet. I may add this in sometime in the future (this will probably mean an IO handle being passed to new()). The format of the XML output is something like this: <?xml version="1.0"?> <DBI driver="dbi:Sybase:database=foo"> <RESULTSET statement="select * from Table"> <ROW> <Col1Name>Data</Col1Name> <Col2Name>Data</Col2Name> ... </ROW> <ROW> ... </ROW> </RESULTSET> <RESULTSET statement="select * from OtherTable"> ... </RESULTSET> </DBI> This is quite easy to parse using XML::Parser. AUTHOR
Matt Sergeant, matt@sergeant.org SEE ALSO
XML::Parser perl v5.8.8 2005-01-18 XML_RDB(3pm)
All times are GMT -4. The time now is 09:18 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy