Extract xml data and create word document using perl.


 
Thread Tools Search this Thread
Top Forums Programming Extract xml data and create word document using perl.
# 1  
Old 10-13-2011
Extract xml data and create word document using perl.

Hi,
I have large xml data file.I need to extract node and some tags in the node and after I need to create word document. my XMl data is look like as below
Code:
-<student>
         <number>24</number>
          <education>bachelor</bachelor>
          <specialization>computers</specialization>
          -<Address>
              <house_number="128"/>
              <street number="38"/>
               <proddutoor/>
             <Address/>    
           -<Details>
                <name="xxxx"/>
                 <age="24"/>
                  <sex= "m"/>
                <Details/>
   </student>...........

i have xml data for more number of students........ like that but now i need to extract only number and address of students.my output look like this below
Code:
number: 24
adress: <Address>
              <house_number="128"/>
              <street number="38"/>
               <proddutoor/>
             <Address/>
number: 25
adress: <Address>
              <house_number="128"/>
              <street number="87"/>
               <pulivendula/>
             <Address/>..........

I need to extract xml data like above format and create word document for that information.

Last edited by vbe; 10-14-2011 at 04:56 AM.. Reason: slash missing in tag....
# 2  
Old 10-17-2011
I love perl, but parsing XML documents with it is not my idea of fun. The XML::Lite module should be able to handle what you need. However, spitting out a Word document is another story. Can you output PDF or RTF instead?
# 3  
Old 10-18-2011
Hi,
I extracted some information from my xml data, I stored in a hash. i am printing my hash my output look like this
Code:
$var1={
        'stud.xml'={ 
                   '24'=>'<address>
                           <streetname="xxxx"/>
                            <housenum="138"/">
                           </address>'
                     '20'=>'<address>
                           <streetname="xxxx"/>
                            <housenum="110"/">
                           </address>'
                         }
         'pav.xml'={ 
                   '26'=>'<address>
                           <streetname="xxxx"/>
                            <housenum="138"/">
                           </address>'
                     '27'=>'<address>
                           <streetname="xxxx"/>
                            <housenum="110"/">
                           </address>'
                         }
                         .
                         .
                         .
                 }

because I have xml files in different folders so I searched and extracted some information from all xml files and stored in hash. now I need to convert this data into pdf. I decided to convert into pdf by using latex. so first I need to convert from xml to latex template by using perl and I will run it on latex compiler.how to convert my resultant xml to template.
# 4  
Old 10-18-2011
Wow! You've made some progress. There's a module for perl to output pdf: using "PDF::API2".

This document describes how to generate LaTeX templates that can be then used to generated PDFs.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Extract strings from XML files and create a new XML

Hello everybody, I have a double mission with some XML files, which is pretty challenging for my actual beginner UNIX knowledge. I need to extract some strings from multiple XML files and create a new XML file with the searched strings.. The original XML files contain the source code for... (12 Replies)
Discussion started by: milano.churchil
12 Replies

2. Shell Programming and Scripting

Extract Data from XML file.

Hi Guys, I am in a need to extract data from a xml file. The XML file format is as below. <data jsxnamespace="propsbundle" locales=""> <locale> <!--Error messages starts--> <record jsxid="CHARPAIR001" jsxtext=" must be selected"></record> <record... (1 Reply)
Discussion started by: Showdown
1 Replies

3. Shell Programming and Scripting

Extract data from XML file

Hi , I have input file as XML. following are input data #complex.xml <?xml version="1.0" encoding="UTF-8"?> <TEST_doc xmlns="http://www.w3.org/2001/XMLSchema-instance"> <ENTRY uid="123456"> <protein> <name>PROT001</name> <organism>Human</organism> ... (1 Reply)
Discussion started by: mohan sharma
1 Replies

4. Shell Programming and Scripting

Extract part of word from XML

Hi All, Can Someone help me in capturing a word from xml Using sed or awk or any other way in unix. i have file abc.xml like this <?xml version="1.0" encoding="ISO-8859-1" standalone="no" ?> - <NREC xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> - <HEADER> <SOURCE>MAIL</SOURCE>... (5 Replies)
Discussion started by: naveenkumarc
5 Replies

5. Programming

extract xml data and create word document using perl.

hi, i have large xml file which contains students information, i need to extract student number and some address tags and create a word document for the extracted data. my data looking llike this <student> <number>24</number> <education>bachelors</education> ... (1 Reply)
Discussion started by: veerubiji
1 Replies

6. Shell Programming and Scripting

extract whole thing in word, leaving behind last word. - perl

Hi, i've a string /u/user/DTE/T_LOGS/20110622_011532_TEST_11_HD_120/HD/TESi T_11_HD_120/hd-12 i need to get string, like /u/user/DTE/T_LOGS/20110622_011532_TEST_11_HD_120/HD the words from HD should get deleted, i need only a string till HD, i dont want to use any built in... (4 Replies)
Discussion started by: asak
4 Replies

7. Shell Programming and Scripting

Data Extract from XML Log File

Please help me out to extract the Data from the XML Log files. So here is the data ERROR|2010-08-26 00:05:52,958|SERIAL_ID=128279996|ST=2010-08-2600:05:52|DEVICE=113.2.21.12:601|TYPE=TransactionLog... (9 Replies)
Discussion started by: raghunsi
9 Replies

8. Shell Programming and Scripting

XML data extract

Hi all, I have the following xml document : <HEADER><El1>asdf</El1> <El2>3</El2> <El3>asad</El3> <El4>asasdf</El4> <El5>asdf</El5> <El6>asdf</El6> <El7>asdf</El7> <El8>A</El8> <El9>0</El9> <El10>75291028141917</El10> <El11>asdf</El11> <El12>sdf</El12> <El13>er</El13> <El14><El15>asdf... (1 Reply)
Discussion started by: nthed
1 Replies

9. Shell Programming and Scripting

Extract xml data

Hi all, I have the following xml file : <xmlhead><xmlelement1>element1value</xmlelement1>\0a<xmlelement2>jjasd</xmlelement2>...</xmlhead> As you can see there are no lines or spaces seperating the elements, just the character \0a. How can i find and print the values of a specific element?... (1 Reply)
Discussion started by: nthed
1 Replies

10. Shell Programming and Scripting

Perl script for extract data from xml files

Hi All, Prepare a perl script for extracting data from xml file. The xml data look like as AC StartTime="1227858839" ID="88" ETime="1227858837" DSTFlag="false" Type="2" Duration="303" /> <AS StartTime="1227858849" SigPairs="119 40 98 15 100 32 128 18 131 23 70 39 123 20 120 27 100 17 136 12... (3 Replies)
Discussion started by: allways4u21
3 Replies
Login or Register to Ask a Question