Post 302564450 by veerubiji on Thursday 13th of October 2011 05:04:00 PM
Extract XML data and create a Word document using Perl

Hi,
I have a large XML file that contains student information. I need to extract the student number and the address tags for each student, and then create a Word document from the extracted data. My data looks like this:
Code:
<student>
    <number>24</number>
    <education>bachelors</education>
    <specialization>computers </specialization>
    <address>
        <house_number="128"/>
        <street number="38"/>
        <proddutoor/>
    <address/>
    <details>
        <name="clar"/>
        <age="20"/>
        <sex="m"/>
    </details>
</student>............

I need to extract only the student number and the address. My output should look like this:
Code:
student number: 24
address:  <address>
    <house_number="128"/>
    <street number="38"/>
    <proddutoor/>
<address/>
student number: 25
address:  <address>
    <house_number="187"/>
    <street number="10"/>
    <proddutoor/>
<address/>............

After extracting the data, I need to create a Word document from it.
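
One possible approach, as a minimal sketch rather than a definitive solution: the sample as posted is not well-formed XML (for example `<house_number="128"/>`), so this sketch pulls the fields out with regular expressions instead of an XML parser. If the real file is well-formed, a parser module such as XML::LibXML or XML::Twig would likely be more robust. The output is written as RTF, which Word can open; that is an assumption about what "Word document" needs to mean here, and a native Word file would need something like Win32::OLE on a Windows machine with Word installed. The sub names (`extract_students`, `rtf_escape`, `to_rtf`) and the file name `students.rtf` are made up for illustration, and the second student record is reconstructed from the expected output above.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Sample input based on the post. Student 25 is assumed: only its
# number and address appear in the expected output.
my $xml = <<'XML';
<student>
    <number>24</number>
    <education>bachelors</education>
    <specialization>computers </specialization>
    <address>
        <house_number="128"/>
        <street number="38"/>
        <proddutoor/>
    <address/>
    <details>
        <name="clar"/>
        <age="20"/>
        <sex="m"/>
    </details>
</student>
<student>
    <number>25</number>
    <address>
        <house_number="187"/>
        <street number="10"/>
        <proddutoor/>
    <address/>
</student>
XML

# Return a list of [number, address_block] pairs, one per <student>.
sub extract_students {
    my ($text) = @_;
    my @records;
    while ($text =~ m{<student>(.*?)</student>}gs) {
        my $student = $1;
        my ($number) = $student =~ m{<number>\s*(\d+)\s*</number>};
        # Accept both </address> and the <address/> form used in the post.
        my ($address) = $student =~ m{(<address>.*?<(?:/address|address/)>)}s;
        next unless defined $number && defined $address;
        push @records, [$number, $address];
    }
    return @records;
}

# Escape RTF special characters and turn newlines into RTF line breaks.
sub rtf_escape {
    my ($s) = @_;
    $s =~ s/([\\{}])/\\$1/g;
    $s =~ s/\n/\\line\n/g;
    return $s;
}

# Build a small RTF document from the extracted records.
sub to_rtf {
    my (@records) = @_;
    my $body = '';
    for my $r (@records) {
        my ($number, $address) = @$r;
        $body .= '\b student number: ' . $number . '\b0\par' . "\n";
        $body .= 'address: ' . rtf_escape($address) . '\par' . "\n";
    }
    return '{\rtf1\ansi\deff0{\fonttbl{\f0 Courier New;}}\f0\fs20' . "\n"
         . $body . '}';
}

my @records = extract_students($xml);
print "student number: $_->[0]\naddress:  $_->[1]\n" for @records;

open my $out, '>', 'students.rtf' or die "Cannot write students.rtf: $!";
print {$out} to_rtf(@records);
close $out;
```

For the real file, the heredoc would be replaced by reading the XML file into `$xml`. For a very large file it may be better to process one `<student>` block at a time instead of loading everything into memory.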

Last edited by veerubiji; 10-13-2011 at 06:15 PM.
 
