data extraction from xml file Post: 302448480

Sponsored Content

Top Forums Shell Programming and Scripting data extraction from xml file Post 302448480 by shashi792 on Thursday 26th of August 2010 06:38:33 AM

08-26-2010

Registered User

data extraction from xml file

I have an of xml file as shown below

Code:

<?xml version='1.0' encoding='ASCII' standalone='yes' ?>
<Station Index="10264" >
   <Number Value="237895890" />
   <Position Lat="-29.5" Lon="3.5" />
   <MaxDepth Value="-4939" />
   <VeloLines Count="24">
      <VeloLine Index="0" >
         <Depth Value="0" />
         <Temperature Count="12" >
            2183
            2200
            2253
            2135
            2028
            1859
            1831
            1751
            1740
            1762
            1869
            1996
         </Temperature>
         <Salinity Count="12" >
            3577
            3586
            3583
            3582
            3575
            3580
            3576
            3575
            3566
            3567
            3561
            3569
         </Salinity>
      </VeloLine>
      <VeloLine Index="1" >
         <Depth Value="10" />
         <Temperature Count="12" >
            2155
            2188
            2254
            2128
            2020
            1854
            1810
            1739
            1732
            1749
            1850
            1964
         </Temperature>
         <Salinity Count="12" >
            3576
            3583
            3573
            3581
            3575
            3580
            3575
            3574
            3567
            3567
            3562
            3577
         </Salinity>

The temp gives me the temp for 12 months and salinity for 12 months. the file runs till 752 lines

Question : I want to extract some specific data (depth ,Temp and salinity) from the file i.e temp of 1st month (i.e first value in temp), corresponding salinity in tht month(i.e first value in salinity)

Code:

depth     temp    salinity
    0          2183     3577
    10        xxxxxx   xxxxx
    20        yyyyyy   yyyyyy
etc

to a file

My idea : Temp :- is starting from Line 10 for depth 0 and from line 41 for depth to and so on, so 31 lines diff bw them. and similarly for Salinity and similarly for depth - using awk and for loop for data extraction. Will this work ?. or any other ideas to implement this

I have attached the xml file for reference

lat_060_lon_003.xml (14.8 KB)

shashi792

View Public Profile for shashi792

Find all posts by shashi792

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

extraction of data from a text file which follows certain pattern

hi everybody, i have a file, in it I need to extract some data that follows a particular pattern.. For example: my file contains like now running Speak225 sep 22 mon 16:34:05 2008 -------------------------------- ...

2. Shell Programming and Scripting

Data Extraction From a File

Hi All, I have a requirement where I have to search the file with some text say "Exception". This exception word can be repeated for more then 10 times. Suppose the "Exception" word is repeated at line numbers say x=10, 50, 60, 120. Now I want to extract all the lines starting from x-5 to...

3. Shell Programming and Scripting

Help needed XML Field Extraction

I had an immediate work to sort out the error code and error message which are associated within the log. But here im facing an problem to extract 3 different fields from the XML log can some one please help. I tried using different script including awk & nawk, but not getting the desired output. ...

4. Shell Programming and Scripting

data extraction from a file

Hi Freinds, I have a file1.txt in the following format File1.txt I want to get 2 files from the above file filextra.txt should have the lines which are ending with "<" and remaining lines in the filecompare.txt file. Please help.

5. Shell Programming and Scripting

Data extraction from .txt file

Hey all, i�ve got the following problem: i�m aquiring data with an instrument and i get data in a .txt file. This is how the txt file looks like: Report of AU program poptau F1P=-49.986ppm F2P=-110.014ppm Target directory for serfile: D:/data/Spect500/nmr/Thoma/882 Linear...

6. Shell Programming and Scripting

CSV file data extraction

Hi I am writing a shell script to parse a CSV file , in which i am facing a problem to separate the columns . Could some one help me with it. IN301330/00001 pvavan kumar limited xyz@ttccpp.com IN302148/00002 PRECIOUS SECURITIES (P) LTD viash@yahoo.co.in IN300239/00000 CENTRE india...

7. Shell Programming and Scripting

Data extraction from .xml file

Hello, I'm attempting to extract 13 digit numbers beginning with 978 from a data file with the following command: awk '{ for(i=1;i<=NF;i++) if($i ~ /^978/) print $i; }' datafile > outfile This typically works. However, the new data file is an .xml file, and this command is no longer working...

8. Shell Programming and Scripting

Help with XML tag value extraction based on condition

sample xml file part <?xml version="1.0" encoding="UTF-8"?><ContractWorkspace xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" _LoadId="export_AJ6iAFmh+pQHq1" xsi:noNamespaceSchemaLocation="ContractWorkspace.xsd"> <_LocalId>CW2218471</_LocalId> <Active>true</Active> ...

9. Shell Programming and Scripting

Help with tag value extraction from xml file based on a matching condition

Hi , I have a situation where I need to search an xml file for the presence of a tag <FollowOnFrom> and also , presence of partial part of the following tag <ContractRequest _LoadId and if these 2 exist ,then extract the value from the following tag <_LocalId> which is "CW2094139". There...

10. UNIX for Beginners Questions & Answers

Data extraction and converting into .csv file.

Hi All, I have a data file and need to extract and convert it into csv format: 1) Read and extract the line containing string ending with "----" (file sample_linebyline.txt file) and to make a .csv file from this. 2) To read the flat file flatfile_sample.txt which consists of similar data (...

LEARN ABOUT CENTOS

xml_pp

XML_PP(1)						User Contributed Perl Documentation						 XML_PP(1)

NAME

       xml_pp - xml pretty-printer

SYNOPSYS

       xml_pp [options] [<files>]

DESCRIPTION

       XML pretty printer using XML::Twig

OPTIONS

       -i[<extension>]
	   edits the file(s) in place, if an extension is provided (no space between "-i" and the extension) then the original file is backed-up
	   with that extension

	   The rules for the extension are the same as Perl's (see perldoc perlrun): if the extension includes no "*" then it is appended to the
	   original file name, If the extension does contain one or more "*" characters, then each "*" is replaced with the current filename.

       -s <style>
	   the style to use for pretty printing: none, nsgmls, nice, indented, record, or record_c (see XML::Twig docs for the exact description
	   of those styles), 'indented' by default

       -p <tag(s)>
	   preserves white spaces in tags. You can use several "-p" options or quote the tags if you need more than one

       -e <encoding>
	   use XML::Twig output_encoding (based on Text::Iconv or Unicode::Map8 and Unicode::String) to set the output encoding. By default the
	   original encoding is preserved.

	   If this option is used the XML declaration is updated (and created if there was none).

	   Make sure that the encoding is supported by the parser you use if you want to be able to process the pretty_printed file (XML::Parser
	   does not support 'latin1' for example, you have to use 'iso-8859-1')

       -l  loads the documents in memory instead of outputing them as they are being parsed.

	   This prevents a bug (see BUGS) but uses more memory

       -f <file>
	   read the list of files to process from <file>, one per line

       -v  verbose (list the current file being processed)

       --  stop argument processing (to process files that start with -)

       -h  display help

EXAMPLES

	 xml_pp foo.xml > foo_pp.xml	       # pretty print foo.xml
	 xml_pp < foo.xml > foo_pp.xml	       # pretty print from standard input

	 xml_pp -v -i.bak *.xml 	       # pretty print .xml files, with backups
	 xml_pp -v -i'orig_*' *.xml	       # backups are named orig_<filename>

	 xml_pp -i -p pre foo.xhtml	       # preserve spaces in pre tags

	 xml_pp -i.bak -p 'pre code' foo.xml   # preserve spaces in pre and code tags
	 xml_pp -i.bak -p pre -p code foo.xml  # same

	 xml_pp -i -s record mydb_export.xml   # pretty print using the record style

	 xml_pp -e utf8 -i foo.xml	       # output will be in utf8
	 xml_pp -e iso-8859-1 -i foo.xml       # output will be in iso-8859-1

	 xml_pp -v -i.bak -f lof	       # pretty print in place files from lof

	 xml_pp -- -i.xml		       # pretty print the -i.xml file

	 xml_pp -l foo.xml		       # loads the entire file in memory
					       # before pretty printing it

	 xml_pp -h			       # display help

BUGS

       Elements with mixed content that start with an embedded element get an extra 


	 <elt><b>b</b>toto<b>bold</b></elt>

       will be output as

	 <elt>
	   <b>b</b>toto<b>bold</b></elt>

       Using the "-l" option solves this bug (but uses more memory)

TODO

       update XML::Twig to use Encode with perl 5.8.0

AUTHOR

       Michel Rodriguez <mirod@xmltwig.com>

perl v5.16.3							    2012-11-14								 XML_PP(1)