Extract and parse XML data (statistic value) to csv

01-06-2012

Hi All,

I need to parse some statistics data out of one "measInfo" block, e.g. the one with measInfoId="25250000", and return the results line by line, stripping all the other unnecessary info and tags.
I thought of starting with grep 'measInfoId="25250000"', but that only returns one line. How do I get all the output below this measInfoId and return each of the values, line by line, as in my desired output? I assume sed is needed to erase some of the data, and perhaps awk to loop?

Any help would be appreciated. Thanks all

Long xml data

Code:

.
.
.
.
<measInfo measInfoId="15150000">
<granPeriod duration="PT3600S" endTime="2011-12-19T11:00:00+11:00"/>
<repPeriod duration="PT3600S"/>
<measTypes>15153111 15153112 15153119 15153120 15153121 15153123 15153124 15153127 15153128 15154169 15154778 15150512 15151757 15151758 15151759 15159900 </measTypes>
<measValue measObjLdn="Label=Site-O:MD0035-O-A-2, ID=59135">
<measResults>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0340-O-A-2, ID=56575">
<measResults>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MD8001-O-A-3, ID=59646">
<measResults>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 </measResults>
</measValue> 
</measInfo>
<measInfo measInfoId="25250000">
<granPeriod duration="PT3600S" endTime="2011-12-19T11:00:00+11:00"/>
<repPeriod duration="PT3600S"/>
<measTypes>25254177 25254178 25254179 25254806 25254807 25254808 25254809 25254810 25254811 25254812 25254860 25254861 25254862 25254863 25254864 </measTypes>
<measValue measObjLdn="Label=Site-O:MD0035-O-A-2, ID=59135">
<measResults>0 0 0 27300 100194 141378 2282 0 0 379 5849362 0 0 2497 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0340-O-A-2, ID=56575">
<measResults>0 0 0 2099 11649 11091 28 0 0 74 249108 0 0 119 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MD8001-O-A-3, ID=59646">
<measResults>0 0 0 0 549 0 0 0 0 0 1967 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0056-O-A-2, ID=59155">
<measResults>0 0 0 0 1571 37 0 0 0 41 24453 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0056-O-A-1, ID=59154">
<measResults>0 0 0 1349 4921 878 0 0 0 48 24651 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0146-O-A-3, ID=57106">
<measResults>0 0 0 0 7018 106949 0 0 0 10 3928360 0 0 0 0 </measResults>
</measValue>
.
.
. (a lot more data).
.
<measValue measObjLdn="Label=Site-O:MA0120-O-B-3, ID=12561">
<measResults>0 0 0 8021 31504 1743 53 0 0 12 3939629 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA8105-O-A-3, ID=58896">
<measResults>0 0 0 0 2807 195 0 0 0 0 50977 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0289-O-A-3, ID=57616">
<measResults>0 0 0 0 15665 10976 0 0 0 4 692551 0 0 831 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0146-O-A-1, ID=57104">
<measResults>0 0 0 0 1884 237 0 0 0 1 13943 0 0 0 0 </measResults>
</measValue>
</measInfo>
<measInfo measInfoId="25350000">
<granPeriod duration="PT3600S" endTime="2011-12-19T11:00:00+11:00"/>
<repPeriod duration="PT3600S"/>
<measTypes>25353111 25353112 25353119 25353120 25353121 25353123 25353124 25353127 25353128 25354169 25354778 25350512 25351757 25351758 25351759 25359900 </measTypes>
<measValue measObjLdn="Label=Site-O:MD0035-O-A-2, ID=59135">
<measResults>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MA0340-O-A-2, ID=56575">
<measResults>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 </measResults>
</measValue>
<measValue measObjLdn="Label=Site-O:MD8001-O-A-3, ID=59646">
<measResults>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 </measResults>
</measValue> 
</measInfo>
.
.
.
.
.

Desired output
The desired output should be in CSV format (not sure if "," is needed; I just want something that is easy to process further with awk using the fields $1...$n)

Code:

Site-O:MD0035-O-A-2, ID=59135, 0, 0, 0, 27300, 100194, 141378, 2282, 0, 0, 379, 5849362, 0, 0, 2497, 0
Site-O:MA0340-O-A-2, ID=56575, 0, 0, 0, 2099, 11649, 11091, 28, 0, 0, 74, 249108, 0, 0, 119, 0
Site-O:MD8001-O-A-3, ID=59646, 0, 0, 0, 0, 549, 0, 0, 0, 0, 0, 1967, 0, 0, 0, 0
Site-O:MA0056-O-A-2, ID=59155, 0, 0, 0, 0, 1571, 37, 0, 0, 0, 41, 24453, 0, 0, 0, 0
Site-O:MA0056-O-A-1, ID=59154, 0, 0, 0, 1349, 4921, 878, 0, 0, 0, 48, 24651, 0, 0, 0, 0
Site-O:MA0146-O-A-3, ID=57106, 0, 0, 0, 0, 7018, 106949, 0, 0, 0, 10, 3928360, 0, 0, 0, 0 
.
.
. (a lot more data).
.
Site-O:MA0120-O-B-3, ID=12561, 0, 0, 0, 8021, 31504, 1743, 53, 0, 0, 12, 3939629, 0, 0, 0, 0
Site-O:MA8105-O-A-3, ID=58896, 0, 0, 0, 0, 2807, 195, 0, 0, 0, 0, 50977, 0, 0, 0, 0
Site-O:MA0289-O-A-3, ID=57616, 0, 0, 0, 0, 15665, 10976, 0, 0, 0, 4, 692551, 0, 0, 831, 0
Site-O:MA0146-O-A-1, ID=57104, 0, 0, 0, 0, 1884, 237, 0, 0, 0, 1, 13943, 0, 0, 0, 0

Last edited by Franklin52; 01-06-2012 at 03:27 AM.. Reason: Please use code tags for code and data samples, thank you

jackma

01-06-2012

Try this out:

Code:

mID=25250000
sed -n '/<measInfo measInfoId="'$mID'">/,/<\/measInfo>/  {/^<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n;  s/^<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p  } }' stats.xml

Let me explain this mess

Code:

mID=25250000  #use a variable
sed -n '   
/<measInfo measInfoId="'$mID'">/,/<\/measInfo/{  #consider only the section between the measInfo tags
    /^<measValue / {  #on lines that start with measValue tag
        s/.*Label=\([^"]*\).*/\1/ ;  #get the stuff behind 'Label='
        x;    #and put it into hold buffer
        n;    #read next line
        s/^<measResults>\([0-9 ]*\).*/\1/ ;  #extract just the numbers 
        H;    #and append them to hold buffer
        x;     #retrieve hold buffer
        s/\n//;  #get rid of an extra newline
        p      #and print out
    } 
}' stats.xml

This assumes that the <measResults> data is always on the next line after <measValue>.
If you put this code into your script, make sure to keep the comments

mirni

01-06-2012

Have you looked at xmllint with XPath?

chakrapani

01-06-2012

Thanks mirni.
That code is very complicated lol. I can never understand sed as its syntax is too confusing. I tried it but it returned no result. Something must have gone wrong.

Code:

root@localhost:~/xmlproj> sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/  {/^<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n;  s/^<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p  } }' sampleCellbasedscript.txt
root@localhost:~/xmlproj> sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/  {/^<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n;  s/^<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p  } }' sampleCellbasedscript.txt
root@localhost:~/xmlproj>

Hi chakrapani,
What are xmllint and XPath? I tried searching online for an XML to CSV parser but could not find a useful one.

Last edited by fpmurphy; 01-06-2012 at 11:41 AM.. Reason: code tags please!

jackma

01-06-2012

Hmm... it works for me:

Code:

$ sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/  {/^<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n;  s/^<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p  } }' stats.xml
Site-O:MD0035-O-A-2, ID=591350 0 0 27300 100194 141378 2282 0 0 379 5849362 0 0 2497 0 
Site-O:MA0340-O-A-2, ID=565750 0 0 2099 11649 11091 28 0 0 74 249108 0 0 119 0 
Site-O:MD8001-O-A-3, ID=596460 0 0 0 549 0 0 0 0 0 1967 0 0 0 0 
Site-O:MA0056-O-A-2, ID=591550 0 0 0 1571 37 0 0 0 41 24453 0 0 0 0 
Site-O:MA0056-O-A-1, ID=591540 0 0 1349 4921 878 0 0 0 48 24651 0 0 0 0 
Site-O:MA0146-O-A-3, ID=571060 0 0 0 7018 106949 0 0 0 10 3928360 0 0 0 0 
Site-O:MA0120-O-B-3, ID=125610 0 0 8021 31504 1743 53 0 0 12 3939629 0 0 0 0 
Site-O:MA8105-O-A-3, ID=588960 0 0 0 2807 195 0 0 0 0 50977 0 0 0 0 
Site-O:MA0289-O-A-3, ID=576160 0 0 0 15665 10976 0 0 0 4 692551 0 0 831 0 
Site-O:MA0146-O-A-1, ID=571040 0 0 0 1884 237 0 0 0 1 13943 0 0 0 0

Where stats.xml is the copied'n'pasted stuff from your first post.
Does your file sampleCellbasedscript.txt contain exactly what you posted? Are there by any chance whitespace characters at the beginning of the lines with the measValue tag?
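
A quick way to check for that is to count the measValue lines that start with spaces or tabs. This is only a sketch: the function name count_indented_measvalue is made up for illustration, and it assumes a grep that understands POSIX character classes.

```shell
# Hypothetical helper: print how many <measValue ...> lines in file $1
# begin with at least one space or tab.
count_indented_measvalue() {
    # grep -c prints the number of matching lines; "|| true" keeps the
    # exit status at 0 when that number is 0.
    grep -c '^[[:space:]][[:space:]]*<measValue ' "$1" || true
}
```

For example, count_indented_measvalue sampleCellbasedscript.txt printing anything other than 0 would mean the ^ anchor in the sed command cannot match those lines.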

Sed is infamous for its obscurity, but that is only at first sight. Once you understand how it works, it is no mystery.

Try this:

Code:

sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/ p' inputFile

It should print the section between measInfo tags.

mirni

01-06-2012

Another way, using awk:

Code:

awk -v ID="25250000" '/<measInfo / {  P=match($0, "\""ID"\""); } P; /<\/measInfo/ { P=0 }' datafile.xml

Corona688

01-09-2012

Quote:

Originally Posted by mirni

Hmm... it works for me:

Code:

$ sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/  {/^<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n;  s/^<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p  } }' stats.xml
Site-O:MD0035-O-A-2, ID=591350 0 0 27300 100194 141378 2282 0 0 379 5849362 0 0 2497 0 
Site-O:MA0340-O-A-2, ID=565750 0 0 2099 11649 11091 28 0 0 74 249108 0 0 119 0 
Site-O:MD8001-O-A-3, ID=596460 0 0 0 549 0 0 0 0 0 1967 0 0 0 0 
Site-O:MA0056-O-A-2, ID=591550 0 0 0 1571 37 0 0 0 41 24453 0 0 0 0 
Site-O:MA0056-O-A-1, ID=591540 0 0 1349 4921 878 0 0 0 48 24651 0 0 0 0 
Site-O:MA0146-O-A-3, ID=571060 0 0 0 7018 106949 0 0 0 10 3928360 0 0 0 0 
Site-O:MA0120-O-B-3, ID=125610 0 0 8021 31504 1743 53 0 0 12 3939629 0 0 0 0 
Site-O:MA8105-O-A-3, ID=588960 0 0 0 2807 195 0 0 0 0 50977 0 0 0 0 
Site-O:MA0289-O-A-3, ID=576160 0 0 0 15665 10976 0 0 0 4 692551 0 0 831 0 
Site-O:MA0146-O-A-1, ID=571040 0 0 0 1884 237 0 0 0 1 13943 0 0 0 0

Code:

sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/ p' inputFile

It should print the section between measInfo tags.

Thank you so much, Mirni.
Yes, there are 4 spaces before the measValue tag and 9 before measResults. I have tried to amend the code as follows, and it now gives something close to my desired result. However, how do I add an extra "," between the values that sed returns from <measResults>?

root@localhost:~/xmlproj> sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/ {/^<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n; s/^<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p } }' sampleCellbasedscript.txt
(initial code - no result due to white spaces)

root@localhost:~/xmlproj> sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/ {/<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n; s/^<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p } }' sampleCellbasedscript.txt
Site-O:MD0035-O-A-2, ID=59135 <measResults>0 0 0 27300 100194 141378 2282 0 0 379 5849362 0 0 2497 0 </measResults>
Site-O:MA0340-O-A-2, ID=56575 <measResults>0 0 0 2099 11649 11091 28 0 0 74 249108 0 0 119 0 </measResults>
Site-O:MD8001-O-A-3, ID=59646 <measResults>0 0 0 0 549 0 0 0 0 0 1967 0 0 0 0 </measResults>
Site-O:MA0056-O-A-2, ID=59155 <measResults>0 0 0 0 1571 37 0 0 0 41 24453 0 0 0 0 </measResults>
Site-O:MA0056-O-A-1, ID=59154 <measResults>0 0 0 1349 4921 878 0 0 0 48 24651 0 0 0 0 </measResults>
Site-O:MA0146-O-A-3, ID=57106 <measResults>0 0 0 0 7018 106949 0 0 0 10 3928360 0 0 0 0 </measResults>
Site-O:MA0120-O-B-3, ID=12561 <measResults>0 0 0 8021 31504 1743 53 0 0 12 3939629 0 0 0 0 </measResults>
Site-O:MA8105-O-A-3, ID=58896 <measResults>0 0 0 0 2807 195 0 0 0 0 50977 0 0 0 0 </measResults>
Site-O:MA0289-O-A-3, ID=57616 <measResults>0 0 0 0 15665 10976 0 0 0 4 692551 0 0 831 0 </measResults>
Site-O:MA0146-O-A-1, ID=57104 <measResults>0 0 0 0 1884 237 0 0 0 1 13943 0 0 0 0 </measResults>

root@localhost:~/xmlproj> sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/ {/<measValue / {s/.*Label=\([^"]*\).*/\1/ ;x; n; s/.*<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p } }' sampleCellbasedscript.txt
Site-O:MD0035-O-A-2, ID=591350 0 0 27300 100194 141378 2282 0 0 379 5849362 0 0 2497 0
Site-O:MA0340-O-A-2, ID=565750 0 0 2099 11649 11091 28 0 0 74 249108 0 0 119 0
Site-O:MD8001-O-A-3, ID=596460 0 0 0 549 0 0 0 0 0 1967 0 0 0 0
Site-O:MA0056-O-A-2, ID=591550 0 0 0 1571 37 0 0 0 41 24453 0 0 0 0
Site-O:MA0056-O-A-1, ID=591540 0 0 1349 4921 878 0 0 0 48 24651 0 0 0 0
Site-O:MA0146-O-A-3, ID=571060 0 0 0 7018 106949 0 0 0 10 3928360 0 0 0 0
Site-O:MA0120-O-B-3, ID=125610 0 0 8021 31504 1743 53 0 0 12 3939629 0 0 0 0
Site-O:MA8105-O-A-3, ID=588960 0 0 0 2807 195 0 0 0 0 50977 0 0 0 0
Site-O:MA0289-O-A-3, ID=576160 0 0 0 15665 10976 0 0 0 4 692551 0 0 831 0
Site-O:MA0146-O-A-1, ID=571040 0 0 0 1884 237 0 0 0 1 13943 0 0 0 0
root@localhost:~/xmlproj>

// add a "," between ID=XXXXX and the results
root@localhost:~/xmlproj> sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/ {/<measValue / {s/.*Label=\([^"]*\).*/\1,/ ;x; n; s/.*<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p } }' sampleCellbasedscript.txt
Site-O:MD0035-O-A-2, ID=59135,0 0 0 27300 100194 141378 2282 0 0 379 5849362 0 0 2497 0
Site-O:MA0340-O-A-2, ID=56575,0 0 0 2099 11649 11091 28 0 0 74 249108 0 0 119 0
Site-O:MD8001-O-A-3, ID=59646,0 0 0 0 549 0 0 0 0 0 1967 0 0 0 0
Site-O:MA0056-O-A-2, ID=59155,0 0 0 0 1571 37 0 0 0 41 24453 0 0 0 0
Site-O:MA0056-O-A-1, ID=59154,0 0 0 1349 4921 878 0 0 0 48 24651 0 0 0 0
Site-O:MA0146-O-A-3, ID=57106,0 0 0 0 7018 106949 0 0 0 10 3928360 0 0 0 0
Site-O:MA0120-O-B-3, ID=12561,0 0 0 8021 31504 1743 53 0 0 12 3939629 0 0 0 0
Site-O:MA8105-O-A-3, ID=58896,0 0 0 0 2807 195 0 0 0 0 50977 0 0 0 0
Site-O:MA0289-O-A-3, ID=57616,0 0 0 0 15665 10976 0 0 0 4 692551 0 0 831 0
Site-O:MA0146-O-A-1, ID=57104,0 0 0 0 1884 237 0 0 0 1 13943 0 0 0 0

Questions:
1) How do I remove the space before ID?
2) How do I print a "," after each result, as in the desired output?
3) If there are decimal points in the input file, this code fails to output the floating-point numbers. I think \([0-9 ]*\) only matches the digits 0 to 9 and spaces, so the match stops at the first decimal point. How do I resolve this?

I also tried piping the result into a second sed to turn the spaces into ",", but I could not get rid of the "," that was already there (hence the double ",,"):
root@localhost:~/xmlproj> sed -n '/<measInfo measInfoId="25250000">/,/<\/measInfo>/ {/<measValue / {s/.*Label=\([^"]*\).*/\1,/ ;x; n; s/.*<measResults>\([0-9 ]*\).*/\1/ ;H; x; s/\n//; p } }' sampleCellbasedscript.txt | sed -e 's/ /,/g'
Site-O:MD0035-O-A-2,,ID=59135,0,0,0,27300,100194,141378,2282,0,0,379,5849362,0,0,2497,0,
Site-O:MA0340-O-A-2,,ID=56575,0,0,0,2099,11649,11091,28,0,0,74,249108,0,0,119,0,
Site-O:MD8001-O-A-3,,ID=59646,0,0,0,0,549,0,0,0,0,0,1967,0,0,0,0,
Site-O:MA0056-O-A-2,,ID=59155,0,0,0,0,1571,37,0,0,0,41,24453,0,0,0,0,
Site-O:MA0056-O-A-1,,ID=59154,0,0,0,1349,4921,878,0,0,0,48,24651,0,0,0,0,
Site-O:MA0146-O-A-3,,ID=57106,0,0,0,0,7018,106949,0,0,0,10,3928360,0,0,0,0,
Site-O:MA0120-O-B-3,,ID=12561,0,0,0,8021,31504,1743,53,0,0,12,3939629,0,0,0,0,
Site-O:MA8105-O-A-3,,ID=58896,0,0,0,0,2807,195,0,0,0,0,50977,0,0,0,0,
Site-O:MA0289-O-A-3,,ID=57616,0,0,0,0,15665,10976,0,0,0,4,692551,0,0,831,0,
Site-O:MA0146-O-A-1,,ID=57104,0,0,0,0,1884,237,0,0,0,1,13943,0,0,0,0,

Desired Output
Site-O:MD0035-O-A-2,ID=59135,0,0,0,27300,100194,141378,2282,0,0,379,5849362,0,0,2497,0,
Site-O:MA0340-O-A-2,ID=56575,0,0,0,2099,11649,11091,28,0,0,74,249108,0,0,119,0,
Site-O:MD8001-O-A-3,ID=59646,0,0,0,0,549,0,0,0,0,0,1967,0,0,0,0,
Site-O:MA0056-O-A-2,ID=59155,0,0,0,0,1571,37,0,0,0,41,24453,0,0,0,0,
Site-O:MA0056-O-A-1,ID=59154,0,0,0,1349,4921,878,0,0,0,48,24651,0,0,0,0,
Site-O:MA0146-O-A-3,ID=57106,0,0,0,0,7018,106949,0,0,0,10,3928360,0,0,0,0,
Site-O:MA0120-O-B-3,ID=12561,0,0,0,8021,31504,1743,53,0,0,12,3939629,0,0,0,0,
Site-O:MA8105-O-A-3,ID=58896,0,0,0,0,2807,195,0,0,0,0,50977,0,0,0,0,
Site-O:MA0289-O-A-3,ID=57616,0,0,0,0,15665,10976,0,0,0,4,692551,0,0,831,0,
Site-O:MA0146-O-A-1,ID=57104,0,0,0,0,1884,237,0,0,0,1,13943,0,0,0,0,

Any help would be appreciated. thanks guys.
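
One possible way to address the three questions in a single sed call is sketched below. It is not the only way, and it assumes, like the commands above, that <measResults> always sits on the line right after its <measValue>. The function name to_csv_sed and its two arguments are made up for illustration. The sketch drops the space before ID, accepts a decimal point inside the numbers, and turns the runs of spaces between values into commas, without leaving a trailing comma.

```shell
# Hypothetical helper: $1 = measInfoId to extract, $2 = input file.
to_csv_sed() {
    sed -n '/<measInfo measInfoId="'"$1"'">/,/<\/measInfo>/ {
        /<measValue / {
            # keep only the text after Label= up to the closing quote
            s/.*Label=\([^"]*\).*/\1/
            # question 1: drop the space(s) after the comma before ID
            s/, */,/
            # park the label in the hold space and read the next line
            x
            n
            # question 3: the dot in the bracket lets decimals through
            s/.*<measResults>\([0-9. ]*\).*/\1/
            # trim the ends, then question 2: spaces become commas
            s/^ *//
            s/ *$//
            s/  */,/g
            # join label and values with a comma and print
            H
            x
            s/\n/,/
            p
        }
    }' "$2"
}
```

It would be called as to_csv_sed 25250000 sampleCellbasedscript.txt. Because the comma after the label is added at the very end, a second sed in a pipe is not needed.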

---------- Post updated at 04:06 PM ---------- Previous update was at 04:04 PM ----------

Quote:

Originally Posted by Corona688

Another way, using awk:

Code:

awk -v ID="25250000" '/<measInfo / {  P=match($0, "\""ID"\""); } P; /<\/measInfo/ { P=0 }' datafile.xml

Thanks corona688.
I tried the awk code but it didn't work. Hmm
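
For comparison, here is a sketch of an awk version that builds the CSV lines itself instead of only printing the selected section. The function name to_csv_awk and its arguments are hypothetical. It assumes each <measResults> element fits on one line and follows its <measValue> line, but not necessarily directly, and it does not care about leading whitespace.

```shell
# Hypothetical helper: $1 = measInfoId to extract, $2 = input file.
to_csv_awk() {
    awk -v id="$1" '
        # entering a measInfo block: remember whether it is the wanted one
        /<measInfo /   { inside = (index($0, "measInfoId=\"" id "\"") > 0) }
        # leaving any measInfo block
        /<\/measInfo>/ { inside = 0 }
        !inside        { next }
        /<measValue / {
            label = $0
            sub(/.*Label=/, "", label)      # cut everything up to Label=
            sub(/".*/, "", label)           # cut from the closing quote on
            sub(/, */, ",", label)          # no space before ID
        }
        /<measResults>/ {
            vals = $0
            sub(/.*<measResults>[ \t]*/, "", vals)
            sub(/[ \t]*<\/measResults>.*/, "", vals)
            gsub(/[ \t]+/, ",", vals)       # blanks between values become commas
            print label "," vals
        }
    ' "$2"
}
```

Called as to_csv_awk 25250000 datafile.xml, it prints one line per measValue. Since the values are never matched against a digit pattern, decimals pass through unchanged.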

jackma
