Extract data from a file


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Extract data from a file
# 1  
Old 05-15-2013
Extract data from a file

Hello All,

I have a small xml file which looks like below:
Code:
<Check:defaultval Val="crash" value="crash_report_0013&#xA;generate_check_0020 generate_check_0022&#xA;&#xA;This is where the fault is."/>
<Check:defaultval Val="crash" value="crash_report_1001&#xA;generate_check_1001 generate_check_102 generate_check_0050&#xA;&#xA;This is where the fault is."/>
<Check:defaultval Val="crash" value="crash_report_4211&#xA;generate_check_1190&#xA;&#xA;This is where the fault is."/>
<Check:defaultval Val="crash" value="crash_report_0123&#xA;generate_check_0091&#xA;&#xA;This is where the fault is."/>

I want to extract the below data in the below format from the xml file.
Code:
crash_report_0013  generate_check_0020 generate_check_0022
crash_report_1001  generate_check_1001 generate_check_102 generate_check_0050
crash_report_4211  generate_check_1190
crash_report_0123  generate_check_0091

Any solution using any script is welcome.

Thanks,
Suvendu
# 2  
Old 05-15-2013
Code:
awk -F'=' '{gsub(/"|\&#xA\;\&#xA\;.*|\&#xA\;/,OFS,$3);sub(/^ /,X,$3);$0=$3}1' file

This User Gave Thanks to Yoda For This Post:
# 3  
Old 05-15-2013
Code:
perl -ne '/(crash_report_\d+)/ && print "$1 "; while(/(generate_check_\d+)/g){print "$1 "}; print "\n"' file

This User Gave Thanks to balajesuri For This Post:
# 4  
Old 05-15-2013
Code:
awk -F"\"|&#xA\;" '{print $4,$5}' xmlfile

Code:
crash_report_0013 generate_check_0020 generate_check_0022
crash_report_1001 generate_check_1001 generate_check_102 generate_check_0050
crash_report_4211 generate_check_1190
crash_report_0123 generate_check_0091


Last edited by Jotne; 05-15-2013 at 10:43 AM..
This User Gave Thanks to Jotne For This Post:
# 5  
Old 05-15-2013
Hello Yoda,

Your solution works.
Its great help.

But if my xml file is big,which contains different information with #xA attribute,then the output may be give lot of information apart from what i need.I think putting a filter using crash_report and generate_check will be a good idea.

Any suggestion how to do it using awk will be of good help

---------- Post updated at 08:55 AM ---------- Previous update was at 08:53 AM ----------

Hello Balajesuri,

It works as always expected.

If i have a huge file where i have to search only these attributes then it will give me a file where i will have lot of new line(\n).

What is the best way to minimize the new line in order to search the items in a big file.

Thanks,
Suvendu
# 6  
Old 05-15-2013
Quote:
Originally Posted by suvendu4urs
I think putting a filter using crash_report and generate_check will be a good idea
Yes, you can search for records containing pattern: crash_report & generate_check and work only on them:
Code:
awk -F'=' '/crash_report/ && /generate_check/ {gsub(/"|&#xA\;&#xA\;.*|&#xA\;/,OFS,$3);sub(/^ /,X,$3);print $3}' file

This User Gave Thanks to Yoda For This Post:
# 7  
Old 05-15-2013
Yes Yoda,

Your solution works.

Its an useful information.
Thanks a lot.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Extract data from a file

Hi , I am having a file which is PIPE delimited like this : file.txt aus|start|10:00:00 nz|start|11:00:00 aus|end|10:10:00 us|start|10:00:00 nz|end|11:10:00 us|end|11:00:00 . . . I want to extract an output file like this based on start time and end time for each countries: (9 Replies)
Discussion started by: rohit_shinez
9 Replies

2. Shell Programming and Scripting

Extract data from a file

I have a text file that contains the following data. For example, aa.txt has some numbers. I need to extract the continuous numbers(minimum 3 numbers) from it.How can I do this with awk? >aa.txt 31 35 36 37 38 39 44 169 170 173 174 175 177 206 >1a.txt 39 (5 Replies)
Discussion started by: rahmanabdulla
5 Replies

3. Shell Programming and Scripting

Extract header data from one file and combine it with data from another file

Hi, Great minds, I have some files, in fact header files, of CTD profiler, I tried a lot C programming, could not get output as I was expected, because my programming skills are very poor, finally, joined unix forum with the hope that, I may get what I want, from you people, Here I have attached... (17 Replies)
Discussion started by: nex_asp
17 Replies

4. Shell Programming and Scripting

Help with File Data Extract

Hello, Hope you are doing fine. I have been struggling with it for some time now and I would really appreciate your help. Following is file format: Currency,Name,Date, Term USD, ABC, 2011/11/11, T0, S1, S2, S3, S4 , , ,T1, 5.6, 2.3, 6.5, 4.5 , ... (5 Replies)
Discussion started by: srattani
5 Replies

5. Shell Programming and Scripting

Extract data based on match against one column data from a long list data

My input file: data_5 Ali 422 2.00E-45 102/253 140/253 24 data_3 Abu 202 60.00E-45 12/23 140/23 28 data_1 Ahmad 256 7.00E-45 120/235 140/235 22 data_4 Aman 365 8.00E-45 15/65 140/65 20 data_10 Jones 869 9.00E-45 65/253 140/253 18... (12 Replies)
Discussion started by: patrick87
12 Replies

6. Shell Programming and Scripting

Extract Data from a file

I need to create a script to extract some specific data from a file. I locate the file using the find command: find . -name "rpbol*" -print | xargs grep -li Once I locate the file I need using the above command, I would like to extract some data from that file. The data is always located... (2 Replies)
Discussion started by: jevaba
2 Replies

7. Shell Programming and Scripting

extract data from file

I m new to shell scripting & i need a help.... i have file like.... Name := sachin address:=something phone:=111 ... Note: There might be or not space between Name & := and between := & sachin. I need to extract the data from each line of file as var1=Name value1=sachin same for... (13 Replies)
Discussion started by: ps_sach
13 Replies

8. Shell Programming and Scripting

extract data from file

Hello again, how do you extract data from a file? I have created a file with PID #s in it, I need to be able to take the PID from each line and kill it. How is this done? (4 Replies)
Discussion started by: raidzero
4 Replies

9. Shell Programming and Scripting

Extract data from file

Dear All , I am posting first time in this forum . Please ignore my mistakes . I am learning Unix and i need help to extract specific data from file . 1. I want to grep number of fails from log . The file contains "fails" word in line if test cases are failed . 2. The log contains... (20 Replies)
Discussion started by: getdpg
20 Replies

10. Shell Programming and Scripting

extract data from file

My file in ksh consists of message data of varying lengths (lines), separated with headers. I would like to find a string from this file, and print out the whole message data including the headers. my plan of attack is to search the strings, print the top header, and print the whole message... (2 Replies)
Discussion started by: apalex
2 Replies
Login or Register to Ask a Question