Splitting XML file on basis of line number into multiple file


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Splitting XML file on basis of line number into multiple file
# 1  
Old 05-07-2014
Splitting XML file on basis of line number into multiple file

Hi All,

I have more than half million lines of XML file , wanted to split in four files in a such a way that top 7 lines should be present in each file on top and bottom line of should be present in each file at bottom.

from the 8th line actual record starts and each record contains 15 lines means from 8th to 22nd line is the first record of the file.
so the total number of actual records are varying each time.

wanted to divide this actual record in four chunks and each chunks should move to new four files respectively below to the top 7 and above the bottom line.

say...
Code:
cat ABCD.xml |wc -l
 690728
Actual Record lines = 690728 -[top 7] -[bottom1]= 690720
690720/15 = Actual Record = 46048

then
46048/4 = 11512 or [if it is not exactly divisible then the remainder record should move to the last ]


so first 11512 record move to ABCD_part1.xml
second 11512 will move to ABCD_part2.xml
third 11512 will move move to ABCD_part3.xml
Fourth/remaining records 11512 will move to to ABCD_part4.xml

Please help on this .

Last edited by Don Cragun; 05-17-2014 at 05:08 PM.. Reason: Add CODE tags again.
# 2  
Old 05-07-2014
You already gave math.
You can use
Code:
sed -n 'x,yp' file

where x is the starting line number and y is the ending.
# 3  
Old 05-08-2014
CLX,
thnx, wanted to make it generalized the scenario the math i have given you that is the example records may vary for each new file.
# 4  
Old 05-15-2014
any hint in shell scripting to process the below request ?
# 5  
Old 05-15-2014
For any number of records, below code will create 4 parts with 7 lines of header and a trailer in each file
But, make sure to provide the file name twice as in the code
Code:
awk '
BEGIN{n = 1;
  prt = 4}
NR == FNR {
  if(FNR <= 7)
    {hd = (hd == "") ? $0 : (hd "\n" $0)}
  else
    {tr = $0;
    n = FNR}
  next}
FNR == 1 {fc = (n - 8) / prt;
  c = 0;
  next}
FNR > 7 && FNR <= (fc * c + 7) && FNR < n {
  print $0 > "ABCD_part" c ".xml";
  next}
FNR > (fc * c + 7) && FNR < n {
  if(FNR != 8)
    {print tr > "ABCD_part" c ".xml"};
    c++;
    print hd > "ABCD_part" c ".xml"
 print $0 > "ABCD_part" c ".xml"}
END {print tr > "ABCD_part" c ".xml"}' ABCD.xml ABCD.xml


Last edited by SriniShoo; 05-15-2014 at 05:50 AM..
# 6  
Old 05-16-2014
Thank a lot ! it is as expected.

Only one thing is that in original file on seventh line we have a tag like

Code:
<TotalRecord>46048</TotalRecord>

and in each file it is replicating the same even though the records are 1/4th of the original means in each file the above tag is to be like 46048/4 =11512

part1
Code:
<TotalRecord>11512</TotalRecord>

part2
Code:
<TotalRecord>11512</TotalRecord>

......so on..

Last edited by Don Cragun; 05-17-2014 at 02:21 AM.. Reason: Add CODE tags again.
# 7  
Old 05-16-2014
Do you have a blank line at the beginning of the file or anywhere before 7 lines
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Split a txt file on the basis of line number

I have to split a file containing 100 lines to 5 files say from lines ,1-20 ,21-30 ,31-40 ,51-60 ,61-100 Here is i can do it for 2 file but how to handle it for more than 2 files awk 'NR < 21{ print >> "a"; next } {print >> "b" }' $input_file Please advidse. Thanks (4 Replies)
Discussion started by: abhaydas
4 Replies

2. Shell Programming and Scripting

Splitting a single xml file into multiple xml files

Hi, I'm having a xml file with multiple xml header. so i want to split the file into multiple files. Sample.xml consists multiple headers so how can we split these multiple headers into multiple files in unix. eg : <?xml version="1.0" encoding="UTF-8"?> <ml:individual... (3 Replies)
Discussion started by: Narendra921631
3 Replies

3. Shell Programming and Scripting

Splitting xml file into several xml files using perl

Hi Everyone, I'm new here and I was checking this old post: /shell-programming-and-scripting/180669-splitting-file-into-several-smaller-files-using-perl.html (cannot paste link because of lack of points) I need to do something like this but understand very little of perl. I also check... (4 Replies)
Discussion started by: mcosta
4 Replies

4. Shell Programming and Scripting

Looping through XML file on basis of a node

<?xml version="1.0" encoding="UTF-8"?> <Document> <FIToFICstmrCdtTrf> <GrpHdr> <MsgId>10001</MsgId> <NbOfTxs>1</NbOfTxs> <IntrBkSttlmDt>2015-05-06</IntrBkSttlmDt> <SttlmInf> <SttlmMtd>CLRG</SttlmMtd> </SttlmInf> <PmtTpInf> ... (2 Replies)
Discussion started by: harish2015
2 Replies

5. UNIX for Dummies Questions & Answers

Splitting the file basis of Line Number

Can u pls advise the unix command as I have a file which contain the records in the below format 333434 435435 435443 434543 343536 Now the total line count is 89380 , now i want to create a separate I am trying to split my large big file into small bits using the line... (2 Replies)
Discussion started by: punpun66
2 Replies

6. Shell Programming and Scripting

Splitting a file based on line number

Hi I have a file with over a million lines (rows) and I want to split everything from 500,000 to a million into another file (to make the file smaller). Is there a simple command for this? Thank you Phil (4 Replies)
Discussion started by: phil_heath
4 Replies

7. Shell Programming and Scripting

Help required in Splitting a xml file into multiple and appending it in another .xml file

HI All, I have to split a xml file into multiple xml files and append it in another .xml file. for example below is a sample xml and using shell script i have to split it into three xml files and append all the three xmls in a .xml file. Can some one help plz. eg: <?xml version="1.0"?>... (4 Replies)
Discussion started by: ganesan kulasek
4 Replies

8. Shell Programming and Scripting

splitting a huge line of file into multiple lines with fixed number of columns

Hi, I have a huge file with a single line. But I want to break that line into lines of with each line having five columns. My file is like this: code: "hi","there","how","are","you?","It","was","great","working","with","you.","hope","to","work","you." I want it like this: code:... (1 Reply)
Discussion started by: rajsharma
1 Replies

9. Shell Programming and Scripting

Split one file to Multiple file with report basis in unix

Hi, Please help on this. i want split the below file(11020111.CLT) to more files with some condition. :b: 1) %s stating of the report 2) %e ending of the report example starting of the report: %sAEGONCA| |MUMBAI | :EXPC|N|D ending of the report %eAEGONCA| |MUMBAI | :EXPC 3)so the... (10 Replies)
Discussion started by: krbala1985
10 Replies

10. Shell Programming and Scripting

splitting a file (xml) into multiple files

To split the files Hi, I'm having a xml file with multiple xml header. so i want to split the file into multiple files. Test.xml --------- <?xml version="UTF_8"> <emp: ....> <name>a</name> <age>10</age> </emp> <?xml version="UTF_8"> <emp: ....> <name>b</name> <age>10</age>... (11 Replies)
Discussion started by: sasi_u
11 Replies
Login or Register to Ask a Question