07-09-2008
Split Large File
Hi,
I have to split a large file whose input looks like this:
Input file name_file.txt
00001|AAAA|MAIL|DATEOFBIRTHT|.......
00001|AAAA|MAIL|DATEOFBIRTHT|.......
00002|BBBB|MAIL|DATEOFBIRTHT|.......
00002|BBBB|MAIL|DATEOFBIRTHT|.......
00003|CCCC|MAIL|DATEOFBIRTHT|.......
00004|DDDD|MAIL|DATEOFBIRTHT|.......
00005|EEEE|MAIL|DATEOFBIRTHT|.......
00006|FFFF|MAIL|DATEOFBIRTHT|.......
00007|MMMM|MAIL|DATEOFBIRTHT|.......
00008|TTTT|MAIL|DATEOFBIRTHT|.......
00008|TTTT|MAIL|DATEOFBIRTHT|.......
00009|HHHH|MAIL|DATEOFBIRTHT|.......
00010|KKKK|MAIL|DATEOFBIRTHT|.......
If I want to split every 5 lines, counting lines by the first and second columns, then for example I must have:
out1.txt
00001|AAAA|MAIL|DATEOFBIRTHT|.......
00001|AAAA|MAIL|DATEOFBIRTHT|.......
00002|BBBB|MAIL|DATEOFBIRTHT|.......
00002|BBBB|MAIL|DATEOFBIRTHT|.......
00003|CCCC|MAIL|DATEOFBIRTHT|.......
00004|DDDD|MAIL|DATEOFBIRTHT|.......
00005|EEEE|MAIL|DATEOFBIRTHT|.......
out2.txt
00006|FFFF|MAIL|DATEOFBIRTHT|.......
00007|MMMM|MAIL|DATEOFBIRTHT|.......
00008|TTTT|MAIL|DATEOFBIRTHT|.......
00008|TTTT|MAIL|DATEOFBIRTHT|.......
00009|HHHH|MAIL|DATEOFBIRTHT|.......
00010|KKKK|MAIL|DATEOFBIRTHT|.......
If two lines have the same value in the first and second columns, I count them as one line.
Please help.
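One possible approach, sketched in Python rather than as a confirmed answer from this thread: group consecutive lines that share the first two `|`-separated fields, and start a new output file after every 5 groups. It assumes lines with the same key are adjacent, as in the sample above. The names `split_by_key_groups` and `write_chunks` and the `out{}.txt` pattern are hypothetical, chosen only for illustration.

```python
import itertools


def split_by_key_groups(lines, keys_per_file=5, sep="|"):
    """Yield lists of lines, each holding `keys_per_file` distinct keys.

    The key is the first two fields of a line. Assumption: lines that
    share a key are adjacent in the input (e.g. the file is sorted).
    """
    def key(line):
        return tuple(line.split(sep, 2)[:2])

    chunk, groups = [], 0
    for _, group in itertools.groupby(lines, key=key):
        chunk.extend(group)      # duplicate-key lines stay together
        groups += 1              # but count as a single "line"
        if groups == keys_per_file:
            yield chunk
            chunk, groups = [], 0
    if chunk:                    # leftover groups go into a last file
        yield chunk


def write_chunks(in_path, out_pattern="out{}.txt", keys_per_file=5):
    """Stream the input file and write out1.txt, out2.txt, ..."""
    with open(in_path) as src:
        chunks = split_by_key_groups(src, keys_per_file)
        for n, chunk in enumerate(chunks, start=1):
            with open(out_pattern.format(n), "w") as dst:
                dst.writelines(chunk)
```

Calling `write_chunks("name_file.txt")` on the sample input should give out1.txt with the seven lines for 00001 to 00005 and out2.txt with the six lines for 00006 to 00010. Only one chunk is held in memory at a time, so it should cope with a large file. If equal keys are not adjacent, the file would need sorting first.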