Splitting the Huge file into several files... Post: 302404375

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Splitting huge XML Files into fixsized wellformed parts

Hi, I need to split xml-files with sizes greater than 2 gb into smaler chunks. As I dont want to end up with billions of files, I want those splitted files to have configurable sizes like 250 MB. Each file should be well formed having an exact copy of the header (and footer as the closing of the...

2. Shell Programming and Scripting

splitting huge xml into multiple files

hi all i have a some huge html files (500MB to 1GB). Each file has multiple <html></html> tags <html> ................. .................... .................... </html> <html> ................. .................... .................... </html> <html> ....................

3. Shell Programming and Scripting

Help on splitting this huge file

Hi , i have files coming in my system which are very huge in MB and GBs, all these files are in a single line, there is no newline character. I need to get only last 700 bytes of these files, of this i am splitting the files by "split -b 700 filename" but this gives all the splitted...

4. Shell Programming and Scripting

Splitting files from one file

Hi, I have an input file like: 111 abcdefgh asdfghjk dfghjkl 222 aaaaaaa bbbbbb 333 djfhfgjktitjhgfkg 444 djdhfjkhfjkghjkfg hsbfjksdbhjkgherjklg fjkhfjklsahjgh fkrjkgnj I want to read this input file and make separate output files with the header as numric value like "111"...

5. Shell Programming and Scripting

Splitting file into 2 files ?

Hi extending to one of my previous posted query .... I am using nawk -v invar1="$aa" '{print > ("ABS\_"((/\|/)?"A\_":"B\_")invar1"\_NETWORKID.txt")}' spfile.txt to get 2 different files based on split condition i.e. "|" Similar to invar1 variable in nawk I also need one more variable...

6. Shell Programming and Scripting

Help- counting delimiter in a huge file and split data into 2 files

I’m new to Linux script and not sure how to filter out bad records from huge flat files (over 1.3GB each). The delimiter is a semi colon “;” Here is the sample of 5 lines in the file: Name1;phone1;address1;city1;state1;zipcode1 Name2;phone2;address2;city2;state2;zipcode2;comment...

7. Shell Programming and Scripting

splitting a huge line of file into multiple lines with fixed number of columns

Hi, I have a huge file with a single line. But I want to break that line into lines of with each line having five columns. My file is like this: code: "hi","there","how","are","you?","It","was","great","working","with","you.","hope","to","work","you." I want it like this: code:...

8. Shell Programming and Scripting

Need help splitting huge single record file

I was given a data file that I need to split into multiple lines/records based on a key word. The problem is that it is 2.5GB or bigger and everything I try in perl or sed causes a Segmentation fault. Can someone give me some other ideas. The data is of the form:...

9. UNIX for Dummies Questions & Answers

File comparison of huge files

Hi all, I hope you are well. I am very happy to see your contribution. I am eager to become part of it. I have the following question. I have two huge files to compare (almost 3GB each). The files are simulation outputs. The format of the files are as below For clear picture, please see...

10. UNIX for Dummies Questions & Answers

Split a huge 7 GB File Based on Pattern into 4 files

Hi, I have a Huge 7 GB file which has around 1 million records, i want to split this file into 4 files to contain around 250k messages each. Please help me as Split command cannot work here as it might miss tags.. Format of the file is as below ...

LEARN ABOUT OPENSOLARIS

split

split(1)							   User Commands							  split(1)

NAME

       split - split a file into pieces

SYNOPSIS

       split [-linecount | -l linecount] [-a suffixlength]
	    [file [name]]

       split [-b n | nk | nm] [-a suffixlength] [file [name]]

DESCRIPTION

       The  split  utility reads file and writes it in linecount-line pieces into a set of output-files. The name of the first output-file is name
       with aa appended, and so on lexicographically, up to zz (a maximum of 676 files). The maximum length of name is 2 characters less than  the
       maximum	filename length allowed by the filesystem. See statvfs(2). If no output name is given, x is used as the default (output-files will
       be called xaa, xab, and so forth).

OPTIONS

       The following options are supported:

       -linecount | -l linecount

	   Number of lines in each piece. Defaults to 1000 lines.

       -a suffixlength

	   Uses suffixlength letters to form the suffix portion of the filenames of the split file. If -a is not  specified,  the  default  suffix
	   length  is  2. If the sum of the name operand and the suffixlength option-argument would create a filename exceeding NAME_MAX bytes, an
	   error will result; split will exit with a diagnostic message and no files will be created.

       -b n

	   Splits a file into pieces n bytes in size.

       -b nk

	   Splits a file into pieces n*1024 bytes in size.

       -b nm

	   Splits a file into pieces n*1048576 bytes in size.

OPERANDS

       The following operands are supported:

       file    The path name of the ordinary file to be split. If no input file is given or file is -, the standard input will be used.

       name    The prefix to be used for each of the files resulting from the split operation. If no name argument is given, x will be used as the
	       prefix  of  the	output	files.	The  combined  length of the basename of prefix and suffixlength cannot exceed NAME_MAX bytes. See
	       OPTIONS.

USAGE

       See largefile(5) for the description of the behavior of split when encountering files greater than or equal to 2 Gbyte ( 2^31 bytes).

ENVIRONMENT VARIABLES

       See environ(5) for descriptions of the following environment variables that affect the execution of split: LANG, LC_ALL, LC_CTYPE,  LC_MES-
       SAGES, and NLSPATH.

EXIT STATUS

       The following exit values are returned:

       0     Successful completion.

       >0    An error occurred.

ATTRIBUTES

       See attributes(5) for descriptions of the following attributes:

       +-----------------------------+-----------------------------+
       |      ATTRIBUTE TYPE	     |	    ATTRIBUTE VALUE	   |
       +-----------------------------+-----------------------------+
       |Availability		     |SUNWesu			   |
       +-----------------------------+-----------------------------+
       |CSI			     |Enabled			   |
       +-----------------------------+-----------------------------+
       |Interface Stability	     |Committed 		   |
       +-----------------------------+-----------------------------+
       |Standard		     |See  standards(5).	   |
       +-----------------------------+-----------------------------+

SEE ALSO

       csplit(1), statvfs(2), attributes(5), environ(5), largefile(5), standards(5)

SunOS 5.11							    16 Apr 1999 							  split(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Splitting huge XML Files into fixsized wellformed parts

Discussion started by: Malapha

2. Shell Programming and Scripting

splitting huge xml into multiple files

Discussion started by: uttamhoode

3. Shell Programming and Scripting

Help on splitting this huge file

Discussion started by: Prateek007

4. Shell Programming and Scripting

Splitting files from one file

Discussion started by: saltysumi