Help needed - Split large file into smaller files based on pattern match Post: 302758061

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

splitting the large file into smaller files

hi all im new to this forum..excuse me if anythng wrong. I have a file containing 600 MB data in that. when i do parse the data in perl program im getting out of memory error. so iam planning to split the file into smaller files and process one by one. can any one tell me what is the code...

2. Shell Programming and Scripting

Split a file into multiple files based on the input pattern

I have a file with lines something like. ...... 123_start ...... ....... 123_end .... ..... 456_start ...... ..... 456_end .... ..... 789_start .... .... 789_end

3. Shell Programming and Scripting

Splitting large file into multiple files in unix based on pattern

I need to write a shell script for below scenario My input file has data in format: qwerty0101TWE 12345 01022005 01022005 datainala alanfernanded 26 qwerty0101mXZ 12349 01022005 06022008 datainalb johngalilo 28 qwerty0101TWE 12342 01022005 07022009 datainalc hitalbert 43 qwerty0101CFG 12345...

4. Shell Programming and Scripting

Split large file into smaller file

hi Guys i need some help here.. i have a file which has > 800,000 lines in it. I need to split this file into smaller files with 25000 lines each. please help thanks

5. Shell Programming and Scripting

split XML file into multiple files based on pattern

Hello, I am using awk to split a file into multiple files using command: nawk '{ if ( $1 == "<process" ) { n=split($2, arr, "\""); file=arr } print > file }' processes.xml <process name="Process1.process"> ...

6. Shell Programming and Scripting

Sed: Splitting A large File into smaller files based on recursive Regular Expression match

I will simplify the explaination a bit, I need to parse through a 87m file - I have a single text file in the form of : <NAME>house........ SOMETEXT SOMETEXT SOMETEXT . . . . </script> MORETEXT MORETEXT . . .

7. UNIX for Dummies Questions & Answers

Split a huge 7 GB File Based on Pattern into 4 files

Hi, I have a Huge 7 GB file which has around 1 million records, i want to split this file into 4 files to contain around 250k messages each. Please help me as Split command cannot work here as it might miss tags.. Format of the file is as below ...

8. Shell Programming and Scripting

Split Large Files Based On Row Pattern..

Hi all. I've tried searching the web but could not find similar problem to mine. I have one large file to be splitted into several files based on the matching pattern found in each row. For example, let's say the file content: ...

9. UNIX for Dummies Questions & Answers

Split large file to smaller fastly

hi , I have a requirement input file: 1 1111111111111 108 1 1111111111111 109 1 1111111111111 109 1 1111111111111 110 1 1111111111111 111 1 1111111111111 111 1 1111111111111 111 1 1111111111111 112 1 1111111111111 112 1 1111111111111 112 The output should be,

10. UNIX for Beginners Questions & Answers

Split large file into smaller files without disturbing the entry chunks

Dears, Need you help with the below file manipulation. I want to split the file into 8 smaller files but without cutting/disturbing the entries (meaning every small file should start with a entry and end with an empty line). It will be helpful if you can provide a one liner command for this...

LEARN ABOUT HPUX

diffmk

diffmk(1)						      General Commands Manual							 diffmk(1)

NAME

       diffmk - mark changes between two different versions of a file

SYNOPSIS

       prevfile currfile markfile

DESCRIPTION

       compares  the  previous	version of a file with the current version and creates a file that includes ``change mark'' commands.  prevfile is
       the name of the previous version of the file and currfile is the name of the current version of the file.  generates  markfile  which  con-
       tains all the lines of the currfile plus inserted formatter ``change mark'' requests.  When markfile is formatted, changed or inserted text
       is shown by a character at the right margin of each line.  The position of deleted text is shown by a single

       If the characters and are inappropriate, a copy of can be edited to change them because is a shell script.

EXTERNAL INFLUENCES

   International Code Set Support
       Single- and multi-byte character code sets are supported.

EXAMPLES

       A typical command line for comparing two versions of an file and generating a file with the changes marked is:

       can also be used to produce listings of C (or other) programs with changes marked.  A typical command line for such use is:

       where the file contains:

       The request can specify a different line length, depending on the nature of the program being printed.  The request is probably needed only
       for C programs.

WARNINGS

       Aesthetic considerations may dictate manual adjustment of some output.

       does  not  differentiate between changes in text and changes in formatter request coding.  Thus, file differences involving only formatting
       changes (such as replacing with in a text source file) with no change in actual text can produce change marks.

       Although unlikely, certain combinations of formatting requests can cause change marks to either disappear or  to  mark  too  much.   Manual
       intervention  may  be  required because the subtleties of various formatting macro packages and preprocessors is beyond the scope of cannot
       tolerate commands in its input (see tbl(1)), so any request that would appear inside a range  is  silently  deleted.   The  script  can	be
       changed if this action is inappropriate, or can be run on two files that have both been run through the preprocessor before any comparisons
       are made.

       uses and thus has the same limitations on file size and performance that may impose  (see diff(1)).  In particular the performance is  non-
       linear  with  the size of the file, and very large files (well over 1000 lines) may take extremely long to process.  Breaking the file into
       smaller pieces may be advisable.

       also uses the ed(1) editor.  If the file is too large for error messages may be embedded in  the  file.	 Again,  breaking  the	file  into
       smaller pieces may be advisable.

SEE ALSO

       diff(1), nroff(1).

																	 diffmk(1)

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

splitting the large file into smaller files

Discussion started by: vsnreddy

2. Shell Programming and Scripting

Split a file into multiple files based on the input pattern

Discussion started by: abinash

3. Shell Programming and Scripting

Splitting large file into multiple files in unix based on pattern

Discussion started by: jimmy12

4. Shell Programming and Scripting

Split large file into smaller file

Discussion started by: sitaldip