Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Split a huge 7 GB File Based on Pattern into 4 files Post 302837085 by krishmaths on Thursday 25th of July 2013 05:57:22 AM
Old 07-25-2013
The logic goes like this. The code identifies the last END before line number 250000 and assigns the line number where this last END occurs to a marker variable M1.

Similarly marker variables M2 and M3 hold the line numbers where last END occurs before line number 500000 and 750000 respectively.

Now we have created 3 markers in the file for the split. Note that each marker line contains END.

The awk statement uses these 3 markers to split the file into 4.

Can you also try echoing the values for variables M1, M2 and M3 to know whether we have got the correct split.
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Split a file based on a pattern

Dear all, I have a large file which is composed of 8000 frames, what i would like to do is split the file into 8000 single files names file.pdb.1, file.pdb.2 etc etc each frame in the large file is seperated by a "ENDMDL" flag so my thinking is to use this flag a a point to split the files... (4 Replies)
Discussion started by: Mish_99
4 Replies

2. Shell Programming and Scripting

Split a file into multiple files based on the input pattern

I have a file with lines something like. ...... 123_start ...... ....... 123_end .... ..... 456_start ...... ..... 456_end .... ..... 789_start .... .... 789_end (6 Replies)
Discussion started by: abinash
6 Replies

3. Shell Programming and Scripting

Help- counting delimiter in a huge file and split data into 2 files

I’m new to Linux script and not sure how to filter out bad records from huge flat files (over 1.3GB each). The delimiter is a semi colon “;” Here is the sample of 5 lines in the file: Name1;phone1;address1;city1;state1;zipcode1 Name2;phone2;address2;city2;state2;zipcode2;comment... (7 Replies)
Discussion started by: lv99
7 Replies

4. Shell Programming and Scripting

split XML file into multiple files based on pattern

Hello, I am using awk to split a file into multiple files using command: nawk '{ if ( $1 == "<process" ) { n=split($2, arr, "\""); file=arr } print > file }' processes.xml <process name="Process1.process"> ... (3 Replies)
Discussion started by: chiru_h
3 Replies

5. Shell Programming and Scripting

Split a file based on pattern and size

Hello, I have a large file (2GB) that I would like to split based on pattern and size. I've used the following command to split the file (token is "HELLO") awk '/HELLO/{i++}{print > "file"i}' input.txt and the output is similar to the following (i included filesize in KB): 10 ... (2 Replies)
Discussion started by: jl487
2 Replies

6. Shell Programming and Scripting

Split the file based on pattern

Hi , I have huge files around 400 mb, which has clob data and have diffeent scenarios: I am trying to pass scenario number as parameter and and get required modified file based on the scenario number and criteria. Scenario 1: file name : scenario_1.txt ... (2 Replies)
Discussion started by: sol_nov
2 Replies

7. Shell Programming and Scripting

Help needed - Split large file into smaller files based on pattern match

Help needed urgently please. I have a large file - a few hundred thousand lines. Sample CP START ACCOUNT 1234556 name 1 CP END ACCOUNT CP START ACCOUNT 2224444 name 1 CP END ACCOUNT CP START ACCOUNT 333344444 name 1 CP END ACCOUNT I need to split this file each time "CP START... (7 Replies)
Discussion started by: frustrated1
7 Replies

8. Shell Programming and Scripting

Split Large Files Based On Row Pattern..

Hi all. I've tried searching the web but could not find similar problem to mine. I have one large file to be splitted into several files based on the matching pattern found in each row. For example, let's say the file content: ... (13 Replies)
Discussion started by: aimy
13 Replies

9. Shell Programming and Scripting

How to split a file based on pattern line number?

Hi i have requirement like below M <form_name> sdasadasdMklkM D ...... D ..... M form_name> sdasadasdMklkM D ...... D ..... D ...... D ..... M form_name> sdasadasdMklkM D ...... M form_name> sdasadasdMklkM i want split file based on line number by finding... (10 Replies)
Discussion started by: bhaskar v
10 Replies

10. UNIX for Advanced & Expert Users

Split one file to many based on pattern

Hello All, I have records in a file in a pattern A,B,B,B,B,K,A,B,B,K Is there any command or simple logic I can pull out records into multiple files based on A record? I want output as File1: A,B,B,B,B,K File2: A,B,B,K (9 Replies)
Discussion started by: deal1dealer
9 Replies
REPLMARKS(1)						      General Commands Manual						      REPLMARKS(1)

NAME
replmarks - replace triangular markers in a RADIANCE scene description SYNOPSIS
replmarks [ -e ][ -m newmod ][ -s scale ] { -x objfile | -i octree | -I mesh } modname .. [ file .. ] DESCRIPTION
Replmarks replaces triangular markers identified by the modifier modname in each scene description file and writes the result to the stan- dard output. The -x option indicates that each marker should be replaced by an appropriate xform(1) command on objfile. The -i option indicates that each marker should be replaced by an instance of octree. The -I option indicates that each marker should be replaced by an instance of the Radiance mesh mesh. One of these three options must appear on the command line, along with modname, the modifier used by markers in the file. Multiple modifiers may be given, as long as each one is preceded by its own -x or -i option. The transformation for each marker is determined by its location and orientation. A marker should be a right triangle pointing like a half-arrow in the direction of the transformed x-axis, x'. The longest side is the hypoteneuse, the second longest side is the x'-axis, and the third longest side indicates the direction of the y'-axis. Any additional sides will be ignored (ie. a quadrilateral may be used instead of a triangle if the extra side is small). The z'-axis is determined by the cross product of the x' and y' axes, and the origin is the common vertex between x' and y'. The size of the marker is ignored unless the -s option is used, where scale is a multiplier for the x'-axis length to indicate the total scale factor. For example, a scale value of 5 with a marker length of .5 would result in a total scale factor of 2.5 to be used in the transformation. The -e option causes commands in the file to be expanded, and is required to replace markers from commands in the input file. Even with this option, replmarks will not examine objects for markers. Specifically, an object included by replmarks as a result of a -x expansion will be transferred verbatim, without regard to any surfaces therein that might have been considered as marks if they were on the main input. The -m option causes all replaced objects to be given the modifier newmod. Otherwise, the new object surfaces will use their originally defined modifiers. A different replacement modifier may be given for each marker type. The marker modifier name itself is only used to identify markers, and will not appear in the output in any form. If no input file is given, the standard input is read. EXAMPLE
To replace all polygons with the modifier ``knobs'' in the file input with a transformed ``knob.rad'' and write the result to output: replmarks -x knob.rad knobs input > output To use instances of ``tree.oct'' with scaling set to three times the tree marker length: replmarks -s 3 -i tree.oct tree input > output AUTHOR
Greg Ward SEE ALSO
arch2rad(1), ies2rad(1), xform(1) RADIANCE
3/24/94 REPLMARKS(1)
All times are GMT -4. The time now is 03:03 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy