Post 302437947 by ad23 on Friday 16th of July 2010 05:35:39 PM
Divide large data files into smaller files

Hello everyone!

I have 2 types of files in the following format:

Code:
1) *.fa

>1234
...some text...
>2345
...some text...
>3456
...some text...
.
.
.
.

2) *.info

>1234
...some numbers...
>2345
...some numbers...
>3456
...some numbers...
.
.
.
.

I need to split these huge files (~300-400 MB each) into smaller files of around 30-40 MB each. I also don't want to break up any record: every record starting with '>' must end up, together with its corresponding information, in a single smaller file.

Can someone please suggest a way to do this using Unix commands?

Thanks!!!
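
One possible approach, sketched here under assumptions rather than given as a definitive answer: buffer one record at a time in awk and start a new output file only at a '>' boundary, once the current file would exceed a size limit. The function name `split_records`, its three arguments, and the `PREFIX.001`, `PREFIX.002`, ... output naming are all made up for this illustration. Sizes are counted with awk's `length()`, which is in characters, so the limit is only approximate for multibyte text.

```shell
#!/bin/sh
# split_records FILE MAX_SIZE PREFIX   (hypothetical helper, not a standard tool)
# Writes PREFIX.001, PREFIX.002, ... so that each piece stays at or under
# MAX_SIZE where possible and no record (a '>' line plus the lines that
# follow it) is ever split across two pieces.
split_records() {
    awk -v max="$2" -v prefix="$3" '
        # Write the buffered record, opening a new piece first if needed.
        function flush_record() {
            if (rec == "") return
            # A new piece starts when nothing is open yet, or when this
            # record would push a non-empty piece past the limit.
            if (out == "" || (size > 0 && size + rec_len > max)) {
                if (out != "") close(out)
                out = sprintf("%s.%03d", prefix, ++n)
                size = 0
            }
            printf "%s", rec > out
            size += rec_len
            rec = ""
            rec_len = 0
        }
        /^>/ { flush_record() }                      # a header line ends the previous record
             { rec = rec $0 "\n"; rec_len += length($0) + 1 }
        END  { flush_record() }                      # do not lose the last record
    ' "$1"
}

# Example call (file name is a placeholder), for pieces of about 35 MB:
#   split_records data.fa 36700160 data.fa.part
```

A record that is larger than the limit on its own gets a piece to itself instead of being cut. Note that running this separately on the *.fa and the *.info file will generally place the piece boundaries at different records, since the record sizes differ between the two files.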
 
