Sponsored Content
Top Forums Shell Programming and Scripting Split file based on file size in Korn script Post 302661723 by in2nix4life on Monday 25th of June 2012 04:14:50 PM
Old 06-25-2012
This may help get you started:

Code:
#!/bin/ksh
#
#

# declare an array and populate it with files larger
# than 2GBs in the current directory
set -A files $(find . -maxdepth 1 -size +2000000 -type f | sed 's/\.\///')

# set counter
counter=0

# get number of files in the array
numfiles=${#files[*]}

# set linecount
linecount=0

# set number of lines
numlines=0

# iterate through the array files, retrieve the line count,
# divide it by 2 and feed everything to the split command
while [ $counter -lt $numfiles ]
do
    linecount=$(wc -l ${files[$counter]} | awk '{print $1}')
    numlines=$(expr $linecount / 2)
    split -l $numlines ${files[$counter]} ${files[$counter]}
    ((counter=$counter+1))
done

# done
exit 0

This User Gave Thanks to in2nix4life For This Post:
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk script to split a file based on the condition

I have the file with the records like 4234234 US phone 3244234 US cup 2342342 CA phone 8947234 US phone 2389472 CA cup 2348972 US maps 3894234 CA phone I want the records with (US,phone) as record to be in one file, (Us, cup) in another file and (CA,cup) to be in another I mean all... (12 Replies)
Discussion started by: superprogrammer
12 Replies

2. Programming

create a spool file based on values passed from korn shell to sql script

this is my issue. 4 parameters are passed from korn shell to sql script. parameter_1= varchar2 datatype or no value entered my user. parameter_2= number datatype or no value entered my user. parameter_3= number datatype or no value entered my user. parameter_4= number datatype or no... (5 Replies)
Discussion started by: megha2525
5 Replies

3. Shell Programming and Scripting

Split file based on size

Hi Friends, Below is my requirement. I have a file with the below structure. 0001A1.... 0001B1.. .... 0001L1 0002A1 0002B1 ...... 0002L1 .. the first 4 characters are the sequence numbers for a record, A record will start with A1 and end with L1 with same sequence number. Now the... (2 Replies)
Discussion started by: diva_thilak
2 Replies

4. Shell Programming and Scripting

Split a file into multiple files based on first two digits of file.

Hi , I do have a fixedwidth flatfile that has data for 10 different datasets each identified by the first two digits in the flatfile. 01 in the first two digit position refers to Set A 02 in the first two digit position refers to Set B and so on I want to genrate 10 different files from my... (6 Replies)
Discussion started by: okkadu
6 Replies

5. Shell Programming and Scripting

Split a file based on pattern and size

Hello, I have a large file (2GB) that I would like to split based on pattern and size. I've used the following command to split the file (token is "HELLO") awk '/HELLO/{i++}{print > "file"i}' input.txt and the output is similar to the following (i included filesize in KB): 10 ... (2 Replies)
Discussion started by: jl487
2 Replies

6. Shell Programming and Scripting

How to split file into multiple files using awk based on 1 field in the file?

Good day all I need some helps, say that I have data like below, each field separated by a tab DATE NAME ADDRESS 15/7/2012 LX a.b.c 15/7/2012 LX1 a.b.c 16/7/2012 AB a.b.c 16/7/2012 AB2 a.b.c 15/7/2012 LX2 a.b.c... (2 Replies)
Discussion started by: alexyyw
2 Replies

7. Shell Programming and Scripting

awk script to split file into multiple files based on many columns

So I have a space delimited file that I'd like to split into multiple files based on multiple column values. This is what my data looks like 1bc9A02 1 10 1000 FTDLNLVQALRQFLWSFRLPGEAQKIDRMMEAFAQRYCQCNNGVFQSTDTCYVLSFAIIMLNTSLHNPNVKDKPTVERFIAMNRGINDGGDLPEELLRNLYESIKNEPFKIPELEHHHHHH 1ku1A02 1 10... (9 Replies)
Discussion started by: viored
9 Replies

8. Shell Programming and Scripting

Split the File based on Size

I have a file that is about 7 GB in size. The requirement is I should split the file equally in such a way that the size of the split files is less than 2Gb. If the file is less than 2gb, than nothing needs to be done. ( need to done using shell script) Thanks, (4 Replies)
Discussion started by: rudoraj
4 Replies

9. Answers to Frequently Asked Questions

How to split a dat file based on another file ni UNIX?

i have two files , one is var.txt and another res.dat file var.txt contains informaton like below date,request,sales,item 20171015,1,123456,216 20171015,1,123456,217 20171015,2,345678,214 20171015,3,456789,218 and res.dat contains is a one huge file contains information like... (1 Reply)
Discussion started by: pogo
1 Replies

10. UNIX for Beginners Questions & Answers

File Size Split up based on Month

Hi, I have a directory in Unix and there are folders available in the directory. Files are created on different month and now i have a requirement to calculate size of the folder on month basis. Is there any Unix command to check this please?? Thanks (6 Replies)
Discussion started by: Nivas
6 Replies
PMC_CONTROL(2)						      BSD System Calls Manual						    PMC_CONTROL(2)

NAME
pmc_control, pmc_get_info -- Hardware Performance Monitoring Interface LIBRARY
Standard C Library (libc, -lc) SYNOPSIS
#include <sys/pmc.h> int pmc_control(int ctr, int op, void *argp); int pmc_get_info(int ctr, int op, void *argp); DESCRIPTION
pmc_get_info() returns the number of counters in the system or information on a specified counter ctr. The possible values for op are: PMC_INFO_NCOUNTERS When querying the number of counters in the system, ctr is ignored and argp is of type int *. Upon return, the integer pointed to by argp will contain the number of counters that are available in the system. PMC_INFO_CPUCTR_TYPE When querying the type of a counter in the system, ctr refers to the counter being queried, and argp is of type int *. Upon return, the integer pointed to by argp will contain the implementation-dependent type of the specified counter. If ctr is -1, the integer pointed to by argp will contain the machine-dependent type describing the CPU or counter configuration. PMC_INFO_COUNTER_VALUE When querying the value of a counter in the system, ctr refers to the counter being queried, and argp is of type uint64_t *. Upon return, the 64-bit integer pointed to by argp will contain the value of the specified counter. PMC_INFO_ACCUMULATED_COUNTER_VALUE When querying the value of a counter in the system, ctr refers to the counter being queried, and argp is of type uint64_t *. Upon return, the 64-bit integer pointed to by argp will contain the sum of the accumulated values of specified counter in all exited sub- processes of the current process. pmc_control() manipulates the specified counter ctr in one of several fashions. The op parameter determines the action taken by the kernel and also the interpretation of the argp parameter. The possible values for op are: PMC_OP_START Starts the specified ctr running. It must be preceded by a call with PMC_OP_CONFIGURE. argp is ignored in this case and may be NULL. PMC_OP_STOP Stops the specified ctr from running. argp is ignored in this case and may be NULL. PMC_OP_CONFIGURE Configures the specified ctr prior to running. argp is a pointer to a struct pmc_counter_cfg. struct pmc_counter_cfg { pmc_evid_t event_id; pmc_ctr_t reset_value; uint32_t flags; }; event_id is the event ID to be counted. reset_value is a value to which the counter should be reset on overflow (if supported by the implementation). This is most useful when profiling (see PMC_OP_PROFSTART, below). This value is defined to be the number of counter ticks before the next overflow. So, to get a profiling tick on every hundredth data cache miss, set the event_id to the proper value for ``dcache-miss'' and set reset_value to 100. flags Currently unused. PMC_OP_PROFSTART Configures the specified ctr for use in profiling. argp is a pointer to a struct pmc_counter_cfg as in PMC_OP_CONFIGURE, above. This request allocates a kernel counter, which will fail if any process is using the requested counter. Not all implementations or coun- ters may support this option. PMC_OP_PROFSTOP Stops the specified ctr from being used for profiling. argp is ignored in this case and may be NULL. RETURN VALUES
A return value of 0 indicates that the call succeeded. Otherwise, -1 is returned and the global variable errno is set to indicate the error. ERRORS
Among the possible error codes from pmc_control() and pmc_get_info() are [EFAULT] The address specified for the argp is invalid. [ENXIO] Specified counter is not yet configured. [EINPROGRESS] PMC_OP_START was passed for a counter that is already running. [EINVAL] Specified counter was invalid. [EBUSY] If the requested counter is already in use--either by the current process or by the kernel. [ENODEV] If and only if the specified counter event is not valid for the specified counter when configuring a counter or starting profiling. [ENOMEM] If the kernel is unable to allocate memory. SEE ALSO
pmc(1), pmc(9) HISTORY
The pmc_control() and pmc_get_info() system calls appeared in NetBSD 2.0. BSD
October 27, 2005 BSD
All times are GMT -4. The time now is 02:09 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy