Script Optimization - large delimited file, for loop with many greps Post: 302517433

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Directory sizes loop optimization

I have the following script: #!/usr/bin/ksh export MDIR=/datafiles NAME=$1 SERVER=$2 DIRECTORY=$3 DATABASE=$4 ID=$5 export dirlist=`/usr/bin/ssh -q $ID@$SERVER find $DIRECTORY -type d -print` for dir in $dirlist do SIZE=`</dev/null /usr/bin/ssh -q $ID@$SERVER du -ks $dir` echo...

2. UNIX for Dummies Questions & Answers

Command that creates file and also greps that file?

I have a command that does something and then creates a log file (importlog.xml). I then want to grep that newly created log (importlog.xml) file for a certain word (success). I then want to write that grep result to a new file (success.log). So far I can run the command which creates the...

3. Shell Programming and Scripting

Large pipe delimited file that I need to add CR/LF every n fields

I have a large flat file with variable length fields that are pipe delimited. The file has no new line or CR/LF characters to indicate a new record. I need to parse the file and after some number of fields, I need to insert a CR/LF to start the next record. Input file ...

4. Shell Programming and Scripting

Extracting a portion of data from a very large tab delimited text file

Hi All I wanted to know how to effectively delete some columns in a large tab delimited file. I have a file that contains 5 columns and almost 100,000 rows 3456 f g t t 3456 g h 456 f h 4567 f g h z 345 f g 567 h j k lThis is a very large data file and tab delimited. I need...

5. Shell Programming and Scripting

help with a shell script that greps an error from the logs

Hello everyone. I wrote the following script but the second part is not excecuting. It is not sending the notification by email if the error occurs. the send mail is working so i think the errorr should be in the if statement LOGDIR=/logs/out LOG=`date "+%Y%m%d"`.LOG-FILE.out #the log file ...

6. Shell Programming and Scripting

Removing dupes within 2 delimited areas in a large dictionary file

Hello, I have a very large dictionary file which is in text format and which contains a large number of sub-sections. Each sub-section starts with the following header : #DATA #VALID 1 and ends with a footer as shown below #END The data between the Header and the Footer consists of...

7. Shell Programming and Scripting

Need a script to convert comma delimited files to semi colon delimited

Hi All, I need a unix script to convert .csv files to .skv files (changing a comma delimited file to a semi colon delimited file). I am a unix newbie and so don't know where to start. The script will be scheduled using cron and needs to convert each .csv file in a particular folder to a .skv...

8. Shell Programming and Scripting

Tab Delimited file in loop

Hi, I have requirement to create tab delimited file with values coming from variables. File will contain only two columns separated by tab. Header will be added once. Values will be keep adding upon the script run. If values already exists then values will be replaced. I have done so...

9. UNIX for Advanced & Expert Users

Need optimized awk/perl/shell to give the statistics for the Large delimited file

I have a file size is around 24 G with 14 columns, delimiter with "|" My requirement- can anyone provide me the fastest and best to get the below results Number of records of the file First column and second Column- Unique counts Thanks for your time Karti ------ Post updated at...

10. UNIX for Advanced & Expert Users

Need Optimization shell/awk script to aggreagte (sum) for all the columns of Huge data file

Optimization shell/awk script to aggregate (sum) for all the columns of Huge data file File delimiter "|" Need to have Sum of all columns, with column number : aggregation (summation) for each column File not having the header Like below - Column 1 "Total Column 2 : "Total ... ......

LEARN ABOUT FREEBSD

csplit

CSPLIT(1)						    BSD General Commands Manual 						 CSPLIT(1)

NAME

     csplit -- split files based on context

SYNOPSIS

     csplit [-ks] [-f prefix] [-n number] file args ...

DESCRIPTION

     The csplit utility splits file into pieces using the patterns args.  If file is a dash ('-'), csplit reads from standard input.

     Files are created with a prefix of ``xx'' and two decimal digits.	The size of each file is written to standard output as it is created.  If
     an error occurs whilst files are being created, or a HUP, INT, or TERM signal is received, all files previously written are removed.

     The options are as follows:

     -f prefix
	     Create file names beginning with prefix, instead of ``xx''.

     -k      Do not remove previously created files if an error occurs or a HUP, INT, or TERM signal is received.

     -n number
	     Create file names beginning with number of decimal digits after the prefix, instead of 2.

     -s      Do not write the size of each output file to standard output as it is created.

     The args operands may be a combination of the following patterns:

     /regexp/[[+|-]offset]
	     Create a file containing the input from the current line to (but not including) the next line matching the given basic regular
	     expression.  An optional offset from the line that matched may be specified.

     %regexp%[[+|-]offset]
	     Same as above but a file is not created for the output.

     line_no
	     Create containing the input from the current line to (but not including) the specified line number.

     {num}   Repeat the previous pattern the specified number of times.  If it follows a line number pattern, a new file will be created for each
	     line_no lines, num times.	The first line of the file is line number 1 for historic reasons.

     After all the patterns have been processed, the remaining input data (if there is any) will be written to a new file.

     Requesting to split at a line before the current line number or past the end of the file will result in an error.

ENVIRONMENT

     The LANG, LC_ALL, LC_COLLATE and LC_CTYPE environment variables affect the execution of csplit as described in environ(7).

EXIT STATUS

     The csplit utility exits 0 on success, and >0 if an error occurs.

EXAMPLES

     Split the mdoc(7) file foo.1 into one file for each section (up to 21 plus one for the rest, if any):

	   csplit -k foo.1 '%^.Sh%' '/^.Sh/' '{20}'

     Split standard input after the first 99 lines and every 100 lines thereafter:

	   csplit -k - 100 '{19}'

SEE ALSO

     sed(1), split(1), re_format(7)

STANDARDS

     The csplit utility conforms to IEEE Std 1003.1-2001 (``POSIX.1'').

HISTORY

     A csplit command appeared in PWB UNIX.

BUGS

     Input lines are limited to LINE_MAX (2048) bytes in length.

BSD
								 February 6, 2014							       BSD