performance issue using gzcat, awk and sort Post: 302240150

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

Performance issue

Hello all, I just stuck up in an uncertain situation related to network performance... I am trying to access one of my remote client unix machine from a distant location.. The client machine is Ultra-5_10 , with SunOS 5.5.1 The ndd result ( hme1 )shows that the machine is hooked to a...

2. AIX

performance issue

We have a AIX v5.3 on a p5 system with a poor performing Ingres database. We added one CPU to the system to see if this would help. Now there are two CPU's. with sar and topas -P I see good results: CPU usage around 30% with topas I only see good results in the process output screen, the...

3. UNIX for Advanced & Expert Users

performance issue

Hi, on a linux server I have the following : vmstat 2 10 procs memory swap io system cpu r b w swpd free buff cache si so bi bo in cs us sy id 0 4 0 675236 39836 206060 1617660 3 3 3 6 8 7 1 1 ...

4. Shell Programming and Scripting

gzcat into awk and then change FILENAME and process new FILENAME

I am trying to write a script that prompts users for date and time, then process the gzip file into awk. During the ksh part of the script another file is created and needs to be processed with a different set of pattern matches then I need to combine the two in the end. I'm stuck at the part...

5. Shell Programming and Scripting

Performance issue with awk script.

Hi, The below awk script is taking about 1 hour to fetch just 11 records(columns). There are about 48000 records. The script file name is take_first_uniq.sh #!/bin/ksh if then while read line do first=`echo $line | awk -F"|" '{print $1$2$3}'` while read line2 do...

6. Solaris

Performance issue

Hi Gurus, I am beginner in solaris and want to know what are the things we need to check for performance monitoring on our solairs OS. for DISK,CPU and MEMORY. Also how we do ipforwarding in slaris Many thanks for your help Pradeep P

7. UNIX for Dummies Questions & Answers

Performance issue

hi I am having a performance issue with the following requirement i have to create a permutation and combination on a set of three files such that each record in each file is picked and the output is redirected in a specific format but it is taking around 70 odd hours to prepare a combination...

8. Shell Programming and Scripting

awk performance issue

Hi, I have the code below as cat <filename> | tr '~' '\n' | sed '/^$/ d' | sed "s/*/|/g" > <filename> awk -F\| -vt=`date +%m%d%y%H%M%S%s` '$1=="ST",$1=="SE"{if($1=="ST"){close(f);f="214_edifile_"t"" ++i} ; $1=$1; print>f}' OFS=\| <filename> This script replaces some characters and...

9. UNIX for Dummies Questions & Answers

awk script performance issue

Hello All, I have the below excerpt of code in my shell script and it taking long time to complete, though it prints the output quickly. Is there a way to make it come out once it finds the first instance as the file size of 4.7 GB it could be going through all lines of the data file to find for...

10. UNIX for Dummies Questions & Answers

File sort performance

Hi, I have got a 9.3GB file and it is taking 1h 8min to sort file using the following code: sort -T /directory1 -t | -k9,9 -k8,8n /directory1/file1 > /directory2/file2 Is there a faster way of doing it please? Thanks Shash

LEARN ABOUT PLAN9

join

JOIN(1) 						      General Commands Manual							   JOIN(1)

NAME

       join - relational database operator

SYNOPSIS

       join [ options ] file1 file2

DESCRIPTION

       Join forms, on the standard output, a join of the two relations specified by the lines of file1 and file2.  If one of the file names is the
       standard input is used.

       File1 and file2 must be sorted in increasing ASCII collating sequence on the fields on which they are to be joined, normally the  first	in
       each line.

       There  is  one line in the output for each pair of lines in file1 and file2 that have identical join fields.  The output line normally con-
       sists of the common field, then the rest of the line from file1, then the rest of the line from file2.

       Input fields are normally separated spaces or tabs; output fields by space.  In this case, multiple separators count as	one,  and  leading
       separators are discarded.

       The following options are recognized, with POSIX syntax.

       -a n   In addition to the normal output, produce a line for each unpairable line in file n, where n is 1 or 2.

       -v n   Like -a, omitting output for paired lines.

       -e s   Replace empty output fields by string s.

       -1 m
       -2 m   Join on the mth field of file1 or file2.

       -jn m  Archaic equivalent for -n m.

       -ofields
	      Each  output  line  comprises the designated fields.  The comma-separated field designators are either 0, meaning the join field, or
	      have the form n.m, where n is a file number and m is a field number.  Archaic usage allows separate arguments for field designators.

       -tc    Use character c as the only separator (tab character) on input and output.  Every appearance of c in a line is significant.

EXAMPLES

       sort /adm/users | join -t: -a 1 -e "" - bdays
	      Add birthdays to password information, leaving unknown birthdays empty.  The layout of is given in users(6); bdays  contains  sorted
	      lines like

       tr : ' ' </adm/users | sort -k 3 3 >temp
       join -1 3 -2 3 -o 1.1,2.1 temp temp | awk '$1 < $2'
	      Print all pairs of users with identical userids.

SOURCE

       /sys/src/cmd/join.c

SEE ALSO

       sort(1), comm(1), awk(1)

BUGS

       With default field separation, the collating sequence is that of sort -b -ky,y; with -t, the sequence is that of sort -tx -ky,y.
       One of the files must be randomly accessible.

																	   JOIN(1)

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

Performance issue

Discussion started by: shibz

2. AIX

performance issue

Discussion started by: rein

3. UNIX for Advanced & Expert Users

performance issue

Discussion started by: big123456

4. Shell Programming and Scripting

gzcat into awk and then change FILENAME and process new FILENAME

Discussion started by: timj123

5. Shell Programming and Scripting

Performance issue with awk script.

Discussion started by: RRVARMA

6. Solaris

Performance issue

Discussion started by: ppandey21

7. UNIX for Dummies Questions & Answers

Performance issue

Discussion started by: mad_man12

8. Shell Programming and Scripting

awk performance issue

Discussion started by: atlantis_yy

9. UNIX for Dummies Questions & Answers

awk script performance issue

Discussion started by: Ariean

10. UNIX for Dummies Questions & Answers

File sort performance

Discussion started by: shash

LEARN ABOUT PLAN9

join