While find . -name "star_st*" -exec head -1 {} + | grep "1175 876330" helped cut the run time in half, it is still considered very slow.
I am answering the requested questions in the hope that they help find a clue and a solution.
If . is on a remote filesystem, network issues could also have a significant impact? It's on the same local file system, not remote.
Is there other heavy load on the system? Yes. Here is the output of top showing high CPU usage.
However, one of my colleagues says that high CPU is common in this case and shouldn't affect the find command.
How big is the file hierarchy rooted in .? There is no hierarchy; I'm in the same directory in which the find command runs.
How many files have names starting with star_st? 180954
Last edited by mohtashims; 05-19-2015 at 06:43 AM..
Hi, I have 99583 files in the directory starting with star_st.
Also, I tried the suggestion, but it looks like there is a syntax error:
I'm on Linux.
Is there other heavy load on the system? Answer: No.
How big is the file hierarchy rooted in .? Answer: All the files are in the same directory; there are NO subdirectories.
If . is on a remote filesystem, network issues could also have a significant impact. Answer: It is on the same local file system.
There is a HUGE difference between your command above:
and the command I suggested:
If you put in the double-quotes I suggested (or the single-quotes agent.kgb suggested), it should work.
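To illustrate why the quoting matters, here is a minimal sketch. The file names and directory are hypothetical stand-ins for the real ~100,000 star_st* files; the pattern and search string are taken from the commands earlier in the thread.

```shell
# Sample setup (hypothetical names standing in for the real files):
cd "$(mktemp -d)"
printf '1175 876330 x\nrest\n' > star_st_a
printf 'nothing\nhere\n'       > star_st_b

# Unquoted, the shell expands star_st* BEFORE find runs, so find receives
# the expanded file names (or the shell overflows ARG_MAX) instead of the
# pattern:
#   find . -name star_st* -exec head -1 {} + | grep "1175 876330"

# Quoted, the pattern reaches find intact and find does the matching:
find . -name "star_st*" -exec head -1 {} + | grep "1175 876330"
```

With the quotes, find itself walks the directory and matches names, so the command works no matter how many files match.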
But if the ls you showed us above worked and all of the files are in a single directory, try just using:
And, if I'm reading your code correctly, RavinderSingh13's awk script can be modified to be more efficient than the above suggestion:
The nextfile command in awk is an extension to the standards, but I believe it is present in awk on Linux systems. If your awk doesn't include nextfile and your star_st* files are small, you could try:
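A minimal sketch of such a fallback, consistent with the description above (FNR == 1, no nextfile); the exact command posted is not shown here, and the sample file names are hypothetical:

```shell
# Sample setup (hypothetical names standing in for the real star_st* files):
cd "$(mktemp -d)"
printf '1175 876330 first\nsecond line\n' > star_st_a
printf 'no match here\n1175 876330\n'     > star_st_b

# Reads every line of every file, but only tests the 1st line of each;
# works in any POSIX awk, no nextfile extension needed:
awk 'FNR == 1 && /1175 876330/' star_st*
# -> 1175 876330 first
```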
The head and grep pipeline above may be faster if your files are larger than one block, depending on your average file size and the block size used on the filesystem containing your files. (Note that the above awk uses FNR == 1, not the NR == 1 in RavinderSingh13's script, which would only look at the 1st line in the 1st file instead of looking at the 1st line in each file.)
Could you please try the following command and let me know if it helps.
According to your statement, all the files are in the same directory without any subdirectories, so the following may help:
Thanks,
R. Singh
I get an error while running your suggestion: bash: /bin/awk: Argument list too long
And, if I'm reading your code correctly, RavinderSingh13's awk script can be modified to be more efficient than the above suggestion:
Code:
cd /directory/containing/your/files
awk '/1175 876330/
{nextfile}' star_st*
Hello Don,
Thank you for correcting me; I think your code should have included !~ instead of ~, as follows.
Please do correct me if I am wrong here.
Thanks,
R. Singh
Sorry, but I think you're wrong. The first line of the script, /1175 876330/, is a pattern with no action part, so awk uses the default action: print the current line when it contains the string 1175 876330. This simulates the action of the grep. With !~ instead of ~, it would simulate grep -v ....
The second line of the script, {nextfile}, has no condition, so it applies to every input line; it skips to the next input file after processing the 1st line in a file, which duplicates the action of head -1 on each file processed.
And, when we're processing almost 100,000 files, we need to run this command in the directory where the files are located and just pass filenames as operands. Passing the absolute pathnames of 100,000 files runs a MUCH higher chance of exceeding ARG_MAX limits when execing awk. (Which mohtashims reported as a problem in post #11 in this thread.)
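The limit can be inspected directly, and find's "-exec ... {} +" form batches file names so that no single exec exceeds it. A sketch, with hypothetical sample files standing in for the real ~100,000:

```shell
# The kernel's limit on the combined size of argv + environment (bytes):
getconf ARG_MAX

# Sample setup (hypothetical names standing in for the real files):
cd "$(mktemp -d)"
printf '1175 876330 hit\nmore lines\n' > star_st_1
printf 'miss\n1175 876330 later\n'     > star_st_2

# find batches the filenames passed to each awk invocation, so no single
# exec exceeds ARG_MAX, unlike letting the shell expand star_st* itself:
find . -maxdepth 1 -name 'star_st*' -exec awk '/1175 876330/
{nextfile}' {} +
```

Running the command from inside the directory with bare filename operands keeps each argument short; combined with batching, "Argument list too long" goes away.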
Last edited by Don Cragun; 05-19-2015 at 07:24 AM..
Reason: Fix typo caused by auto spellcheck corrections.