Quickest way to get the total number of lines in a file
# 8  
Old 10-01-2012
Code:
sed -n '$=' input_file

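For what it's worth, several other standard tools will report the same count. A minimal sketch (the `/tmp` sample path is just for illustration):

```shell
# Create a small throwaway sample file (path is hypothetical)
printf 'a\nb\nc\n' > /tmp/input_file

sed -n '$=' /tmp/input_file             # sed: print the line number of the last line
awk 'END { print NR }' /tmp/input_file  # awk: NR holds the record count at end of input
grep -c '' /tmp/input_file              # grep: count lines matching the empty pattern
```

One caveat: on an empty file, `awk` and `grep -c` print `0`, while `sed -n '$='` prints nothing at all.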
This User Gave Thanks to msabhi For This Post:
# 9  
Old 10-01-2012
I think wc -l file is an efficient way of counting lines compared to the other methods discussed above.
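One small wrinkle worth knowing: when given a file argument, `wc -l` echoes the file name after the count. A quick sketch (the sample path is made up):

```shell
# Throwaway two-line file for demonstration (path is hypothetical)
printf 'one\ntwo\n' > /tmp/sample.txt

wc -l /tmp/sample.txt    # prints the count followed by the file name
wc -l < /tmp/sample.txt  # reading from stdin prints the bare count only
```

The redirection form is handy when you want just the number for use in a script variable.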
This User Gave Thanks to pamu For This Post:
# 10  
Old 10-01-2012
If you start testing methods on a file, be aware of the effect of file caching by the OS and disk controllers. You will get completely bogus results if you are not aware of this. I/O wait time is the biggest time consumer: disks are at the very best ten times slower than memory unless you have an SSD.

Pretend you try sed and get this answer:
Code:
time sed -n '$=' input_file
real    0m2.098s
user    0m0.516s
sys     0m0.338s

Great - that took 2.098 seconds of wall time.
Let's try wc -l
Code:
time wc -l input_file
real    0m0.778s
user    0m0.416s
sys     0m0.338s

Wow. wc -l was faster.

No. A lot of the file data was still in cache, so there was no I/O wait. Why? Because you ran against the same file. As you read through a file, the system attempts to cache all or part of it, depending on available resources.

The file data in the cache slowly goes away as other users read and write the same disk; after a while the file is no longer cached. How long that takes, I cannot say. Solaris will use part of free memory as a file cache, as will Linux. Add to this what the disk controller caches, and large chunks of really huge files can sit in memory.

SAN storage behaves in a similar way, but is a lot more complex. SAN is generally slower than direct-attached disk, though some systems have faster directio options. The fastest storage is raw disk, which bypasses the filesystem and the kernel's filesystem-support code; Oracle will do this for its database files if configured.

You can also tune the filesystem itself.

If you need to speed up file I/O on a desktop, look into an SSD.
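To make the cold-versus-warm point concrete, here is one way to sketch a fairer comparison. Assumptions: Linux, and root access for the page-cache flush (without root, the flush step is simply skipped and both runs will be warm):

```shell
# Build a throwaway test file (a serious test needs a file much larger than RAM)
seq 1 100000 > /tmp/input_file

sync                                    # flush dirty pages to disk first
if [ -w /proc/sys/vm/drop_caches ]; then
    echo 3 > /proc/sys/vm/drop_caches   # Linux, root only: empty the page cache
fi

time wc -l /tmp/input_file              # cold cache: dominated by disk I/O
time wc -l /tmp/input_file              # warm cache: usually much faster
```

The second run reads mostly from the page cache, which is exactly the effect that skews naive benchmarks.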
These 2 Users Gave Thanks to jim mcnamara For This Post:
# 11  
Old 10-01-2012
Quote:
Originally Posted by jim mcnamara
If you start testing methods on a file, be aware of the effect of file caching by the OS and disk controllers. You will get completely bogus results if you are not aware of this. [...]
Thank you so much for the detailed explanation. I've always wondered why I sometimes get a faster response and other times a much slower one when running the same command on a file. Now I know. Thanks a million.
# 12  
Old 10-01-2012
Quote:
Originally Posted by jim mcnamara
If you start testing methods on a file, be aware of the effect of file caching by the OS and disk controllers. You will get completely bogus results if you are not aware of this. [...]
Yeah, I thought so when I tested and saw the varying times. Very good food for thought, Jim. Thanks.
# 13  
Old 10-01-2012
Hi.

See also the post at https://www.unix.com/shell-programmin...ines-file.html for some additional timings ... cheers, drl
This User Gave Thanks to drl For This Post: