How to sort and compare files in more efficient manner? Post: 302810633

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Is there an efficient way in finding and deleting files?

Hi I have a script to find and delete the files which are say, noDaysOld, I am interested to find the number of such files I am fniding for deleting and then deleting it. So, the script I wrote, first finds the number of such files and then deletes, clearly this is two different steps. ...

2. Shell Programming and Scripting

Why do we use sort before compare ?

Dear All. Im trying to know how exactly the command "compare" works, does it compare line by line or field by field, and the most important thing is that why the files have to be sorted before we compare them? Thanks in advance

3. UNIX for Dummies Questions & Answers

sort and compare files

Hi, I have around 14-15 files and i want to sort all the files and then compare. I dont want to sort them and store in a different file and then compare. I want to compare two files at a time. Is there a way we can do this using a single command?

4. Shell Programming and Scripting

HELP: I need to sort a text file in an uncommon manner, can't get desired results

Hi All I have a flat text file. Each line in it contains a "/full path/filename". The last three columns are predictable, but directory depth of each line varies. I want to sort on the last three columns, starting from the last, 2nd last and 3rd last. In that order. The last three columns...

5. Emergency UNIX and Linux Support

Appending the contents of two files in computational manner

Hi, I have two files say file 1 file 2 File1 1 2 4 5 File 2 asdf adf How to get the ouput something like asdf1 adf1 asdf2 adf2 asdf4 adf4 asdf5

6. Emergency UNIX and Linux Support

Help to make awk script more efficient for large files

Hello, Error awk: Internal software error in the tostring function on TS1101?05044400?.0085498227?0?.0011041461?.0034752266?.00397045?0?0?0?0?0?0?11/02/10?09/23/10???10?no??0??no?sct_det3_10_20110516_143936.txt What it is It is a unix shell script that contains an awk program as well as...

7. Programming

Help with make this Fortran code more efficient (in HPC manner)

Hi there, I had run into some fortran code to modify. Obviously, it was written without thinking of high performance computing and not parallelized... Now I would like to make the code "on track" and parallel. After a whole afternoon thinking, I still cannot find where to start. Can any one...

8. Shell Programming and Scripting

comm command -- Sort and compare two files

Team, I have two files and I am trying to find the lines unique to file1. So i have executed the below command at shell prompt and got the correct results comm -23 <(sort test) <(sort test1) When i run the same command in Bash shell script, i got the correct results. But when i run...

9. Shell Programming and Scripting

Most efficient method to extract values from text files

I have a list of files defined in a single file , one on each line.(No.of files may wary each time) eg. content of ETL_LOOKUP.dat /data/project/randomname /data/project/ramname /data/project/raname /data/project/radomname /data/project/raame /data/project/andomname size of these...

10. Shell Programming and Scripting

More efficient for loop on files

is there a more efficient way to write the following code: for eachlfile in $(ls -ltcrd ${BACKUPDIR}/${BACKUPNAME}*/content_${BACKUPNAME}* 2>/dev/null | awk '{print $NF}') do echo "==========================" ...

LEARN ABOUT OSX

sort

sort(3pm)						 Perl Programmers Reference Guide						 sort(3pm)

NAME

       sort - perl pragma to control sort() behaviour

SYNOPSIS

	   use sort 'stable';	       # guarantee stability
	   use sort '_quicksort';      # use a quicksort algorithm
	   use sort '_mergesort';      # use a mergesort algorithm
	   use sort 'defaults';        # revert to default behavior
	   no  sort 'stable';	       # stability not important

	   use sort '_qsort';	       # alias for quicksort

	   my $current;
	   BEGIN {
	       $current = sort::current();     # identify prevailing algorithm
	   }

DESCRIPTION

       With the "sort" pragma you can control the behaviour of the builtin "sort()" function.

       In Perl versions 5.6 and earlier the quicksort algorithm was used to implement "sort()", but in Perl 5.8 a mergesort algorithm was also
       made available, mainly to guarantee worst case O(N log N) behaviour: the worst case of quicksort is O(N**2).  In Perl 5.8 and later,
       quicksort defends against quadratic behaviour by shuffling large arrays before sorting.

       A stable sort means that for records that compare equal, the original input ordering is preserved.  Mergesort is stable, quicksort is not.
       Stability will matter only if elements that compare equal can be distinguished in some other way.  That means that simple numerical and
       lexical sorts do not profit from stability, since equal elements are indistinguishable.	However, with a comparison such as

	  { substr($a, 0, 3) cmp substr($b, 0, 3) }

       stability might matter because elements that compare equal on the first 3 characters may be distinguished based on subsequent characters.
       In Perl 5.8 and later, quicksort can be stabilized, but doing so will add overhead, so it should only be done if it matters.

       The best algorithm depends on many things.  On average, mergesort does fewer comparisons than quicksort, so it may be better when
       complicated comparison routines are used.  Mergesort also takes advantage of pre-existing order, so it would be favored for using "sort()"
       to merge several sorted arrays.	On the other hand, quicksort is often faster for small arrays, and on arrays of a few distinct values,
       repeated many times.  You can force the choice of algorithm with this pragma, but this feels heavy-handed, so the subpragmas beginning with
       a "_" may not persist beyond Perl 5.8.  The default algorithm is mergesort, which will be stable even if you do not explicitly demand it.
       But the stability of the default sort is a side-effect that could change in later versions.  If stability is important, be sure to say so
       with a

	 use sort 'stable';

       The "no sort" pragma doesn't forbid what follows, it just leaves the choice open.  Thus, after

	 no sort qw(_mergesort stable);

       a mergesort, which happens to be stable, will be employed anyway.  Note that

	 no sort "_quicksort";
	 no sort "_mergesort";

       have exactly the same effect, leaving the choice of sort algorithm open.

CAVEATS

       As of Perl 5.10, this pragma is lexically scoped and takes effect at compile time. In earlier versions its effect was global and took
       effect at run-time; the documentation suggested using "eval()" to change the behaviour:

	 { eval 'use sort qw(defaults _quicksort)'; # force quicksort
	   eval 'no sort "stable"';	 # stability not wanted
	   print sort::current . "
";
	   @a = sort @b;
	   eval 'use sort "defaults"';	 # clean up, for others
	 }
	 { eval 'use sort qw(defaults stable)';     # force stability
	   print sort::current . "
";
	   @c = sort @d;
	   eval 'use sort "defaults"';	 # clean up, for others
	 }

       Such code no longer has the desired effect, for two reasons.  Firstly, the use of "eval()" means that the sorting algorithm is not changed
       until runtime, by which time it's too late to have any effect. Secondly, "sort::current" is also called at run-time, when in fact the
       compile-time value of "sort::current" is the one that matters.

       So now this code would be written:

	 { use sort qw(defaults _quicksort); # force quicksort
	   no sort "stable";	  # stability not wanted
	   my $current;
	   BEGIN { $current = print sort::current; }
	   print "$current
";
	   @a = sort @b;
	   # Pragmas go out of scope at the end of the block
	 }
	 { use sort qw(defaults stable);     # force stability
	   my $current;
	   BEGIN { $current = print sort::current; }
	   print "$current
";
	   @c = sort @d;
	 }

perl v5.16.2							    2012-08-26								 sort(3pm)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Is there an efficient way in finding and deleting files?

Discussion started by: guruparan18

2. Shell Programming and Scripting

Why do we use sort before compare ?

Discussion started by: yahyaaa

3. UNIX for Dummies Questions & Answers

sort and compare files

Discussion started by: dnat

4. Shell Programming and Scripting

HELP: I need to sort a text file in an uncommon manner, can't get desired results

Discussion started by: JakeKatz