Remove duplicate lines based on field and sort Post: 302608471

10 More Discussions You Might Find Interesting

1. Solaris

How to remove duplicate records with out sort

Can any one give me command How to delete duplicate records with out sort. Suppose if the records like below: 345,bcd,789 123,abc,456 234,abc,456 712,bcd,789 out tput should be 345,bcd,789 123,abc,456 Key for the records is 2nd and 3rd fields.fields are seperated by colon(,).

2. Shell Programming and Scripting

Remove lines, Sorted with Time based columns using AWK & SORT

Hi having a file as follows MediaErr.log 84 Server1 Policy1 Schedule1 master1 05/08/2008 02:12:16 84 Server1 Policy1 Schedule1 master1 05/08/2008 02:22:47 84 Server1 Policy1 Schedule1 master1 05/08/2008 03:41:26 84 Server1 Policy1 ...

3. Shell Programming and Scripting

How to remove duplicate records with out sort

4. Shell Programming and Scripting

Remove duplicate lines (the first matching line by field criteria)

Hello to all, I have this file 2002 1 23 0 0 2435.60 131.70 5.60 20.99 0.89 0.00 285.80 2303.90 2002 1 23 15 0 2436.60 132.90 6.45 21.19 1.03 0.00 285.80 2303.70 2002 1 23 ...

5. Shell Programming and Scripting

Sort and Remove Duplicate on file

How do we sort and remove duplicate on column 1,2 retaining the record with maximum date (in feild 3) for the file with following format. aaa|1234|2010-12-31 aaa|1234|2010-11-10 bbb|345|2011-01-01 ccc|346|2011-02-01 bbb|345|2011-03-10 aaa|1234|2010-01-01 Required Output ...

6. UNIX for Dummies Questions & Answers

remove duplicate lines based on two columns and judging from a third one

hello all, I have an input file with four columns like this with a lot of lines and for example, line 1 and line 5 match because the first 4 characters match and the fourth column matches too. I want to keep the line that has the lowest number in the third column. So I discard line 5....

7. Shell Programming and Scripting

Remove lines with duplicate first field

Trying to cut down the size of some log files. Now that I write this out it looks more dificult than i thought it would be. Need a bash script or command that goes sequentially through all lines of a file, and does this: if field1 (space separated) is the number 2012 print the entire line. Do...

8. Shell Programming and Scripting

Remove duplicate value based on two field $4 and $5

Hi All, i have input file like below... CA009156;20091003;M;AWBKCA72;123;;CANADIAN WESTERN BANK;EDMONTON;;2300, 10303, JASPER AVENUE;;T5J 3X6;; CA009156;20091003;M;AWBKCA72;321;;CANADIAN WESTERN BANK;EDMONTON;;2300, 10303, JASPER AVENUE;;T5J 3X6;; CA009156;20091003;M;AWBKCA72;231;;CANADIAN...

9. Shell Programming and Scripting

Remove duplicate lines from file based on fields

Dear community, I have to remove duplicate lines from a file contains a very big ammount of rows (milions?) based on 1st and 3rd columns The data are like this: Region 23/11/2014 09:11:36 41752 Medio 23/11/2014 03:11:38 4132 Info 23/11/2014 05:11:09 4323...

10. Shell Programming and Scripting

Remove duplicate lines, sort it and save it as file itself

Hi, all I have a csv file that I would like to remove duplicate lines based on 1st field and sort them by the 1st field. If there are more than 1 line which is same on the 1st field, I want to keep the first line of them and remove the rest. I think I have to use uniq or something, but I still...

LEARN ABOUT OSX

sort

sort(3pm)						 Perl Programmers Reference Guide						 sort(3pm)

NAME

       sort - perl pragma to control sort() behaviour

SYNOPSIS

	   use sort 'stable';	       # guarantee stability
	   use sort '_quicksort';      # use a quicksort algorithm
	   use sort '_mergesort';      # use a mergesort algorithm
	   use sort 'defaults';        # revert to default behavior
	   no  sort 'stable';	       # stability not important

	   use sort '_qsort';	       # alias for quicksort

	   my $current;
	   BEGIN {
	       $current = sort::current();     # identify prevailing algorithm
	   }

DESCRIPTION

       With the "sort" pragma you can control the behaviour of the builtin "sort()" function.

       In Perl versions 5.6 and earlier the quicksort algorithm was used to implement "sort()", but in Perl 5.8 a mergesort algorithm was also
       made available, mainly to guarantee worst case O(N log N) behaviour: the worst case of quicksort is O(N**2).  In Perl 5.8 and later,
       quicksort defends against quadratic behaviour by shuffling large arrays before sorting.

       A stable sort means that for records that compare equal, the original input ordering is preserved.  Mergesort is stable, quicksort is not.
       Stability will matter only if elements that compare equal can be distinguished in some other way.  That means that simple numerical and
       lexical sorts do not profit from stability, since equal elements are indistinguishable.	However, with a comparison such as

	  { substr($a, 0, 3) cmp substr($b, 0, 3) }

       stability might matter because elements that compare equal on the first 3 characters may be distinguished based on subsequent characters.
       In Perl 5.8 and later, quicksort can be stabilized, but doing so will add overhead, so it should only be done if it matters.

       The best algorithm depends on many things.  On average, mergesort does fewer comparisons than quicksort, so it may be better when
       complicated comparison routines are used.  Mergesort also takes advantage of pre-existing order, so it would be favored for using "sort()"
       to merge several sorted arrays.	On the other hand, quicksort is often faster for small arrays, and on arrays of a few distinct values,
       repeated many times.  You can force the choice of algorithm with this pragma, but this feels heavy-handed, so the subpragmas beginning with
       a "_" may not persist beyond Perl 5.8.  The default algorithm is mergesort, which will be stable even if you do not explicitly demand it.
       But the stability of the default sort is a side-effect that could change in later versions.  If stability is important, be sure to say so
       with a

	 use sort 'stable';

       The "no sort" pragma doesn't forbid what follows, it just leaves the choice open.  Thus, after

	 no sort qw(_mergesort stable);

       a mergesort, which happens to be stable, will be employed anyway.  Note that

	 no sort "_quicksort";
	 no sort "_mergesort";

       have exactly the same effect, leaving the choice of sort algorithm open.

CAVEATS

       As of Perl 5.10, this pragma is lexically scoped and takes effect at compile time. In earlier versions its effect was global and took
       effect at run-time; the documentation suggested using "eval()" to change the behaviour:

	 { eval 'use sort qw(defaults _quicksort)'; # force quicksort
	   eval 'no sort "stable"';	 # stability not wanted
	   print sort::current . "
";
	   @a = sort @b;
	   eval 'use sort "defaults"';	 # clean up, for others
	 }
	 { eval 'use sort qw(defaults stable)';     # force stability
	   print sort::current . "
";
	   @c = sort @d;
	   eval 'use sort "defaults"';	 # clean up, for others
	 }

       Such code no longer has the desired effect, for two reasons.  Firstly, the use of "eval()" means that the sorting algorithm is not changed
       until runtime, by which time it's too late to have any effect. Secondly, "sort::current" is also called at run-time, when in fact the
       compile-time value of "sort::current" is the one that matters.

       So now this code would be written:

	 { use sort qw(defaults _quicksort); # force quicksort
	   no sort "stable";	  # stability not wanted
	   my $current;
	   BEGIN { $current = print sort::current; }
	   print "$current
";
	   @a = sort @b;
	   # Pragmas go out of scope at the end of the block
	 }
	 { use sort qw(defaults stable);     # force stability
	   my $current;
	   BEGIN { $current = print sort::current; }
	   print "$current
";
	   @c = sort @d;
	 }

perl v5.16.2							    2012-08-26								 sort(3pm)

10 More Discussions You Might Find Interesting

1. Solaris

How to remove duplicate records with out sort

Discussion started by: svenkatareddy

2. Shell Programming and Scripting

Remove lines, Sorted with Time based columns using AWK & SORT

Discussion started by: karthikn7974

3. Shell Programming and Scripting

How to remove duplicate records with out sort

Discussion started by: svenkatareddy

4. Shell Programming and Scripting

Remove duplicate lines (the first matching line by field criteria)

Discussion started by: joggdial3000

5. Shell Programming and Scripting

Sort and Remove Duplicate on file

Discussion started by: mabarif16

6. UNIX for Dummies Questions & Answers

remove duplicate lines based on two columns and judging from a third one

Discussion started by: TheTransporter

7. Shell Programming and Scripting

Remove lines with duplicate first field

Discussion started by: ajp7701

8. Shell Programming and Scripting

Remove duplicate value based on two field $4 and $5

Discussion started by: mohan sharma

9. Shell Programming and Scripting

Remove duplicate lines from file based on fields

Discussion started by: Lord Spectre

10. Shell Programming and Scripting

Remove duplicate lines, sort it and save it as file itself

Discussion started by: refrain

LEARN ABOUT OSX

sort