Sort and Unique in Perl


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Sort and Unique in Perl
# 8  
Old 02-08-2008
Quote:
Originally Posted by matrixmadhan
read the file
split the record
use the third field
populate in a hash => this would maintain uniqueness
when displaying use sort keys %hash
Thanks for your input. Iam new to perl is it possible to give one simple example pls.
# 9  
Old 02-08-2008
sample code and file
try this

Code:
>cat b
1|2|3
4|9|4
3|1|2

Code:
>cat b.pl
#! /opt/third-party/bin/perl

open(FILE, "<", $ARGV[0]);

while(<FILE>) {
  chomp;
  my @arr = split(/\|/);
  $fileHash{$arr[1]}++;
}

close(FILE);

foreach my $k ( sort keys %fileHash ) {
  print "$k\n";
}

exit 0

# 10  
Old 02-08-2008
cut -d'|' f3 file_name | sort -u
# 11  
Old 02-08-2008
Sort and Unique in Perl

Almost same as MatrixMadhan's. Row count was not included, so the following code is just for completeness:

Code:
#!/usr/bin/perl -w
use strict;

# Program to get unique values for 3rd column and print them

open(FILE, "b.txt");

my %list = ();

while(<FILE>){
   chomp;
   my @array = split(/\|/);
   $list{$array[2]}++;
}

close(FILE);

# Print out the results

foreach my $value (sort keys %list) {
   print "The unique values are $value\n";
}

print "Number of rows are ".keys(%list);

# 12  
Old 02-08-2008
Good examples already, this is just a more compact form of the same thing:

Code:
#!/usr/bin/perl
use warnings;
use strict;
unless ($ARGV[0]) {
    die "Usage: perl scriptname.pl filename";
}
my %list = ();
while(<>){
   $list{(split(/\|/))[2]}++;
}
print "$_ = $list{$_}\n" for (keys %list);
exit(0);

Uses perls optimized filehandling and no temp variables so should be fast and efficient.
# 13  
Old 02-08-2008
Thanks every one.
Thanks MatrixMadhan and MobileUser!!
Iam able to make use of the same code.
Thanks KevinADC, Just that, the count is comming for each entry of hash. I would like to have one count at the end. So that I can write the unique keys to a file and Count to a seperate header file.

Thanks again.
# 14  
Old 02-08-2008
Iam using lot of other packages in the script. So when I try to use it gives me an error.

PHP Code:
Global symbol "%list" requires explicit package name at b.pl line 19 
How to declare the hash variable in the script prior to declaration?
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Sort unique

Hi, I have an input file that I have sorted in a previous stage by $1 and $4. I now need something that will take the first record from each group of data based on the key being $1 Input file 1000AAA|"ZZZ"|"Date"|"1"|"Y"|"ABC"|""|AA 1000AAA|"ZZZ"|"Date"|"2"|"Y"|"ABC"|""|AA... (2 Replies)
Discussion started by: Ads89
2 Replies

2. Shell Programming and Scripting

Sort unique by multiple fields

i need to sort to get all the unique records based on the 1st and 2nd column, and keep the record with the highest value on 5th column if there are duplicates, every column with varies length a^2^x^y^z bxc^2xx2^aa^bvxxxx^cdd a^3^1^2^3 a^2^x^1^c I want a result which will only keep the 1st... (2 Replies)
Discussion started by: dtdt
2 Replies

3. UNIX for Dummies Questions & Answers

Print unique lines without sort or unique

I would like to print unique lines without sort or unique. Unfortunately the server I am working on does not have sort or unique. I have not been able to contact the administrator of the server to ask him to add it for several weeks. (7 Replies)
Discussion started by: cokedude
7 Replies

4. Shell Programming and Scripting

sort split merge -u unique

Hi, this is about sorting a very large file (like 10 gb) to keep lines with unique entries across SOME of the columns. The line originally looked like this: sort -u -k2,2 -k3,3n -k4,4n -k5,5n -k6,6n file_unsorted > file_sorted please note the -u flag. The problem is that this single... (4 Replies)
Discussion started by: jbr950
4 Replies

5. Shell Programming and Scripting

Unique sort with three fields

I have another file with three columns A,B,C as below 123,1,502 123,2,506 123,3,702 234,4,101 235,5,104 456,6,104 456,7,100 i want to sort such that i get a unique value in column A, and for those with multiple value in A, i want the lowest value in C. output should be Code:... (3 Replies)
Discussion started by: dealerso
3 Replies

6. Shell Programming and Scripting

Unique sort with two fields

I have a file with contents below 123,502 123,506 123,702 234,101 235,104 456,104 456,100 i want to sort such that i get a unique value in column A, and for those with multiple value in A, i want the lowest value in B. output should be 123,502 234,101 235,104 456,100 (3 Replies)
Discussion started by: dealerso
3 Replies

7. Shell Programming and Scripting

Awk sort and unique

Input file --------- 12:name1:|host1|host1|host2|host1 13:name2:|host1|host1|host2|host3 14:name3: ...... Required output --------------- 12:name1:host1(2)|host1(1) 13:name2:host1(2)|host2(1)|host3(1) 14:name3: where (x) - Count how many times field appears in last column ... (3 Replies)
Discussion started by: greycells
3 Replies

8. Shell Programming and Scripting

What some ideas with sort and get unique data

Hi all, I am writing a script where i can parse through the directory and get common string in two directories i get. The command below SUN_PLATFORM=`$FIND $STREAM_PATH . -depth -name ShareableEntities | $AWK -F"/" '{if($10 ~ /sun5/) print $0}'` gives the following output:- ... (1 Reply)
Discussion started by: asirohi
1 Replies

9. Shell Programming and Scripting

Perl sort unique by one field only

Hi all, I've searched the forum and I can find some code to sort uniquely in perl but not by a single field. I have a file with data such as the following: 1,test,34 1,test2,65 2,test,35, 1,test3,34 2,test,34 What i want to do is sort it uniqely by the first field only so I'd end... (2 Replies)
Discussion started by: Donkey25
2 Replies

10. Shell Programming and Scripting

unique sort contents of a variable

Hi , I have #echo $var1 #hdisk2 hdisk3 hdisk0 hdisk2 Now I need to remove duplicate entries from this . ie. after sorting it should only have hdisk2 hdisk3 hdisk0 . I can have these values in a array as well . I understand we can use sort -u to remove the duplicates in a... (2 Replies)
Discussion started by: praveenbvarrier
2 Replies
Login or Register to Ask a Question