field separator in Perl


 
# 8  
Old 04-09-2009
Perl arrays start at index 0 (zero). A very brief example:

Code:
@array = (1, 2, 3, 4);
print $array[0];      # prints 1
$array[2] = 'foo';    # changes 3 to foo

# 9  
Old 04-10-2009
Say the input line looks like this:

A|aardvark|ant

Code:
( $letter, $long_nm, $short_nm ) = split( /\|/, $line );

works, as does:

Code:
( @a_junk ) = split( /\|/, $line );

print "1st element: $a_junk[0] \n";
print "2nd element: $a_junk[1] \n";
print "3rd element: $a_junk[2] \n";
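A short sketch of two details worth knowing about split, using the sample line above plus one "&"-separated row of the kind posted later in the thread. The variable names @default and @all are only for illustration.

```perl
use strict;
use warnings;

my $line = 'A|aardvark|ant';

# "|" means alternation in a regex, so it has to be escaped.
my ( $letter, $long_nm, $short_nm ) = split /\|/, $line;
print "$letter, $long_nm, $short_nm\n";    # A, aardvark, ant

# By default split drops trailing empty fields; a negative LIMIT keeps them.
my @default = split /&/, 'zzz&&';
my @all     = split /&/, 'zzz&&', -1;
print scalar(@default), " field(s) by default\n";       # 1 field(s) by default
print scalar(@all),     " field(s) with LIMIT -1\n";    # 3 field(s) with LIMIT -1
```

The trailing-field behaviour only matters if the empty columns are needed later; for sorting on the first column it makes no difference.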

# 10  
Old 04-10-2009
Thank you all. To tell the truth, I'm getting quite confused. I think I have to do some serious reading of the man pages, because I still cannot figure out how to make this split work with my posted script.
Say I have this file:
Code:
bbc&aaa&aaa
mmn&aaa&ccc
lmn&bbb&aaa
aaa&ccc&ddd
ššš&&
sss&&aaa
zzz&&
aaa&bbb&ccc
aaa&aaa&bbb
uuu&&
šas&&
saš&&
cab&&
uuū&&
ūuu&&
ūūū&&

Actually, I need the rows to stay unchanged, since they are rows of a database query, so I just need to sort by the first column.
If I change my script like this:
Code:
use strict;
use warnings;
open (_file_, "< path-to-file")  or  die "Failed to read file : $! ";
my @not_sorted = <_file_>; 
sub normalize {
   my $in = $_[0];
   $in = lc($in);
   $in =~ tr<aeiouū>
   <aeiouu>;
   $in =~ tr<abcdefghijklmnopqrsštuvwxyz>
   <\x01-\x1B>;
   return $in;
}
my @splitted = split(/&/, @not_sorted);
my @sorted =
sort {normalize($a) cmp normalize($b) or $a cmp $b}
 $splitted[0];
print @sorted;
close (_file_);

If I run it, I get this output:
Code:
16$

which is the count of the rows.
Please point me in the right direction to find the solution.
# 11  
Old 04-10-2009
If it's a database query, it will be easier to sort the rows as you do the query.
# 12  
Old 04-11-2009
Well, this is wrong:

Code:
my @splitted = split(/&/, @not_sorted);
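
A small sketch of why that line prints the row count: the second argument of split is a scalar expression, so an array placed there is evaluated in scalar context and yields its number of elements. The three sample rows are a stand-in for the file contents.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the lines read from the file.
my @not_sorted = ( "bbc&aaa&aaa\n", "mmn&aaa&ccc\n", "lmn&bbb&aaa\n" );

# The array is evaluated in scalar context, so split sees the string "3".
my @wrong = split /&/, @not_sorted;
print "wrong: @wrong\n";    # wrong: 3

# Splitting has to happen one line at a time.
for my $line (@not_sorted) {
    chomp( my $copy = $line );    # work on a copy, leave the row untouched
    my @fields = split /&/, $copy;
    print "first field: $fields[0]\n";
}
```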

Maybe this is what you are trying to do:

Code:
use strict;
use warnings;
open (_file_, "< path-to-file")  or  die "Failed to read file : $! ";
my @not_sorted = <_file_>; 
sub normalize {
   my $in = $_[0];
   $in = lc($in);
   $in =~ tr<aeiouū>
   <aeiouu>;
   $in =~ tr<abcdefghijklmnopqrsštuvwxyz>
   <\x01-\x1B>;
   return $in;
}
my @sorted = sort {normalize($a) cmp normalize($b) || $a cmp $b} @not_sorted;
print @sorted;
close (_file_);

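Note that I would use "||" rather than "or" in the sort routine. Inside a sort block both parse the same way, but "or" binds more loosely than assignment and "return", so it can surprise you if the expression is moved elsewhere.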
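
A quick check, as a sketch with made-up words, that the two operators give the same ordering when the comparison is the whole body of a sort block. The tie-breaker only runs when the case-insensitive comparison returns 0.

```perl
use strict;
use warnings;

# Hypothetical sample data.
my @words = qw(b A a B);

# Case-insensitive comparison first, plain comparison as tie-breaker.
my @with_or    = sort { lc($a) cmp lc($b) or $a cmp $b } @words;
my @with_pipes = sort { lc($a) cmp lc($b) || $a cmp $b } @words;

print "or: @with_or\n";       # or: A a B b
print "||: @with_pipes\n";    # ||: A a B b
```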
# 13  
Old 04-11-2009
Thank you again. For some reason, I didn't even think of custom sorting in my database (postgres); I'll check it out. But since I have to make some changes to the file when it comes out, I thought I would do both together, and it is a good way to start learning Perl. In fact, in the database the š is written as s'.

But, KevinADC, the sorting worked just fine even with "or". What I need is for Perl to compare only the text before the first "&"; otherwise I don't get good results, because the typical line is:
Code:
alibumbi&\begin{CJK}{UTF8}{}\begin{SChinese}呈递,授与\end{SChinese}\end{CJK}&bithe alibuha&\begin{CJK}{UTF8}{}\begin{SChinese}呈文(下对上)\end{SChinese}\end{CJK}&56, 64\\

Or should I resort to awk? But I could not find help on custom sorting in awk.

# 14  
Old 04-11-2009
Solved

I've finally got the thing to work. Thank you all.
Here's the code.
Code:
use strict;
use warnings;
open (_file_, "< path-to-file")  or  die "Failed to read file : $! ";
my @not_sorted = <_file_>;
sub normalize {
   my $in = $_[0];
   $in = lc($in);
   $in =~ tr<aeiouū>
   <aeiouu>;
   $in =~ tr<abcdefghijklmnopqrsštuvwxyz>
   <\x01-\x1B>;
   return $in;
}
my @sorted = map {$_->[0]}
        sort{ normalize($a->[1]) cmp normalize($b->[1]) or $a->[1] cmp $b->[1]}
        map {chomp;[$_,split(/&/)]} @not_sorted;
print "$_\n" for @sorted;
close (_file_);
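
The map/sort/map chain above is the pattern usually called a Schwartzian transform: compute the sort key once per row, sort on the cached key, then throw the key away. Below is a self-contained sketch of the same idea, not the script above. It assumes the source file is saved as UTF-8 (hence "use utf8"), uses a handful of inline rows instead of reading a file, and the name sort_key is made up for illustration.

```perl
use strict;
use warnings;
use utf8;                               # the source contains literal š and ū
binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical sample rows; the real script reads them from a file.
my @rows = ( 'šas&&', 'sss&&aaa', 'cab&&', 'ūuu&&', 'uuu&&', 'saš&&' );

# Build a key in which š sorts between s and t and ū sorts like u.
sub sort_key {
    my ($text) = @_;
    my $key = lc $text;
    $key =~ tr/ū/u/;
    $key =~ tr/abcdefghijklmnopqrsštuvwxyz/\x01-\x1B/;
    return $key;
}

my @sorted =
    map  { $_->[1] }                                        # 3. keep only the row
    sort { $a->[0] cmp $b->[0] or $a->[1] cmp $b->[1] }     # 2. compare cached keys
    map  { [ sort_key( ( split /&/, $_, 2 )[0] ), $_ ] }    # 1. key = first field
    @rows;

print "$_\n" for @sorted;
```

Passing a LIMIT of 2 to split stops it after the first "&", which is enough when only the first column is needed. Unlike the script above, the tie-breaker here compares whole rows rather than first fields.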

I did try this before, but it would not work because I was missing the parentheses here:
Code:
normalize($a->[1])
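
A sketch of why the parentheses matter, using a made-up helper called shout in place of normalize. A user-defined sub called without parentheses acts as a list operator and swallows everything to its right, including the "cmp".

```perl
use strict;
use warnings;

# Hypothetical helper, declared before use so it can be called without parentheses.
sub shout { return uc $_[0] }

# Parsed as shout( 'a' cmp shout('B') ): the comparison becomes the argument.
my $without = shout 'a' cmp shout 'B';

# With parentheses each call gets exactly one argument.
my $with = shout('a') cmp shout('B');

print "without: $without\n";    # without: 1
print "with:    $with\n";       # with:    -1
```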

This is the test file:
Code:
bbc&aaa&aaa
mmn&aaa&ccc
lmn&bbb&aaa
aaa&ccc&ddd
ššš&&
sss&&aaa
zzz&&
aaa&bbb&ccc
aaa&aaa&bbb
uuu&&
šas&&
saš&&
cab&&
uuū&&
ūuu&&
ūūū&&

This is the output of the script:
Code:
aaa&ccc&ddd
aaa&bbb&ccc
aaa&aaa&bbb
bbc&aaa&aaa
cab&&
lmn&bbb&aaa
mmn&aaa&ccc
saš&&
sss&&aaa
šas&&
ššš&&
uuu&&
uuū&&
ūuu&&
ūūū&&
zzz&&

I'm a happy man.