Sort array elements from same field


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers Sort array elements from same field
# 1  
Old 06-08-2013
Sort array elements from same field

Hi,

input:
Code:
line1|error_type_a@15
line1|error_type_c@10
line1|error_type_b@5
line2|error_type_f@3
line2|error_type_a@1

I would need to place all the second fields with common first field on the same line, BUT with sorted error position number:
Code:
line1|error_type_b@5; error_type_c@10; error_type_a@15
line2|error_type_a@1; error_type_f@3

I managed to put in the same line...
Code:
BEGIN{FS="|"}
{
    a[$1] = a[$1] (a[$1]?"; ":"")$2
}
    
END{for(i in a){ print i FS a[i]}}'

...but the elements in the new second field are not sorted:
Code:
line1|error_type_a@15; error_type_c@10; error_type_b@5
line2|error_type_f@3; error_type_a@1

And when I incorporate the asort function like as follow, it returns a blank second field:
Code:
BEGIN{FS=OFS="|"}
{
    a[$1] = a[$1] (a[$1]?"; ":"")$2
}
    
END{for(i in a){
          split(a[i],b,"@");
          n = asort(b); 
          for (j=1; j<=n; j++){
              print i FS a[b[j]]
          }
      }
}'

Any explanation would be great !

Thanks in advance !
# 2  
Old 06-08-2013
Warning: The awk that I use does not provide the asort() function; so all of my comments are just based on reading the man pages and visually inspecting your code; I was not able to test your code to verify my hypothesis.

I believe there are two problems with your program:

First, in your END clause, when you call split(a[i],b,"@") with the array a[] containing the elements:
Code:
a[line1] = line1|error_type_b@5; error_type_c@10; error_type_a@15
a[line2] = line2|error_type_a@1; error_type_f@3

the array b[] will either contain the elements:
Code:
line1|error_type_b
5; error_type_c
10; error_type_a
15

or the elements:
Code:
line2|error_type_a
1; error_type_f
3

I'm guessing that this is not what you wanted.

And, second, after you run asort(), the elements in b[] will be changed to a list of numbers like (1, 3, 4, 2) or (1, 2, 3), respectively. Then the for loop in the END clause is going to print:
Code:
line1 FS contents of a[b[1]]
line1 FS contents of a[b[2]]
line1 FS contents of a[b[3]]
line1 FS contents of a[b[4]]
line2 FS contents of a[b[1]]
line2 FS contents of a[b[2]]
line2 FS contents of a[b[3]]

Obviously, the FS in the above lines will be replaced by | and the standards leave it is unspecified whether the "line1" lines will come before or after the "line2" lines. But, since the only defined subscripts in the a[] array are "line1" and "line2", the a[b[j]] (AKA a[1], a[2], a[3], and sometimes a[4]) will all expand to empty strings and, therefore, the contents of a[digit] in all of the output lines will be empty.

I think this explains what went wrong, but your description didn't clearly explain what output you were trying to get, so I can't suggest a way to fix it. As a (maybe not too wild) guess, I would think you might want to replace:
Code:
    a[$1] = a[$1] (a[$1]?"; ":"")$2

with something like:
Code:
    c[a[$1]] = $2

and then sort c[a[i]] or some manipulation of it in the END clause instead of trying to split elements of a[] and sort the split elements.

Hope this helps.
# 3  
Old 06-08-2013
I think you are doing this overly complicated. In fact this is one of the numerous applications of executing a "control break". You might want to follow the provided link.

I hope this helps.

bakunin
# 4  
Old 06-09-2013
Code:
sort -t'|' -k1,1 -k2.14n inputfile

Numerical sort on the 2nd key field with offset 14.
Pipe this to your working awk.
# 5  
Old 06-09-2013
Quote:
Originally Posted by MadeInGermany
Code:
sort -t'|' -k1,1 -k2.14n inputfile

Numerical sort on the 2nd key field with offset 14.
Pipe this to your working awk.
Hi MadeInGermany,
Your suggestion may be exactly what the OP needs.

When I read the 1st message in this thread, I was under the impression that the "error_type_X" strings were placeholders for arbitrary (not necessarily all the same length) strings followed by "@" and a numeric string. If my assumption is correct, there will have to be some data manipulation before the input can be sorted because the UNIX sort command doesn't allow for a different field separator to be specified for each sort key because the position of the "@" in field 2 won't be a constant. However, if the 1st field is always 5 characters (which I also assumed was not true), then the command:
Code:
sort -t'@' -k2,2n -k1.1,1.5 inputfile

might work even if the "error_type_X" strings are not a constant width.

I think we at least explained why beca123456's use of awk's asort() function is producing the output it produces.

Hopefully, the discussion in this thread so far will either help the OP get a working solution or encourage the OP to give us a better description of the actual data that might appear in the input file so we can work on a real solution instead of guessing at what needs to be done.
# 6  
Old 06-10-2013
Good idea! But order should be
Code:
sort -t'@' -k1.1,1.5 -k2,2n inputfile

Sort on the first key field until the '|' separator, and secondary numerical sort on the 2nd key field.
This User Gave Thanks to MadeInGermany For This Post:
 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Help reading the array and sum of the array elements

Hi All, need help with reading the array and sum of the array elements. given an array of integers of size N . You need to print the sum of the elements in the array, keeping in mind that some of those integers may be quite large. Input Format The first line of the input consists of an... (1 Reply)
Discussion started by: nishantrefound
1 Replies

2. Shell Programming and Scripting

Random elements from array

Hi I wanted to print random elements from an array at bash shell I use the following code, but I always see first element getting printed #!/bin/bash c=1 expressions=(pink red white yellow purple) while ]; do echo "The value of RANDOM is $RANDOM" selectedexpression=${expressions}]};... (5 Replies)
Discussion started by: Priya Amaresh
5 Replies

3. Shell Programming and Scripting

Grouping array elements - possible?

I have a script which takes backup of some configuration files on my server. It does that by using an array which contains the complete path to the files to backup. It copys the files to a pre defined dir. Each "program" has it's own folder, ex. apache.conf is being copied to /predefined... (7 Replies)
Discussion started by: dnn
7 Replies

4. Shell Programming and Scripting

Removing elements from an array

Hi I have two arrays : @arcb= (450,625,720,645); @arca=(625,645); I need to remove the elements of @arca from elements of @arcb so that the content of @arcb will be (450,720). Can anyone sugget me how to perform this operation? The code I have used is this : my @arcb=... (3 Replies)
Discussion started by: rkrish
3 Replies

5. UNIX for Dummies Questions & Answers

printing array elements

Is there a way to print multiple array elements without iterating through the array using bash? Can you do something like... echo ${array}and get all those separate elements from the array? (2 Replies)
Discussion started by: jrymer
2 Replies

6. UNIX for Dummies Questions & Answers

Help with replacing Array elements

Hi, I have an array containing following sample information @array = qw (chr02 chr02 chr02 chr02 chr02 chr03 chr03 chr04 chr04 chr05 chr05 chr05 chr07 chr07) I need to replace all duplicate entries by an underscore to get the following output@array = qw (chr02 _ _ _ _ chr03 _ chr04 _ chr05 _ _... (4 Replies)
Discussion started by: pawannoel
4 Replies

7. Shell Programming and Scripting

perl sort array by field

Hi Everyone, Any simple code can simplify the code below, please advice. Thanks # cat 2.pl #!/usr/bin/perl use warnings; use strict; my @aaaaa = <DATA>; my @uids; foreach (@aaaaa) { my @ccccc = split (",", $_); push @uids, $ccccc;... (3 Replies)
Discussion started by: jimmy_y
3 Replies

8. Shell Programming and Scripting

Array with String Elements

How can I get my array to understand the double-quotes I'm passing into it are to separate text strings and not part of an element? here's what I'm working with... db2 -v connect to foo db2 -x "select '\"' || stats_command || '\",' from db2law1.parallel_runstats where tabname = 'BAZ'" set... (4 Replies)
Discussion started by: djschmitt
4 Replies

9. Shell Programming and Scripting

Accessing array elements

Hi, My doubt is how to access array elements.. Situation is as below: #!/bin/ksh set -x typeset -i x=0 typeset -i y=0 typeset -i BID=0 typeset -i count=0 while ] ; do x=`expr $x + 1`; hwmgr show scsi > scsi.tmp while read line; do set... (1 Reply)
Discussion started by: mansa
1 Replies

10. UNIX for Dummies Questions & Answers

Deleting Array Elements

Hi, I am writing a BASH shell script. I have an array that will contain IN ANY ORDER the following elements: DAY 8D MO NS. I would like to erase the element DAY, but since the order of the elements in the array are random, I will not know which element # DAY is (ie it's not as simple as... (3 Replies)
Discussion started by: msb65
3 Replies
Login or Register to Ask a Question