Multidimensional array:awk error


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Multidimensional array:awk error
# 1  
Old 12-06-2017
Multidimensional array:awk error

Code:
 awk -F'\t' -v OFS='\t' '

 { if($2 in arr) {

         #print "Sahi", NR,arr[$2]
         for(k=2;k<=NF;k++){
#            sum[arr[$2]][k]+=$2
         }

 }
 else {
     arr[$2]=NR
     #print "awk",NR
     for (k=3;k<=NF ; k++){
         sum[NR][k]=$k
     }
 }


 }

 #-------------------
# END{
#        for (k in arr){
#            printf k"\t"
#            for(z=3;z<=NF;z++){
#                printf sum[arr[k]][z]"\t"
#            }
#            print "\n"
#        }
 #    }
#--------------------------

 ' < header.txt

Error:
awk: line 15: syntax error at or near [

I want to sum columns if there's duplicate of row in column 2.


Input format:

Code:
ASPM    ASPM    6.667   6.482
LOC400861       LOC400861       6.669   6.647
ASPM    ASPM    0.001   0.002

Outout:
Code:
ASPM    ASPM    6.668   6.484
LOC400861       LOC400861       6.669   6.647

Summed columns for ASPM

For the time being I've excluded header from input file. In future I'd like to process file as is and tweak awk code for it. Smilie

But for now I'd like to fix error for multi-D array.

---------- Post updated at 05:39 PM ---------- Previous update was at 05:17 PM ----------

I fixed for error but I get the line printed twice:

Code:
awk -F'\t' -v OFS='\t' '

 { if($2 in arr) {
         for(k=2;k<=NF;k++){
             sum[arr[$2],k]= $k + sum[arr[$2],k]
         }

 }
 else {
     arr[$2]=NR
     for (k=3;k<=NF;k++){
         sum[NR,k] = $k;
     }
 }
 }

 #-------------------
 END{
         for (k in arr){
             printf k"\t"
             for(z=3;z<=NF;z++){
                 printf sum[arr[k],z]"\t"
             }
             printf"\n"
         }
    }
#--------------------------

 ' < header.txt

output:
Moderator's Comments:
Mod Comment Please use CODE tags (not QUOTE tags) when displaying sample input and output as well as when displaying code segments.

Last edited by Don Cragun; 12-07-2017 at 01:10 AM.. Reason: Change QUOTE tags to CODE tags.
# 2  
Old 12-06-2017
I'm not sure your field delimiter is space or tab from your post, which can be fixed easily. I assumed it is space for my try:
Code:
awk -v OFS="\t"  '{A[$2]+=$3; B[$2]+=$4}END{for( i in A) print i, i, A[i], B[i]}'  < head.txt

Output:
Code:
LOC400861    LOC400861    6.669    6.647
ASPM    ASPM    6.668    6.484

# 3  
Old 12-06-2017
Thanks for your reply.
For illustration I had put 4 columns. In file I've 366 columns hence I used multi-dimensional array.
# 4  
Old 12-07-2017
If you don't show us the real format of your input file(s) and show us the actual output you're trying to produce from that input, it is hard for us to guess at what might work for whatever you are trying to do.

Without knowing what operating system you're using, we can't know what extended features available in some versions of awk might be available for use in your script.

What do you expect to get when you calculate the sum ASPM + ASPM?

Is the data in all of your fields starting with field #3 through field #366 in floating point format with three digits after the decimal point? If not, what is the general format and what format should be used for the output?
# 5  
Old 12-07-2017
Notwithstanding Don Cragun's worthwhile questions, I don't think extended features necessary for this problem. Try (output formatting excluded here):
Code:
awk -F'\t' -v OFS='\t' '
        {arr[$2] = $1
         for (k=3; k<=NF; k++)  sum[arr[$2],k] += $k
        }
END     {for (k in arr) {printf "%s%s%s%s", arr[k], OFS, k, OFS
                         for (z=3; z<=NF; z++)  printf "%s%s", sum[arr[k],z], (z==NF)?ORS:OFS
                        }
        }
' file
ASPM    ASPM    6.668    6.484
LOC400861    LOC400861    6.669    6.647

# 6  
Old 12-07-2017
Quote:
Originally Posted by RudiC
Notwithstanding Don Cragun's worthwhile questions, I don't think extended features necessary for this problem. Try (output formatting excluded here):
Code:
awk -F'\t' -v OFS='\t' '
        {arr[$2] = $1
         for (k=3; k<=NF; k++)  sum[arr[$2],k] += $k
        }
END     {for (k in arr) {printf "%s%s%s%s", arr[k], OFS, k, OFS
                         for (z=3; z<=NF; z++)  printf "%s%s", sum[arr[k],z], (z==NF)?ORS:OFS
                        }
        }
' file
ASPM    ASPM    6.668    6.484
LOC400861    LOC400861    6.669    6.647

Thanks for the reply.
Do you mind explaining:

Code:
(z==NF)?ORS:OFS

# 7  
Old 12-07-2017
man awk:
Quote:
5. Expressions and operators
The expression syntax is similar to C. . . .
conditional ? :
The logical expression in front of the ? is evaluated; if TRUE, the value of the expression between that and the : is the result to be used / assigned, if FALSE, the trailing one. Available in many other languages and shells as well.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Sort multidimensional Array

Hello I have a problem. I create a Multidimensional Array Like this: ENTRY="$kunnum-$host" ENTRY="$host" ENTRY="# $3" for key in "${!ENTRY}"; do ENTRIES=${ENTRY} # INDEX=IP(5) donedeclare -p declare -A ENTRIES=(="unas15533" ="unas" ="# RDP-Terminal 2"... (12 Replies)
Discussion started by: Marti95
12 Replies

2. Shell Programming and Scripting

Multidimensional array

I am learning about bash system variables, such as $ , @ and #. I have this piece of script implementing an array and it is doing its job just fine. This is not the only array I will be using. Just for ease of maintenance and more coding I would like to have the arrays in two dimensional... (4 Replies)
Discussion started by: annacreek
4 Replies

3. Shell Programming and Scripting

How to deal with multidimensional array in awk?

Hi all! I would like to know how to print $0 when using multidimensional array like below time being I am using for loop to print columns like this awk 'FNR==1{i++} {for(k=1;k<=NF;k++)A=$k} END{for(j=1;j<=25;j++) print A,A,A,A,A,A,A,A,A,A,A,A,A,A}' file1 file2 so here my problem is I... (5 Replies)
Discussion started by: Akshay Hegde
5 Replies

4. Shell Programming and Scripting

gawk - How to loop through multidimensional array?

I have an awk script that I am writing and I needed to make use of a multidimensional array to hold some data... Which is all fine but I need to loop through that array now and I have no idea how to do that. for a regular array, the following works: ARRAY for(var in ARRAY) { ... } ... (5 Replies)
Discussion started by: trey85stang
5 Replies

5. Shell Programming and Scripting

multidimensional array in awk

Hi, I was trying to process a file with the help of awk. I want to first display all the rows that contains 01 and at the end of processing I have to print some portion of all the lines. like below. Output expected: (2 Replies)
Discussion started by: ahmedwaseem2000
2 Replies

6. Programming

multidimensional array using c++ vector

Hi! I need to make dynamic multidimensional arrays using the vector class. I found in this page How to dynamically create a two dimensional array? - Microsoft: Visual C++ FAQ - Tek-Tips the way to do it in 2D, and now i'm trying to expand it to 3D but i don't understand how is the operator working,... (0 Replies)
Discussion started by: carl.alv
0 Replies

7. Programming

C programming working with multidimensional array

Hi, I have the following variable declaration which looks like a 3d array or N matrixs KxK of floats float (*table); I have to pass to a function only the first table. How can I do it?? Thanks (6 Replies)
Discussion started by: littleboyblu
6 Replies

8. Shell Programming and Scripting

AWK multidimensional array

In a single dim. awk array, we can use : <index> in <array name> to determine whether a particualar index exists in the array or not. Is there a way to achieve this in a awk multi dim. array ? (4 Replies)
Discussion started by: sinpeak
4 Replies

9. Shell Programming and Scripting

Awk multidimensional Array

Hello Experts,, Can anybody give me a brief idea what is following bold letter statement is for!! what is the term called so that I can google for it.. It seems to be an array inside another array.. awk' /TXADDR/ { txaddr=$NF } ##understood /TXDATA/ { txdata]=$NF... (1 Reply)
Discussion started by: user_prady
1 Replies

10. Shell Programming and Scripting

multidimensional array in perl

i'm trying to open a file with three or more columns and an undetermined, but finite number of rows. I want to define an array for each row with each element of the row as a sub array. The columns are separated by tabs or spaces. Here's the file: 12x3.12z34b.342sd3.sds 454.23.23.232 ... (9 Replies)
Discussion started by: prkfriryce
9 Replies
Login or Register to Ask a Question