Transforming 3 columns to matrix format


 
Thread Tools Search this Thread
Top Forums Programming Transforming 3 columns to matrix format
# 1  
Old 11-30-2012
Transforming 3 columns to matrix format

Dear All

I have a huge data for 3 columns similar to this

Code:
D2cls0		D2cls0		1
D2cls0		D2cls1		0.308
D2cls0		D2cls2		0.554
D2cls0		D2cls3		0.287
D2cls0		D2cls4		0.633
D2cls0		D2cls5		0.341
D2cls0		D2cls6		0.665
D2cls0		D2cls7		0.698
D2cls0		D2cls8		0.625
D2cls0		D2cls9		0.429

I want to transform into matrix like this.

Code:
	D2cls0	D2cls1	D2cls2	D2cls3	D2cls4	D2cls5	D2cls6	D2cls7	D2cls8	D2cls9
D2cls0	1	0.308	0.554	0.287	0.633	0.341	0.665	0.698	0.625	0.429

Kindly advice and Please help.

Many Thanks
Balaji
# 2  
Old 11-30-2012
What have your tried?

What commands have your tried?
have you looked at the paste command with a -s
# 3  
Old 11-30-2012
Dear Joeyg

I am familiar in awk or shell scripting.

could you please kindly help me

Many Thanks
Balaji
Quote:
Originally Posted by joeyg
What commands have your tried?
have you looked at the paste command with a -s
# 4  
Old 11-30-2012
what about something like:

Code:
$ cat sample30.txt | cut -f3 | tr "\n" "\t" > sample30x.txt
$ cat sample30.txt | cut -f5 | tr "\n" "\t" >> sample30x.txt

# 5  
Old 11-30-2012
try also:
Code:
awk '
{ a[NR]=$1; b[NR]=$2; c[NR]=$3; }
END {
  for (i=0;i<=NR;i++) printf ("%-10s",b[i]);
  print "";
  for (i=1;i<=1;i++ )  printf ("%-10s",a[i]);
  for (i=1;i<=NR;i++)  printf ("%-10s",c[i]);
  print "";
}
' input

# 6  
Old 11-30-2012
a bit more generic solution for multiple instances of field one in the input and where the sequence of field 2 doesn't matter.
Sample file - myFile:
Code:
D2cls0          D2cls0          1
D2cls0          D2cls1          0.308
D2cls0          D2cls2          0.554
D2cls0          D2cls3          0.287
D2cls0          D2cls4          0.633
D2cls0          D2cls5          0.341
D2cls0          D2cls6          0.665
D2cls0          D2cls7          0.698
D2cls0          D2cls8          0.625
D2cls0          D2cls9          0.429
D2cls1          D2cls0          1
D2cls1          D2cls1          0.308
D2cls1          D2cls2          0.554
D2cls1          D2cls3          0.287
D2cls1          D2cls4          0.633
D2cls1          D2cls5          0.341
D2cls1          D2cls6          0.665
D2cls1          D2cls7          0.698
D2cls1          D2cls8          0.625
D2cls1          D2cls9          0.429

awk -f bala.awk myFile where bala.awk is:
Code:
BEGIN {
   OFS="\t"
}
{
  row[$1,$2]=$3
  if (!($2 in f2)) { header=(header)?header OFS $2:$2;f2[$2]}
  if (col1[c]!=$1)
     col1[++c]=$1
}
END {
  printf("%*s%s\n", length(col1[1])+2, " ",header)
  ncol=split(header,colA,OFS)
  for(i=1;i<=c;i++) {
    printf("%s", col1[i])
    for(j=1;j<=ncol;j++)
      printf("%s%s%c", OFS, row[col1[i],colA[j]], (j==ncol)?ORS:"")
  }
}

produces:
Code:
        D2cls0  D2cls1  D2cls2  D2cls3  D2cls4  D2cls5  D2cls6  D2cls7  D2cls8  D2cls9
D2cls0  1       0.308   0.554   0.287   0.633   0.341   0.665   0.698   0.625   0.429
D2cls1  1       0.308   0.554   0.287   0.633   0.341   0.665   0.698   0.625   0.429

# 7  
Old 12-01-2012
This proposal does not depend on column 1 being sorted (as others above do), it will collect lines correctly into the matrix elements even if the key comes up again later with more values. It would not, of course, have the across columns sorted, then. Empty/missing elements are leaving a gap in the output:
Code:
$ awk  '{for (i=1; i<=LnCnt; i++) if ($1 == Ln[i]) break; if (i > LnCnt) Ln[++LnCnt]=$1}
        {for (j=1; j<=HdCnt; j++) if ($2 == Hd[j]) break; if (j > HdCnt) Hd[++HdCnt]=$2}
        {Mx[$1,$2] = $3}
        END {printf "%10s", ""; for (j=1; j<=HdCnt; j++)  printf "%10s", Hd[j]; printf "\n";
             for (i=1; i<=LnCnt; i++)  {printf "%10s", Ln[i];
                                        for (j=1; j<=HdCnt; j++) printf "%10s", Mx[Ln[i], Hd[j]];
                                        printf "\n"
                                       }
            }
       ' file
              D2cls0    D2cls1    D2cls2    D2cls8    D2cls9    D2cls3    D2cls4    D2cls5    D2cls6    D2cls7
    D2cls0         1     0.308     0.554     0.625     0.429     0.287     0.633               0.665     0.698
    D2cls2       0.1     1.308     1.554               1.429     1.287     1.633     1.341     1.665     1.698

This User Gave Thanks to RudiC For This Post:
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Transform columns to matrix

The following code transform the matrix to columns. Is it possible to do it other way around ( get the input from the output) ? input y1 y2 y3 y4 y5 x1 0.3 0.5 2.3 3.1 5.1 x2 1.2 4.1 3.5 1.7 1.2 x3 3.1 2.1 1.0 4.1 2.1 x4 5.0 4.0 6.0 7.0 1.1 output x1 y1 0.3 x2 y1 1.2 x3... (1 Reply)
Discussion started by: quincyjones
1 Replies

2. Shell Programming and Scripting

How to convert 2 columns into matrix -awk?

How can i convert two columns in to o and 1 matrix. thnks Input a c1 b c2 c c1 d c3 e c4 output c1 c2 c3 c4 a 1 0 0 0 b 0 1 0 0 c 1 0 0 0 d 0 0 ... (5 Replies)
Discussion started by: quincyjones
5 Replies

3. Shell Programming and Scripting

Square matrix to columns

Hello all, I am quite new in this but I need some help to keep going with my analysis. I am struggling with a short script to read a square matrix and convert it in two collumns. A B C D A 0.00 0.06 0.51 0.03 B 0.06 0.00 0.72 0.48 C 0.51 0.72 0.00 ... (7 Replies)
Discussion started by: EvaAM
7 Replies

4. Programming

Transforming data to other format

Dear All I would like to transform data from one format to another format. my Input: 0 0 1 0 1 0.308 0 2 0.554 0 3 0.287 output: Z (0,0)= 1 Z (0,1)=0.308 Z (0,2)=0.554 Z (0,3)=0.287 (2 Replies)
Discussion started by: bala06
2 Replies

5. Programming

Converting columns to matrix

Dear All I would like to convert columns to matrix For example my data looks like this D2 0 D2 0 1.0 D2 0 D2 1 0.308 D2 0 D2 2 0.554 D2 0 D2 3 0.287 D2 0 D2 4 0.633 D2 0 D2 5 0.341 D2 0 D2 6 0.665 D2 0 D2 7 0.698 D2 0 D2 8 0.625 D2 0 D2 9 0.429 D2 0 D2 10 0.698 D2 0 D2 11... (7 Replies)
Discussion started by: bala06
7 Replies

6. Shell Programming and Scripting

conversion: 3 columns into matrix

Hi guys, here https://www.unix.com/shell-programming-scripting/193043-3-column-csv-correlation-matrix-awk-perl.html I found awk script converting awk '{ OFS = ";" if (t) { if (l != $1) t = t OFS $1 } else t = OFS $1 x = x ? x OFS $NF : $NF l = $1 }... (2 Replies)
Discussion started by: grincz
2 Replies

7. Shell Programming and Scripting

Adding the individual columns of a matrix.

I have a huge matrix file containing some 1.5 million rows and 6000 columns. The matrix looks something like this: 1 2 3 4 5 6 7 8 9 3 4 5 I want to add all the numbers in the columns of this matrix and display the result to my stdout. This means that the numbers in the first column are: ... (2 Replies)
Discussion started by: shoaibjameel123
2 Replies

8. Shell Programming and Scripting

Select columns from a matrix given within a range in BASH

I have a huge matrix file which looks like this (example matrix): 1 2 3 5 4 5 6 7 7 6 8 9 1 2 4 2 7 6 5 1 3 2 1 9 As one can see, this matrix has 4 columns and 6 rows. But my original matrix has some 3 million rows and 6000 columns. For example, on this matrix I can define my task as... (2 Replies)
Discussion started by: shoaibjameel123
2 Replies

9. UNIX for Dummies Questions & Answers

convert matrix to row and columns

Dear Unix Gurus, I have a sample data set that looks like this y1 y2 y3 y4 y5 x1 0.3 0.5 2.3 3.1 5.1 x2 1.2 4.1 3.5 1.7 1.2 x3 3.1 2.1 1.0 4.1 2.1 x4 5.0 4.0 6.0 7.0 1.1 I want to open it up so that I get x1 y1 0.3 x2 y1 1.2 x3 y1 3.1 x4 y1 5.0 x1 y2 0.5 x2 y2... (3 Replies)
Discussion started by: tintin72
3 Replies

10. Shell Programming and Scripting

How to format or create a matrix report from file

Dear Unix champs, I have a input file as attached, i would like to create an report from the file as below FileType | EQUENS0001 | EQUENS0002 | EQUENS1100 | EQUENS0003 --------+-------------------------------------------------------- Msg No |... (3 Replies)
Discussion started by: manas_ranjan
3 Replies
Login or Register to Ask a Question