correlation coefficient - Awk


 
# 1  
Old 11-23-2010

Hi guys

I have an input file with multiple columns and rows.
Is it possible to calculate the correlation of one value of one row (for example, x at S1 = 112) with all the other values? For example, starting with x = 112 against a = 3 at S1 would give x-a = 0.2.

INPUT
*******

Code:
No  S1   S2  S3  S4  Sn
a   3    4   45  34  23
x   112  0   12  23  0
b   0    1   23  12  1
n   0    1   0   1   1

OUTPUT1: x-S1 (112) with all
***********
Code:
No  S1  S2  S3  S4  Sn
x-a  0.2   0.3 ...............
x-x  1     1  ................
x-b  0..........................
x-n  0.9 .......................

OUTPUT2: x-S2 (0) with all, and so on .............

---------- Post updated at 03:09 AM ---------- Previous update was at 02:17 AM ----------

A clearer explanation:

In the first output:
x-S1, i.e. 112, against all combinations, e.g.:

Code:
112 vs 3 [x-S1 vs a-S1]
112 vs 4 [x-S1 vs a-S2]
112 vs 45
112 vs 34
112 vs 23
112 vs 112
…………….
……………..
112 vs 1

In the second output, x-S2, i.e. 0, against all ……………………. e.g.:
Code:
0 vs 3
0 vs 4
………
………
0 vs 1

In the next output:
x-S3, i.e. 12, against all …………… and so on.

With the given input I should get 5 outputs, one per sample (S1, S2, S3, S4, Sn).
If there are more samples, say 69, then I should get 69 outputs, or everything in one output if possible.
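
The pairing described above can be sketched in awk as follows. This is only an illustration of the enumeration, not a correlation: it lists one value of the reference row against every value in the table. The function name `list_pairs`, its argument order, and the file name `input.txt` are assumptions made for the example.

```shell
# Sample input copied from the first post (header row plus four data rows).
cat > input.txt <<'EOF'
No  S1   S2  S3  S4  Sn
a   3    4   45  34  23
x   112  0   12  23  0
b   0    1   23  12  1
n   0    1   0   1   1
EOF

# list_pairs ROW COLUMN FILE (hypothetical helper):
# print the value at ROW/COLUMN against every value in FILE.
list_pairs() {
  awk -v ref="$1" -v col="$2" '
    NR == 1 { for (j = 2; j <= NF; j++) head[j] = $j; next }   # remember sample names
    NF == 0 { next }                                           # ignore blank lines
    {
      n++
      name[n] = $1
      nf[n] = NF
      for (j = 2; j <= NF; j++) val[n, j] = $j
      if ($1 == ref) r0 = n                                    # index of the reference row
    }
    END {
      c = 0
      for (j in head) if (head[j] == col) c = j                # field number of the chosen sample
      if (!r0 || !c) { print "row or column not found"; exit 1 }
      for (i = 1; i <= n; i++)
        for (j = 2; j <= nf[i]; j++)
          printf "%s vs %s [%s-%s vs %s-%s]\n", val[r0, c], val[i, j], ref, col, name[i], head[j]
    }
  ' "$3"
}

list_pairs x S1 input.txt
```

Calling `list_pairs x S2 input.txt`, `list_pairs x S3 input.txt`, and so on would give the further outputs, one per sample.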

Please let me know if it is still confusing.

Thanks

Last edited by quincyjones; 11-23-2010 at 03:31 AM.
# 2  
Old 11-23-2010
So how do you get the number? How does "112 vs 3" give 0.2?
# 3  
Old 11-23-2010
That was just an example.
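
A correlation coefficient cannot be computed from a single pair of numbers such as 112 vs 3; it needs two series of values. One possible reading of the request, and it is only an assumption, is to treat each row as a series across the samples and compute the Pearson correlation of row x with every row. A minimal awk sketch under that assumption follows; the function name `row_corr` and the file name `input.txt` are made up for the example.

```shell
# Sample input copied from the first post (header row plus four data rows).
cat > input.txt <<'EOF'
No  S1   S2  S3  S4  Sn
a   3    4   45  34  23
x   112  0   12  23  0
b   0    1   23  12  1
n   0    1   0   1   1
EOF

# row_corr REF FILE (hypothetical helper):
# Pearson r between row REF and every row in FILE, one result per line.
row_corr() {
  awk -v ref="$1" '
    NR == 1 { next }                      # skip the header line
    NF == 0 { next }                      # ignore blank lines
    {
      rows++
      name[rows] = $1
      ncol = NF - 1                       # assumes every row has the same number of samples
      for (j = 2; j <= NF; j++) val[rows, j - 1] = $j
      if ($1 == ref) r0 = rows            # remember which row is the reference
    }
    END {
      if (!r0) { print "reference row not found: " ref; exit 1 }
      for (i = 1; i <= rows; i++) {
        sx = sy = sxx = syy = sxy = 0
        for (j = 1; j <= ncol; j++) {
          x = val[r0, j]; y = val[i, j]
          sx += x; sy += y; sxx += x * x; syy += y * y; sxy += x * y
        }
        # r = (n*Sxy - Sx*Sy) / sqrt((n*Sxx - Sx^2) * (n*Syy - Sy^2))
        den = (ncol * sxx - sx * sx) * (ncol * syy - sy * sy)
        if (den > 0) printf "%s-%s %.4f\n", ref, name[i], (ncol * sxy - sx * sy) / sqrt(den)
        else         printf "%s-%s NA\n", ref, name[i]   # constant row: r is undefined
      }
    }
  ' "$2"
}

row_corr x input.txt
```

The script keeps the whole table in memory, which is fine for small inputs like the sample. A row whose values are all identical has zero variance, so the sketch prints NA for it instead of dividing by zero.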