how to calculate all pairwise distances in two dimensions and transform them into a matrix
Hello to all,
I am very new in the shell scripting and I need help. I have data for several individuals in several rows followed by a tag and by 5 values per row, with the name of the individual in the first column, e.g.:
I would like to calculate the sum of the absolute values of the pairwise differences between individuals. For instance:
Additionally, if the tags of two individuals are different, I would like to sum 100 to the final number. So finally the distance between:
IND1 and IND2 would be = 5 + 100 = 105
IND1 and IND3 would be = 6 + 0 = 6
IND2 and IND3 would be = 9 + 100 = 109
After this, I would like to transform this list of distances into a matrix:
Could some one help me with this? Thanks a lot in advance! Best!
File 1 contains the list of words that needed to be randomly paired:
Tiger
Cat
Fish
Frog
Dog
Mouse
Elephant
Monkey
File 2 contains the pairs that should not be used (in any solution) during random pairing.
Elephant-Dog
Cat-Fish
Monkey-Frog
Dog-Elephant, Fish-Cat, Frog-Monkey... (1 Reply)
The following code transform the matrix to columns. Is it possible to do it other way around ( get the input from the output) ?
input
y1 y2 y3 y4 y5
x1 0.3 0.5 2.3 3.1 5.1
x2 1.2 4.1 3.5 1.7 1.2
x3 3.1 2.1 1.0 4.1 2.1
x4 5.0 4.0 6.0 7.0 1.1
output
x1 y1 0.3
x2 y1 1.2
x3... (1 Reply)
This matrix represents correlation values.
Is it possible to calculate the percentage of columns (a1, a2, a3) that have a value >= |0.5| and report the percentage that has positive correlation >0.5 and negative correlation <-0.5 separately. thanx in advance!
input
name a1 a2 a3... (5 Replies)
I have to remove sequences from a file based on the distance value. I am attaching the file containing the distances (Distance.xls)
The second file looks something like this:
Sequences.txt
>Sample1 Freq 59
ggatatgatgatgaactggt
>Sample1 Freq 54
ggatatgatgttgaactggt
>Sample1 Freq 44... (2 Replies)
Hi Friends,
I have an input matrix file like this
Col1 Col2 Col3 Col4
R1 1 2 3 4
R2 4 5 6 7
R3 5 6 7 8
I would like to consider only the numeric values without touching the column header and the row header.
I looked up on the forum's search, and I found this. But, I donno how to... (3 Replies)
I have a large numbers of files containing data that look like this:
(ID31:0.01682,(ID-123:0.00000,(ID_24:0.00000,ID&890:0.00000):0.00000):0.00000,ID12876:0.00000);
(ID_24:-0.00052,(ID31:0.01697,(ID-123:-0.00059,ID&890:0.03528):0.00037):0.00027,ID12876:0.03484);
I need to find ":" anywhere... (6 Replies)
Hi,
We are using dimensions 10 (source code control system) for our programs. Some programs contain special characters like ‘$' , ‘#' , ‘ , ‘ etc..
During the check-out process of an item , a unix shell script will be called to process the item. If the item contains a ‘$' character, it will... (0 Replies)