Sponsored Content
Top Forums Shell Programming and Scripting How many studies have unequal values for each pair? Post 302978352 by RavinderSingh13 on Friday 29th of July 2016 10:05:55 AM
Old 07-29-2016
Quote:
Originally Posted by senhia83
You are right, that will pair will be 2. Thanks a lot..I need to understand your code now Smilie
Hello senhia83,

First of all thank you for asking good question and showing us what you have done to solve that too, keep it up.
Coming to your question, could you please try following.

One Liner form of solution:
Code:
awk 'BEGIN{print "Study" OFS "Point" OFS "Value"} NR>1{C[$2];E[$2 OFS 1+C[$2]++]=$3;sub(/[[:alpha:]]/,X,$2);Q=Q>$2+0?Q:$2+0} END{;for(i in C){D[++j]=i};for(i=1;i<=length(D);i++){for(k=i+1;k<=length(D);k++){for(q=1;q<=Q;q++){if(E[D[i] OFS q] != E[D[k] OFS q] && E[D[i] OFS q] && E[D[k] OFS q]){R[D[i] OFS D[k]]++;}}}};for(u in R){print u OFS R[u]}}' OFS="\t"   Input_file

Non-one liner form of solution:
Code:
awk 'BEGIN{
           print "Study" OFS "Point" OFS "Value"
          } 
     NR>1 {
           C[$2];
           E[$2 OFS 1+C[$2]++]=$3;
           sub(/[[:alpha:]]/,X,$2);
           Q=Q>$2+0?Q:$2+0
          } 
     END  {
           for(i in C){
                       D[++j]=i
                      };
           for(i=1;i<=length(D);i++){
                                     for(k=i+1;k<=length(D);k++){
                                                                 for(q=1;q<=Q;q++){
                                                                                   if(E[D[i] OFS q] != E[D[k] OFS q] && E[D[i] OFS q] && E[D[k] OFS q]){
                                                                                                                                                        R[D[i] OFS D[k]]++;
                                                                                                                                                                                  }
                                                                                  }
                                                                }
                                    };
           for(u in R){
                       print u OFS R[u]
                      }
           }
    ' OFS="\t"   Input_file

NOTE: Considering that field 2 always will have the number(digit) in it, as per your Input_file shown. Also tested this with GNU awk.

Thanks,
R. Singh

Last edited by RavinderSingh13; 07-29-2016 at 11:19 AM.. Reason: Changed the non-one liner form solution's spaces and fit them to good Looking one :)
 

7 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Splitting a file into unequal parts

How do I split a file into many parts but with different amounts of lines per part? I looked at the split command but that only splits evenly. I'd like a range specified to determine how many lines each output file should have. For example, if the input file has 1000 lines and the range is... (1 Reply)
Discussion started by: revax
1 Replies

2. UNIX for Dummies Questions & Answers

Merge two files with common IDs but unequal number of rows

Hi, I have two files that I would like to merge and think that there should be a solution using awk. The files look something like this: file 1 IDX1 IDY1 IDX2 IDY2 IDX3 IDY3 file 2 IDY1 dataA data1 IDY2 dataB data2 IDY3 dataC data3 Desired output IDX1 IDY1 dataA data1 IDX2 ... (5 Replies)
Discussion started by: katie8856
5 Replies

3. Shell Programming and Scripting

Newline between unequal record fields

Assume the following 5 records (field separator is a space): 0903 0903 0910 0910 0910 0910 0910 0910 0917 0917 0917 0917 0924 1001 1001 1001 1001 1008 1008 1008 1008 1015 1015 1015 1015 1022 1029 1029 1029 1029 1105 1105 1105 1105 1112 1112 1112 1112 1119 1126 1126 1126 1126 1203 1203 1203 1203... (8 Replies)
Discussion started by: tree
8 Replies

4. Shell Programming and Scripting

Pair wise comparisons

Hi, I have 25 groups and I need to perform all possible pairwise compariosns between them using the formula n(n-1)/2. SO in my case it will be 25(25-1)/2 which is equal to 300 comparisons. my 25 groups are FG1 FG2 FG3 FG4 FG5 NT5E CD44 CD44 CD44 AXL ADAM19 CCDC80 L1CAM L1CAM CD44... (1 Reply)
Discussion started by: Diya123
1 Replies

5. Shell Programming and Scripting

Compare two unsorted unequal files extracted from xml

I have two files for comparison which are extracts from set of xml files. file1 has: Comparing File: BRCSH1to320140224CC3.xml :: TZZZ:BR :: TAZZ:OUT UIZZ:0 :: ERAZ:1.000000 UIZZ:0 :: CTZZ:B UIZZ:0 :: CCAZ:MYR Comparing File: BRMY20140224CC18REG013SPFNSY13.xml :: TZZZ:BR :: TAZZ:INB... (1 Reply)
Discussion started by: vamsi gunda
1 Replies

6. Shell Programming and Scripting

Finding difference between two columns of unequal length

Hi, I have two files which look like this cat waitstate.txt 18.2 82.1 cat gostate.txt 5.6 5.8 6.1 6.3 6.6 6.9 7.2 7.5 (4 Replies)
Discussion started by: jamie_123
4 Replies

7. Shell Programming and Scripting

awk name pair values

Team, I have a file like below FILE: NAM1,KEY1,VAL1 NAM1,KEY2,VAL2 NAM1,KEY3,VAL3 NAM2,KEY1,VALA NAM2,KEY2,VALB NAM2,KEY3,VALCOutput: I have to build commands like below <Script> VAL1 VAL2 VAL3 NAME1 <Script> VALA VALB VALC NAME2Can you please help with awk command i can use... (4 Replies)
Discussion started by: mallak
4 Replies
All times are GMT -4. The time now is 03:09 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy