Concatenating column values with unique id into single row


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Concatenating column values with unique id into single row
# 1  
Old 08-17-2009
Concatenating column values with unique id into single row

Hi,

I have a table in Db2 with data say

id_1 phase1
id_1 phase2
id_1 phase3
id_2 phase1
id_2 phase2

I need to concatenate the values like
id_1 phase1,phase2,phase3
id_2 phase1,phase2

I tried recursive query but in vain as the length of string to be concatenated in quite long.
How can this be implemented using shell scripts effectively.

Thanks

---------- Post updated at 02:29 AM ---------- Previous update was at 02:11 AM ----------

I also need to insert the output values into new table
# 2  
Old 08-17-2009
Lightbulb

Code:
$more file9
id_1 phase1
id_1 phase2
id_1 phase3
id_2 phase1
id_2 phase2

$awk '
{
          if (vector[$1])
              vector[$1]=vector[$1]","$2
      else
              vector[$1]=$2
}
END {
        for (palabra in vector)
                print palabra" "vector[palabra]
}       
' file9

Result:
id_1 phase1,phase2,phase3
id_2 phase1,phase2

# 3  
Old 08-17-2009
using awk:

Code:
#  nawk '{t[$1]?t[$1]=t[$1]","$2:t[$1]=$2}END{for (i in t){print i,t[i]}}' infile
id_2 phase1,phase2
id_1 phase1,phase2,phase3

HTH
# 4  
Old 08-17-2009
Thanks ,

The below is the sample data.
2U_1_2 1-KCER
2U_1_2 1-KCER
2U_1_2 1-SKBY
2U_1_2 1-SKBY
2U_1_2 1-SKBY
2U_1_2 Post delivery
D_1_2 P05
D_1_2 P06
D_1_2 P07
D_1_2 Post delivery
K_1_3 Generate Programs
K_1_3 Parse Programs
K_1_3 Auctions Java
K_1_3 Registration
K_1_3 End-to-End Integration
K_1_3 Post delivery
D_18_1 P05
D_18_1 P06
D_18_1 P07
D_18_1 Post delivery

When I perform the awk command on this data the result is not as expected,
D_1_2 P05,P06,P07,Post
2U_1_2 1-KCER,1-KCER,1-SKBY,1-SKBY,1-SKBY,Post
D_18_1 P05,P06,P07,Post
K_1_3 Generate,Parse,Auctions,Registration,End-to-End,Post

Could you please give a solution for this, Also, I need to insert these values into another table, how to establish it

Last edited by jsaravana; 08-17-2009 at 09:03 AM..
# 5  
Old 08-17-2009
i guess it is cos u r having more columns comparing to the original sample u posted so just a little modification to the previous post, and if u have more columns, u have to add them
Code:
 
nawk '{t[$1]?t[$1]=t[$1]","$2" "$3:t[$1]=$2" "$3}END{for (i in t){print i,t[i]}}' inputfile

# 6  
Old 08-18-2009
Hi,
Thanks a many.

I was not aware of nawk commands so I gave few samples.Now I understood the concept, atleast at basic level.
There are around 3500 unique phase fields. Each may have 3 words or even 8 words and go on eg: .
Also some has characters like '&,-' in between words like 'Testing & Delivery' In that case I want whole text instead of 'Testing &'

Please let me know how it can be done.
# 7  
Old 08-18-2009
This should work:
Code:
# awk '{n=$1;sub($1" ","",$0);t[n]?t[n]=t[n]","$0:t[n]=$0}END{for(i in t)print i,t[i]}' f2
K_1_3 Generate Programs,Parse Programs,Auctions Java,Registration,End-to-End Integration,Post delivery
D_1_2 P05,P06,P07,Post delivery
2U_1_2 1-KCER,1-KCER,1-SKBY,1-SKBY,1-SKBY,Post delivery
D_18_1 P05,P06,P07,Post delivery


Last edited by danmero; 08-18-2009 at 04:03 AM.. Reason: Fix space problem.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

Find unique values but only in column 1

Hi All, Does anyone have any suggestions/examples of how i could show only lines where the first field is not duplicated. If the first field is listed more than once it shouldnt be shown even if the other columns make it unique. Example file : 876,RIBDA,EC2 876,RIBDH,EX7 877,RIBDF,E28... (4 Replies)
Discussion started by: mutley2202
4 Replies

2. Shell Programming and Scripting

Bring values in the second column into single line (comma sep) for uniq value in the first column

I want to bring values in the second column into single line for uniq value in the first column. My input jvm01, Web 2.0 Feature Pack Library jvm01, IBM WebSphere JAX-RS jvm01, Custom01 Shared Library jvm02, Web 2.0 Feature Pack Library jvm02, IBM WebSphere JAX-RS jvm03, Web 2.0 Feature... (10 Replies)
Discussion started by: kchinnam
10 Replies

3. Shell Programming and Scripting

awk to filter out lines containing unique values in a specified column

Hi, I have multiple files that each contain four columns of strings: File1: Code: 123 abc gfh 273 456 ddff jfh 837 789 ghi u4u 395 File2: Code: 123 abc dd fu 456 def 457 nd 891 384 djh 783 I want to compare the strings in Column 1 of File 1 with each other file and Print in... (3 Replies)
Discussion started by: owwow14
3 Replies

4. UNIX for Dummies Questions & Answers

Unique values in a row sum the next column in UNIX

Hi would like to ask you guys any advise regarding my problem I have this kind of data file.txt 111111111,20 111111111,50 222222222,70 333333333,40 444444444,10 444444444,20 I need to get this file1.txt 111111111,70 222222222,70 333333333,40 444444444,30 using this code I can... (6 Replies)
Discussion started by: reks
6 Replies

5. Shell Programming and Scripting

Print every 5 4th column values as separate row with different first column

Hi, I have the following file, chr1 100 200 20 chr1 201 300 22 chr1 220 345 23 chr1 230 456 33.5 chr1 243 567 90 chr1 345 600 20 chr1 430 619 21.78 chr1 870 910 112.3 chr1 914 920 12 chr1 930 999 13 My output would be peak1 20 22 23 33.5 90 peak2 20 21.78 112.3 12 13 Here the... (3 Replies)
Discussion started by: jacobs.smith
3 Replies

6. Shell Programming and Scripting

Print unique names in each row of a specific column using awk

Is it possible to remove redundant names in the 4th column? input cqWE 100 200 singapore;singapore AZO 300 400 brazil;america;germany;ireland;germany .... .... output cqWE 100 200 singapore AZO 300 400 brazil;america;germany;ireland (4 Replies)
Discussion started by: quincyjones
4 Replies

7. Shell Programming and Scripting

Subtracting each row from the first row in a single column file using awk

Hi Friends, I have a single column data like below. 1 2 3 4 5 I need the output like below. 0 1 2 3 4 where each row (including first row) subtracting from first row and the result should print below like the way shown in output file. Thanks Sid (11 Replies)
Discussion started by: ks_reddy
11 Replies

8. Shell Programming and Scripting

duplicate row based on single column

I am a newbie to shell scripting .. I have a .csv file. It has 1000 some rows and about 7 columns... but before I insert this data to a table I have to parse it and clean it ..basing on the value of the first column..which a string of phone number type... example below.. column 1 ... (2 Replies)
Discussion started by: mitr
2 Replies

9. Shell Programming and Scripting

print unique values of a column and sum up the corresponding values in next column

Hi All, I have a file which is having 3 columns as (string string integer) a b 1 x y 2 p k 5 y y 4 ..... ..... Question: I want get the unique value of column 2 in a sorted way(on column 2) and the sum of the 3rd column of the corresponding rows. e.g the above file should return the... (6 Replies)
Discussion started by: amigarus
6 Replies

10. Shell Programming and Scripting

Converting Column values to comma delimted single Row

I have a requirement in which i have to read a file which has multiple columns seperated by a pipe "|" from this i have to read each column values seperately and create a comma seperated row for the column and write to another file. eg: Input file: ColA ColB 1 2 2 x 3 y... (5 Replies)
Discussion started by: nvuradi
5 Replies
Login or Register to Ask a Question