Duplicate rows in CSV files based on values

04-24-2009


I want to duplicate a row when a particular column of that row contains two or more comma-delimited values, so that each value ends up on its own row.

Input

Code:

abc,line one,value1
abc,line two, value1, value2
abc,line three,value1

needs to be converted to

Code:

abc,line one,value1
abc,line two, value1
abc,line two, value2
abc,line three,value1

How could this be done using a Unix shell script?

Thanks in advance.


Incrediblian


04-24-2009


Hello Incrediblian,
one thing I want to confirm: will "abc,line" be constant, or may it change?

pradeepreddy


04-24-2009


Hi pradeep,

Thanks for your reply. No, it's not the same; all the fields will be unique.
Input

abc, first line, value1
def, second line, value2,value3
ghi, third line, value4

needs to become

abc, first line, value1
def, second line, value2
def, second line, value3
ghi, third line, value4

Thanks in advance.

Incrediblian


04-24-2009


Code:

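perl -F, -lane '
# @F holds the comma-separated fields of the current row
if ( @F > 3 ) {
    # Repeat fields 0 and 1 once for each field from index 2 onward
    print join ",", @F[ 0, 1, $_ ] for 2 .. $#F;
}
else {
    # Rows with at most three fields are printed unchanged
    print;
}' infile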
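
For readers who do not have Perl at hand, the same field-splitting idea can be sketched in awk. This is only an illustrative equivalent and not part of the reply above; the function name split_rows is made up for the example, and it assumes, like the one-liner, that the first two fields never contain a comma.

```shell
#!/bin/sh
# split_rows is a hypothetical helper name, not something from the thread.
# It reads comma-separated rows on stdin and prints fields 1 and 2 once for
# every field from the third onward; shorter rows pass through unchanged.
split_rows() {
    awk -F, -v OFS=, '
        NF < 3 { print; next }
        { for (i = 3; i <= NF; i++) print $1, $2, $i }
    '
}

# Sample rows from the first post
printf '%s\n' \
    'abc,line one,value1' \
    'abc,line two, value1, value2' \
    'abc,line three,value1' | split_rows
```

Like the Perl version, this keeps any space that follows a comma in the input, so " value2" is printed with its leading space.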


radoulov


04-24-2009


Hi radoulov,

Thank you so much.

I am new to Perl and to Unix as well. Can you please explain it to me?

Can't this be done using plain Unix tools?

Thanks in advance.

Incrediblian


04-25-2009


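The script is as below:

Code:

#!/bin/bash
# Assumes every row contains the literal word "line" before the value list.
while read -r line
do
    # Everything up to "line", with "line," appended back
    first=$(echo "$line" | awk -F "line" '{print $1}')
    first=$(echo $first line,)
    # Everything after "line", with commas turned into spaces
    value=$(echo "$line" | awk -F "line" '{print $2}' | sed 's/,/ /g')
    # Unquoted on purpose: word splitting yields one value per iteration
    for i in $value
    do
        echo "$first $i"
    done
done < ref_file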

The contents of ref_file are as below:
abc, first line, value1
def, second line, value2,value3, value4
ghi, third line, value4 , value5

pradeepreddy


04-25-2009


Hello Incrediblian,

This will work much better than the previous one, because it splits on the commas instead of relying on the word "line".

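Code:

while read -r line
do
    # The first two fields form the fixed part of every output row
    first=$(echo "$line" | awk -F "," '{print $1}')
    second=$(echo "$line" | awk -F "," '{print $2}')
    first_half="$first,$second,"
    # Fields 3 onward are the values; commas become spaces for word splitting
    value=$(echo "$line" | cut -d "," -f 3- | sed 's/,/ /g')
    for i in $value
    do
        echo "$first_half$i"
    done
done < ref_file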

The contents of ref_file are as below:
abc,line first,value1
def,line second,value2,value3, value4
ghi,line third,value4, value2
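
Both loops above start several processes for every input row. As a rough alternative, the sketch below performs the same split using only shell parameter expansion. The function name expand_rows is made up for illustration, and it assumes, as the scripts above do, that the first two fields never contain a comma. Rows without a third field produce no output, which matches the loops above.

```shell
#!/bin/sh
# expand_rows is a hypothetical name. It reads rows on stdin, keeps the first
# two fields as they are, and prints one row per remaining value with the
# blanks around each value removed.
expand_rows() {
    while IFS=, read -r first second rest || [ -n "$first" ]
    do
        # Peel one value at a time off the front of the remaining fields
        while [ -n "$rest" ]
        do
            value=${rest%%,*}
            case $rest in
                *,*) rest=${rest#*,} ;;
                *)   rest= ;;
            esac
            # Strip leading and trailing blanks from the value
            value=${value#"${value%%[![:space:]]*}"}
            value=${value%"${value##*[![:space:]]}"}
            printf '%s,%s,%s\n' "$first" "$second" "$value"
        done
    done
}

# Sample rows from ref_file as listed above
printf '%s\n' \
    'abc,line first,value1' \
    'def,line second,value2,value3, value4' \
    'ghi,line third,value4, value2' | expand_rows
```

To run it against a file instead of the inline sample, redirect the file into the function, for example expand_rows < ref_file.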

pradeepreddy

