Remove duplicate words from column 1

06-29-2015

Registered User

114, 2

Join Date: Oct 2012

Last Activity: 26 April 2020, 9:12 PM EDT

Posts: 114

Thanks Given: 57

Thanked 2 Times in 1 Post

Remove duplicate words from column 1

Tried using sed and uniq but it's removing the entire line. Can't seem to figure a way to just remove the word. Any help is appreciated. I have a file:

Code:

dog, text1, text2, text3
dog, text1, text2, text3
dog, text1, text2, text3
cat, text1, text2, text3

Trying to remove all duplicate instances of dog and just keep the formatting:

Code:

dog, text1, text2, text3
    text1, text2, text3
    text1, text2, text3
cat, text1, text2, text3

Thanks.

jimmyf

View Public Profile for jimmyf

Find all posts by jimmyf

06-29-2015

Registered User

1,781, 705

Join Date: May 2008

Last Activity: 10 November 2021, 5:38 PM EST

Posts: 1,781

Thanks Given: 62

Thanked 705 Times in 653 Posts

Code:

awk 'F[$1]++ {$1=OFS}1' test.file

This User Gave Thanks to Aia For This Post:

Aia

View Public Profile for Aia

Find all posts by Aia

06-29-2015

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

Or try:

Code:

awk 'F[$1]++ {p=$1; gsub(/./,FS,p); sub($1,p)}1' file

or, if spacing is always exactly one space:

Code:

awk 'F[$1]++ {gsub(/./,FS,$1)}1'  file

Code:

dog, text1, text2, text3
     text1, text2, text3
     text1, text2, text3
cat, text1, text2, text3

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

06-29-2015

Registered User

114, 2

Join Date: Oct 2012

Last Activity: 26 April 2020, 9:12 PM EDT

Posts: 114

Thanks Given: 57

Thanked 2 Times in 1 Post

can the text be manipulated from horizontal to vertical for the output so there is only one row per instance?

Code:

dog, text1, text2, text3, text1 text2, text3, text1, text2, text3
cat, text1, text2, text3

jimmyf

View Public Profile for jimmyf

Find all posts by jimmyf

06-29-2015

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

Try

Code:

awk '$1 != L {printf "%s%s", DL, $0; DL=RS; L = $1; next} {printf "%s%s%s%s%s", $2, OFS, $3, OFS, $4} END {print ""}' FS="," OFS="," file

This User Gave Thanks to RudiC For This Post:

RudiC

View Public Profile for RudiC

Find all posts by RudiC

06-29-2015

Registered User

114, 2

Join Date: Oct 2012

Last Activity: 26 April 2020, 9:12 PM EDT

Posts: 114

Thanks Given: 57

Thanked 2 Times in 1 Post

Thanks again RudiC!

jimmyf

View Public Profile for jimmyf

Find all posts by jimmyf

06-30-2015

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

Code:

awk 'p!=$1{if(NR>1) print s; p=s=$1} {$1=x; s=s $0} END{print s}' FS=, OFS=, file

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

UNIX for Dummies Questions & Answers

Remove duplicate words from column 1

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Remove duplicate values in a column(not in the file)

Discussion started by: ratheeshjulk

2. Shell Programming and Scripting

Filter file to remove duplicate values in first column

Discussion started by: LMHmedchem

3. Shell Programming and Scripting

Find duplicate words in first column between "10" repetiotions

Discussion started by: phaethon

4. Shell Programming and Scripting

Remove duplicate rows based on one column

Discussion started by: clarissab

5. UNIX for Dummies Questions & Answers

[SOLVED] remove lines that have duplicate values in column two

Discussion started by: pathunkathunk

6. Shell Programming and Scripting

Remove very first pair of duplicate words

Discussion started by: manas_ranjan

7. UNIX for Dummies Questions & Answers

Remove duplicate rows when >10 based on single column value

Discussion started by: informaticist

8. Shell Programming and Scripting

Remove duplicate line detail based on column one data

Discussion started by: patrick87

9. Shell Programming and Scripting

remove duplicate words in a line

Discussion started by: sam_2921

10. UNIX for Dummies Questions & Answers

Remove duplicate rows of a file based on a value of a column

Discussion started by: risk_sly