remove duplicate


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting remove duplicate
# 1  
Old 11-02-2009
remove duplicate

Hi,

I am tryung to use shell or perl to remove duplicate characters
for example , if I have " I love google" it will become I love ggle"
or even "I loveggle" if removing duplicate white space

Thanks
CC
# 2  
Old 11-02-2009
Please post a better example with the input, process rules, and output clearer.
As posted the duplicate space rule is inconsistent with the duplicate character rule.
# 3  
Old 11-07-2009
To make it simple

if original input is "I love yahoo"
I need remove duplicate words
so it will become

I love yah
# 4  
Old 11-07-2009
Quote:
Originally Posted by ccp
...
if original input is "I love yahoo"
I need remove duplicate words
so it will become

I love yah
You mention "duplicate words", but you have removed duplicate characters in your expected output.
So I shall assume you want to remove duplicate characters, and not words.

Code:
$
$ cat f2
i love google
i love yahoo
i am tech savvy
$
$ perl -lne 's/(.)\1//g; print' f2
i love ggle
i love yah
i am tech say
$
$

tyler_durden
# 5  
Old 11-07-2009
Code:
echo I Love Yahoo | sed 's/\([a-z]\)\1//'

# 6  
Old 11-07-2009
tyler

it seems not right

"I love yahoo" becomes " I love yah" because "o" appears in love once
so "I am tech savvy" should become " I am tech savy"
Thanks
CC
# 7  
Old 11-07-2009
Quote:
Originally Posted by ccp
...
"I love yahoo" becomes " I love yah" because "o" appears in love once
Nope, that's an incorrect statement. The correct statement is:

"I love yahoo" becomes "I love yah" because "o" appears in "yahoo" twice, and hence (both occurrences are) removed.

Quote:
...
so "I am tech savvy" should become " I am tech savy"
...
Going by the stated requirement in your first post:

Quote:
...
I am tryung to use shell or perl to remove duplicate characters
...
and noting that the only duplicate characters in "I am tech savvy" are the two "v"s in "savvy", they should be removed, thus resulting in "I am tech say".

tyler_durden
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

How to remove duplicate lines?

Hi All, I am storing the result in the variable result_text using the below code. result_text=$(printf "$result_text\t\n$name") The result_text is having the below text. Which is having duplicate lines. file and time for the interval 03:30 - 03:45 file and time for the interval 03:30 - 03:45 ... (4 Replies)
Discussion started by: nalu
4 Replies

2. UNIX for Dummies Questions & Answers

Remove Duplicate Lines

Hi I need this output. Thanks. Input: TAZ YET FOO FOO VAK TAZ BAR Output: YET VAK BAR (10 Replies)
Discussion started by: tara123
10 Replies

3. UNIX for Dummies Questions & Answers

Remove duplicate

Hi, How can I replace || with space and then remove duplicate from following text? T111||T222||T444||T222||T555 Thanks in advance (10 Replies)
Discussion started by: tinku981
10 Replies

4. Shell Programming and Scripting

Remove duplicate

Hi , I have a pipe seperated file repo.psv where i need to remove duplicates based on the 1st column only. Can anyone help with a Unix script ? Input: 15277105||Common Stick|ESHR||Common Stock|CYRO AB 15277105||Common Stick|ESHR||Common Stock|CYRO AB 16111278||Common Stick|ESHR||Common... (12 Replies)
Discussion started by: samrat dutta
12 Replies

5. Shell Programming and Scripting

How to remove duplicate ID's?

HI I have file contains 1000'f of duplicate id's with (upper and lower first character) as below i/p: a411532A411532a508661A508661c411532C411532 Requirement: But i need to ignore lowercase id's and need only below id's o/p: A411532 A508661 C411532 (9 Replies)
Discussion started by: buzzme
9 Replies

6. Shell Programming and Scripting

Help with remove duplicate content

Input file data_1 10 US data_1 2 US data_1 5 UK data_2 20 ENGLAND data_2 12 KOREA data_3 4 CHINA . . data_60 123 US data_60 23 UK data_60 45 US Desired output file data_1 10 US data_1 5 UK data_2 20 ENGLAND data_2 12 KOREA (2 Replies)
Discussion started by: perl_beginner
2 Replies

7. UNIX for Dummies Questions & Answers

Remove duplicate in array

Hi, I have a list of numbers stored in an array as below. 5 7 10 30 30 40 50 Please advise how could I remove the duplicate value in the array ? Thanks in advance. (5 Replies)
Discussion started by: Rock
5 Replies

8. Shell Programming and Scripting

Remove duplicate

Hi all, I have a text file fileA.txt DXRV|02/28/2006 11:36:49.049|SAC||||CDxAcct=2420991350 DXRV|02/28/2006 11:37:06.404|SAC||||CDxAcct=6070970034 DXRV|02/28/2006 11:37:25.740|SAC||||CDxAcct=2420991350 DXRV|02/28/2006 11:38:32.633|SAC||||CDxAcct=6070970034 DXRV|02/28/2006... (2 Replies)
Discussion started by: sabercats
2 Replies

9. Shell Programming and Scripting

Remove duplicate ???

Hi all, I have a out.log file CARR|02/26/2006 10:58:30.107|CDxAcct=1405157051 CARR|02/26/2006 11:11:30.107|CDxAcct=1405157051 CARR|02/26/2006 11:18:30.107|CDxAcct=7659579782 CARR|02/26/2006 11:28:30.107|CDxAcct=9534922327 CARR|02/26/2006 11:38:30.107|CDxAcct=9534922327 CARR|02/26/2006... (3 Replies)
Discussion started by: sabercats
3 Replies

10. Shell Programming and Scripting

remove duplicate

i have a text its contain many record, but its written in one line, i want to remove from that line the duplicate record, not record have fixed width ex: width = 4 inputfile test.txt =abc cdf abc abc cdf fgh fgh abc abc i want the outputfile =abc cdf fgh only those records can any one help... (4 Replies)
Discussion started by: kazanoova2
4 Replies
Login or Register to Ask a Question