The UNIX and Linux Forums  

Go Back   The UNIX and Linux Forums > Top Forums > UNIX for Dummies Questions & Answers
Google UNIX.COM


UNIX for Dummies Questions & Answers If you're not sure where to post a UNIX or Linux question, post it here. All UNIX and Linux newbies welcome !!

More UNIX and Linux Forum Topics You Might Find Helpful
Thread Thread Starter Forum Replies Last Post
SED replace string by occurrence uttamhoode Shell Programming and Scripting 4 03-05-2008 01:04 AM
last occurrence of a string across multiple files porphyrin UNIX for Dummies Questions & Answers 2 12-24-2007 05:39 AM
appending string to text file based on search string malaymaru Shell Programming and Scripting 1 06-09-2006 05:53 AM
Search and replace to first occurrence of string gilmord UNIX for Dummies Questions & Answers 7 05-03-2006 04:43 AM
copying or concatinating string from 1st bit, leaving 0th bit jazz High Level Programming 2 11-10-2005 08:38 AM

Reply
 
Submit Tools LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 10-27-2006
Registered User
 

Join Date: Jun 2005
Posts: 26
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
delete string in a text file leaving the first occurrence

Hi,

How can i delete the second and subsequent occurrence of a particular string from a file ?

eg) test.txt
cattle
bat
battle
mat
matter
cattle
cattle

my output file should be

cattle
bat
battle
mat
matter

I am new to unix and your advice is greatly appreciated.

Thanks in advance,
gops
Reply With Quote
Forum Sponsor
  #2 (permalink)  
Old 10-27-2006
vino's Avatar
Supporter (in vino veritas)
 

Join Date: Feb 2005
Location: Bangalore, India
Posts: 2,628
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Looks like you want to remove the duplicate entries.

Removing duplicates
Reply With Quote
  #3 (permalink)  
Old 10-27-2006
Registered User
 

Join Date: Jun 2005
Posts: 26
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Hi Vino,

Thanks for your prompt response.

Here i want to pass a string and only duplicate entry of that string should be deleted rather than all duplicates.

In the extract file which we have, lots of duplicates are available and we dont want to remove everything except the string which we specify explicitely.

Thanks in advance.

cheers,
gops
Reply With Quote
  #4 (permalink)  
Old 10-27-2006
...@...
 

Join Date: Feb 2004
Location: NM
Posts: 3,289
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Code:
#!/bin/ksh

awk -v check_val="$1" '{
        if( $0==check_val) { key[$0]++ }
        if(key[$0} < 2 ) print $0
       }' inputfile > outputfile
Reply With Quote
  #5 (permalink)  
Old 10-27-2006
Registered User
 

Join Date: Jun 2005
Posts: 26
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Hi Jim,

I have a problem executing the awk script which you have recommended.

I am using Korn shell. Do i need to change some thing to accommodate that.
Herewith i am attaching my file

test.txt
cat
cattle
bat
battle
mat
matter
fat
fatter
cattle
cattle

I am bit confused about the script. When i execute it, I am getting following error message.

awk: syntax error near line 1
awk: bailing out near line 1

Let me know what i should assign for $0 and $1.
Thanks in advance.
Since I am a novice, I have several queries from my end.

cheers,
gops
Reply With Quote
  #6 (permalink)  
Old 10-27-2006
Registered User
 

Join Date: Mar 2006
Location: Bangalore,India
Posts: 1,397
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
$1 is the string whose duplicates will be removed. Either you pass the string to that script or replace $0 by the string in that script.

Awk process the text file line by line. For each line read is placed in $0.
Reply With Quote
  #7 (permalink)  
Old 10-27-2006
Registered User
 

Join Date: Oct 2006
Location: Tampa, Florida
Posts: 14
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Could you use 'uniq'?

Code:
uniq -u test.txt
cat
cattle
bat
battle
mat
matter
fat
fatter
Reply With Quote
  #8 (permalink)  
Old 10-27-2006
Registered User
 

Join Date: Jun 2005
Posts: 26
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Hi Thomas,

I want to remove duplicates which are specified explicitely rather than all duplicates.
Thanks Thomas.

Hi Anbu,

I am getting the same error message. Can you take up my example and change the script accordingly.
Thanks in advance.

gops
Reply With Quote
  #9 (permalink)  
Old 10-27-2006
Registered User
 

Join Date: Mar 2006
Location: Bangalore,India
Posts: 1,397
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Code:
#!/bin/ksh

awk -v check_val="$1" '{
        if( $0==check_val) { key[$0]++ }
        if(key[$0] < 2 ) print $0
       }' inputfile > outputfile
Reply With Quote
  #10 (permalink)  
Old 10-31-2006
Registered User
 

Join Date: Jun 2005
Posts: 26
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Hi Anbu,

I was off for few days and hence the delay in replying.
I tried your stuff but I am getting this error
awk -v check_val="$1" '{
if( $0==check_val) { key[$0]++ }
if(key[$0] < 2 ) print $0
}' inputfile > outputfile

awk: syntax error near line 1
awk: bailing out near line 1

I created a script test and fired it like ksh test.ksh cattle

I am using korn shell. Do i need to make some changes for that ?

Thanks in advance.

cheers,
gops
Reply With Quote
  #11 (permalink)  
Old 10-31-2006
Registered User
 

Join Date: Jun 2005
Posts: 26
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Hi Guys,

It is working fine when i supstitute nawk with awk command. Anbu, thanks a lot.

When i pass the exact string that line gets deleted in the outputfile.

But i want to pass a part of the string and i want whole string to be deleted.

for eg

inputfile

cattle
battle
cat
mattle
cattle

my outputfile should be
cattle
battle
mattle

here i want to check for cat%. I dont know how to do that using nawk.
Please help me out on this.

Best Regards,
gops
Reply With Quote
  #12 (permalink)  
Old 10-31-2006
Registered User
 

Join Date: Oct 2006
Location: Bangalore, India
Posts: 41
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Quote:
Originally Posted by anbu23
Code:
#!/bin/ksh

awk -v check_val="$1" '{
        if( $0==check_val) { key[$0]++ }
        if(key[$0] < 2 ) print $0
       }' inputfile > outputfile
umm..how abt this?

awk '/$1/' inputfile>temp

awk '{for ( $0 in `cat temp` )
key[$0]++
if(key[$0] < 2 ) print $0
}' inputfile > outputfile


i didnt try this out, but i hope the logic is clear. i searched for the pattern specified by $1 and stored all the matches in temp. then instead of a direct comparison with 'if', i matched $0 with all the values in temp using a for loop. rest of the logic is the same. pardon any syntactical errors.
Reply With Quote
  #13 (permalink)  
Old 10-31-2006
Registered User
 

Join Date: Jun 2005
Posts: 26
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Hi npolayan,

I can able to create the temp file with appropriate values to look at but when i executed the awk statement
awk '{ for ($0 in `cat temp`)
key[$0]++
if(key[$0] < 2 ) print $0
}' inputfile > outputfile

I am getting error messages

illegal statement at source line 1
illegal statement at source line 4

Let me know if you have the answer.

Thanks in advance.

cheers,
gops
Reply With Quote
Google UNIX.COM
Reply

Thread Tools
Display Modes


The 50 most popular UNIX and Linux searches.
Google Search Cloud for The UNIX and Linux Forums
421 service not available, remote server has closed connection ^m automate ftp autosys awk trim bash eval bash for loop boot: cannot open kernel/sparcv9/unix command copy/move folder in unix couldn't set locale correctly curses.h cut command in unix export command in unix find grep find mtime find null character in a unix file grep multiple lines grep or grep recursive hp-ux ifconfig inaddr_any inappropriate ioctl for device lynx javascript mailx attachment mget mtime ping port remove first character from string in k shell replace space by comma , perl script rsync ftp scp recursive segmentation fault(coredump) sftp script snoop unix solaris change ip address stale nfs file handle syn_sent tar exclude tar extract to folder test: argument expected unix unix .profile unix forum unix forums unix internals unix mtime unix simulator unix.com vi substitute while loop within while loop shell script


All times are GMT -7. The time now is 03:22 AM.


Powered by: vBulletin, Copyright ©2000 - 2006, Jelsoft Enterprises Limited.
The UNIX and Linux Forums Content Copyright ©1993-2008 The CEP Blog All Rights Reserved -Ad Management by RedTyger Visit The Global Fact Book