Using sed, awk or perl to remove substring of all lines except the first

04-28-2013

Registered User

12, 0

Join Date: Aug 2011

Last Activity: 19 May 2013, 12:10 PM EDT

Posts: 12

Thanks Given: 1

Thanked 0 Times in 0 Posts

Using sed, awk or perl to remove substring of all lines except the first

Greetings All,

I would like to find all occurences of a pattern and delete a substring from the all matching lines EXCEPT the first. For example:

1234::group:user1,user2,user3,blah1,blah2,blah3
2222::othergroup:user9,user8
4444::othergroup2:user3,blah,blah,user1
1234::group3:user5,user1

This should be for all combinations of gid and user. If this can be accomplished using a sed or awk one liner that would be great. Otherwise, I guess I'll try a Perl script.

Any ideas to get me started will be greatly appreciated. I'm thinking a nested for loop with an internal sed call would be a start.

jacksolm

View Public Profile for jacksolm

Find all posts by jacksolm

04-28-2013

Registered User

858, 184

Join Date: Mar 2013

Last Activity: 12 May 2013, 11:33 PM EDT

Posts: 858

Thanks Given: 18

Thanked 184 Times in 179 Posts

It's unclear to me what output you want, and no point in guessing.

Also, please use code tags.

hanson44

View Public Profile for hanson44

Find all posts by hanson44

04-28-2013

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

I think this awk script does what you want (and you can turn it into a 1-liner if you insist, but I prefer readable). Note this this script also removes one of the duplicated group (blah) entries from the input line:

Code:

4444::othergroup2:user3,blah,blah,user1

Code:

awk '
BEGIN { FS = OFS = ":" }
{       n=split($4, g, /,/)
        for(i = 1; i <= n; i++)
                if(($1,g[i]) in key) {
                        for(j = i + 1; j <= n; j++) g[j - 1] = g[j]
                        i--
                        n--
                        c = 1
                } else  key[$1,g[i]]
        if(c) { c = 0
                $4 = n ? g[1] : ""
                for(j = 2; j <= n; j++) $4 = $4 "," g[j]
        }
        print
}' data

With your sample input, this script produces the output:

Code:

1234::group:user1,user2,user3,blah1,blah2,blah3
2222::othergroup:user9,user8
4444::othergroup2:user3,blah,user1
1234::group3:user5

As always, if you're using a Solaris/SunOS system, use /usr/xpg4/bin/awk, /usr/xpg6/bin/awk, or nawk instead of awk.

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

04-28-2013

Registered User

12, 0

Join Date: Aug 2011

Last Activity: 19 May 2013, 12:10 PM EDT

Posts: 12

Thanks Given: 1

Thanked 0 Times in 0 Posts

Here's a little background information. Due to group member limitations we had to split groups into separate lines by using different group names and identical GIDs. A few help desk admins maintaining the NIS map entered users into all of the groups instead of selecting the latest group to add the user. A single GID could have several group names. I want to loop through the group file and remove the redundant entries.

jacksolm

View Public Profile for jacksolm

Find all posts by jacksolm

04-28-2013

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

Quote:

Originally Posted by jacksolm

Here's a little background information. Due to group member limitations we had to split groups into separate lines by using different group names and identical GIDs. A few help desk admins maintaining the NIS map entered users into all of the groups instead of selecting the latest group to add the user. A single GID could have several group names. I want to loop through the group file and remove the redundant entries.

That was what I understood when I wrote the awk script for you. Did you decide not to try it because it is more than 1 line?

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

04-29-2013

Registered User

12, 0

Join Date: Aug 2011

Last Activity: 19 May 2013, 12:10 PM EDT

Posts: 12

Thanks Given: 1

Thanked 0 Times in 0 Posts

Don,

Sorry for the delay. I just had a chance to try your solution. I had a few typos which resulted in syntax errors. After debugging, the script was able to execute. It is strange that awk complained about the single quote before BEGIN. It didn't appear to delete the user from the redundant group. Any ideas? I will post the edited script. I appreciate your time and awk expertise.

jacksolm

View Public Profile for jacksolm

Find all posts by jacksolm

04-29-2013

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

Quote:

Originally Posted by jacksolm

Don,

Sorry for the delay. I just had a chance to try your solution. I had a few typos which resulted in syntax errors. After debugging, the script was able to execute. It is strange that awk complained about the single quote before BEGIN. It didn't appear to delete the user from the redundant group. Any ideas? I will post the edited script. I appreciate your time and awk expertise. Smilie

I suggest you copy the script I provided and save it into a file and execute that file. When I ran that code it produced exactly the output i listed right after the script. If it isn't doing that for you, you must still have some typos.

What are the results of running the command:

Code:

uname -a

on your system? What shell are you using?

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

Shell Programming and Scripting

Using sed, awk or perl to remove substring of all lines except the first

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

awk with sed to combine lines and remove specific odd # pattern from line

Discussion started by: cmccabe

2. Shell Programming and Scripting

Remove lines matching a substring in a specific column

Discussion started by: gfhsd

3. Shell Programming and Scripting

Sed/awk/perl substitution with multiple lines

Discussion started by: sudo

4. Shell Programming and Scripting

Process alternate lines in awk/sed/perl

Discussion started by: empyrean

5. Shell Programming and Scripting

Need an awk / sed / or perl one-liner to remove last 4 characters with non-unique pattern.

Discussion started by: right_coaster

6. Shell Programming and Scripting

Command to remove duplicate lines with perl,sed,awk

Discussion started by: cola

7. Shell Programming and Scripting

How to remove spaces using awk,sed,perl?

Discussion started by: cola

8. Shell Programming and Scripting

perl or awk remove empty lines when condition

Discussion started by: jimmy_y

9. Shell Programming and Scripting

How to remove lines before and after with awk / sed ?

Discussion started by: ashimada

10. Shell Programming and Scripting

Sed or Awk to remove specific lines

Discussion started by: Shoeless_Mike