Filter duplicate block of text using SED

08-07-2008

Registered User

2, 0

Join Date: Aug 2008

Last Activity: 22 December 2008, 9:53 PM EST

Posts: 2

Thanks Given: 0

Thanked 0 Times in 0 Posts

Filter duplicate block of text using SED

Hi,

I would like to print a block of text between 2 regular expression using Sed,
This can be achieved by using the command as shown below, however my problem is the same block of text is repeated twice. I would like to eliminate the duplicate block of text.

For Example

If my file test.txt contains following data.
**********************************************
start
test for the block
test for the block
test for the block
End
Blah Blah
Blah Blah
Blah Blah
start
test for the block
test for the block
test for the block
end

*******************
Now if i use sed command to print the text between regular expressions
"start" and "end"

sed -n '/start/,/end/p' text.txt >> ouput.txt

I get the block of text twice in output.txt file as shown below
********************************************************************
start
test for the block
test for the block
test for the block
end
start
test for the block
test for the block
test for the block
end
****************

Please help on how do I filter duplicate printing.

Thanks in Advance
Deepak

dkumar91

View Public Profile for dkumar91

Find all posts by dkumar91

08-07-2008

Registered User

1,009, 2

Join Date: May 2008

Last Activity: 28 October 2009, 7:03 PM EDT

Location: Sydney, Australia

Posts: 1,009

Thanks Given: 0

Thanked 2 Times in 2 Posts

Make sed quit when it encounters the end.

Code:

sed -n '/start/,/end/p;/end/q' text.txt >> output.txt

Annihilannic

View Public Profile for Annihilannic

Find all posts by Annihilannic

08-07-2008

Registered User

2, 0

Join Date: Aug 2008

Last Activity: 22 December 2008, 9:53 PM EST

Posts: 2

Thanks Given: 0

Thanked 0 Times in 0 Posts

Hi,

Thanks for your help and time on this, It works great.

Deepak.

dkumar91

View Public Profile for dkumar91

Find all posts by dkumar91

10-22-2008

Registered User

23, 0

Join Date: Sep 2008

Last Activity: 29 October 2008, 8:47 PM EDT

Posts: 23

Thanks Given: 0

Thanked 0 Times in 0 Posts

I'd like to piggy-back onto this post and ask, how do I read everything in between "start" and "end" so that "start" and "end" are not included in the extraction?

I recognize that I can use different words, but I want to keep "start" and "end" in my file. The reason is that I created a help page for a script I wrote. When option -h is used, it grabs the text from the file. It would be handy to have all my help pages begin with "start" and end with "end". But I don't want those words to display on the screen.

Thanks.

ankimo

View Public Profile for ankimo

Find all posts by ankimo

10-23-2008

Registered User

1,305, 26

Join Date: Jun 2007

Last Activity: 11 November 2016, 3:44 AM EST

Location: Beijing China

Posts: 1,305

Thanks Given: 0

Thanked 26 Times in 26 Posts

Code:

sed -n '/start/,/end/p
 /end/q
 ' filename | sed '/^start$/d
/^end$/d'

summer_cherry

View Public Profile for summer_cherry

Find all posts by summer_cherry

10-23-2008

Registered User

1,009, 2

Join Date: May 2008

Last Activity: 28 October 2009, 7:03 PM EDT

Location: Sydney, Australia

Posts: 1,009

Thanks Given: 0

Thanked 2 Times in 2 Posts

Or another idea:

Code:

awk '/^start$/ { while (getline && $0 !~ /^end$/) print }' inputfile

Annihilannic

View Public Profile for Annihilannic

Find all posts by Annihilannic

Shell Programming and Scripting

Filter duplicate block of text using SED

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

CSV File:Filter duplicate records from column1 & another column having unique record

Discussion started by: as7951

2. Shell Programming and Scripting

Filter duplicate records from csv file with condition on one column

Discussion started by: as7951

3. Shell Programming and Scripting

Filter file to remove duplicate values in first column

Discussion started by: LMHmedchem

4. UNIX for Dummies Questions & Answers

Filter records in a huge text file from a filter text file

Discussion started by: tech_frk

5. Shell Programming and Scripting

Block of text replacement using sed

Discussion started by: abhitanshu

6. Shell Programming and Scripting

Delete first block of text with sed/awk

Discussion started by: teresaejunior

7. Shell Programming and Scripting

using sed/awk to replace a block of text in a file?

Discussion started by: kiddsupreme

8. Shell Programming and Scripting

Filter or remove duplicate block of text without distinguishing marks or fields

Discussion started by: samask

9. Shell Programming and Scripting

Filter/remove duplicate .dat file with certain criteria

Discussion started by: mukeshguliao

10. Shell Programming and Scripting

using sed(?) to delete a block of text

Discussion started by: toast