Grep lines between two specific words after matching pattern


 
Thread Tools Search this Thread
Top Forums UNIX for Beginners Questions & Answers Grep lines between two specific words after matching pattern
# 8  
Old 02-20-2020
If it's not a big file, you can get a simple to understand but clunky way by using the output of grep -n "start" $filename and grep -n "end" $filename to get you the record numbers to search between and then perhaps a sed -n "$start_line,$end_line"p $filename

This would be slow with a very large file though because you would read it all three times.

Does this help, or is your file big enough to warrant a solution that just reads it once?





Kind regards,
Robin
# 9  
Old 02-20-2020
Do you want the End pattern excluded? Try



Code:
awk '/Start/,/End/ {if (/12345/) P = 1; if (/End/) P = 0} P' file
12345
.
.

EDIT: included?

Code:
awk '/Start/,/End/ {if (/12345/) P = 1; if (P) print; if (/End/) P = 0}' file
12345
.
.
End

If there are more pattern pairs found in the file (which is not specified nor found in sample input) we need to rethink.

Last edited by RudiC; 02-20-2020 at 06:44 AM..
# 10  
Old 02-20-2020
Dear rbatte1,





Dear RudiC,

As per your suggestion, i have tried the solution on sample file and output is like this

Code:
grep -n "start" $filename

Quote:
output:
12:start
41:start
Code:
grep -n "stop" $filename

Quote:
output:
28:end
49:end
there will be multiple matching patterns, hence need to grep with 3 matching pattern
# 11  
Old 02-20-2020
Code:
grep -zoP "(?s)Start.*12345.*End" file


Last edited by rdrtx1; 02-20-2020 at 12:44 PM..
# 12  
Old 02-20-2020
Code:
awk '/Start/{p=$0; next} p{p=p ORS $0} /End/{if(p~/12345/)print p; p=x}' file

Or, using vertical real estate:
Code:
awk '
  /Start/ {
    buffer=$0
    next
  } 
  buffer {
    buffer=buffer ORS $0
  } 
  /End/ {
    if(buffer~/12345/) print buffer
    buffer=""
  }
' file

# 13  
Old 02-21-2020
The previous did not work for me.
Perhaps because of the mistake that a test buffer makes assumptions about the contents that can go wrong.
Better have a separate state variable (here: buffer_on)
Code:
awk '
  buffer_on {
    buffer=buffer ORS $0
  }
  ! buffer_on && /Start/ {
    buffer_on=1
    buffer=$0
  }
  buffer_on && /End/ {
    if (buffer~/12345/) print buffer
    buffer_on=0
  }
' file

--- Post updated at 12:07 ---

Introducing a separator variable:
one can include/exclude the Start and/or End pattern by simply changing the order of the 3 code blocks.
Code:
awk '
  buffer_on {
    buffer=buffer ors $0
    ors=ORS
  }
  ! buffer_on && /Start/ {
    buffer_on=1
    buffer=ors=""
  }
  buffer_on && /End/ {
    if (buffer~/12345/) print buffer
    buffer_on=0
  }
' file

Code:
awk '
  ! buffer_on && /Start/ {
    buffer_on=1
    buffer=ors=""
  }
  buffer_on {
    buffer=buffer ors $0
    ors=ORS
  }
  buffer_on && /End/ {
    if (buffer~/12345/) print buffer
    buffer_on=0
  }
' file

Code:
awk '
  ! buffer_on && /Start/ {
    buffer_on=1
    buffer=ors=""
  }
   buffer_on && /End/ {
    if (buffer~/12345/) print buffer
    buffer_on=0
  }
  buffer_on {
    buffer=buffer ors $0
    ors=ORS
  }
' file

Code:
awk '
   buffer_on && /End/ {
    if (buffer~/12345/) print buffer
    buffer_on=0
  }
  buffer_on {
    buffer=buffer ors $0
    ors=ORS
  }
  ! buffer_on && /Start/ {
    buffer_on=1
    buffer=ors=""
  }
' file

# 14  
Old 02-22-2020
Dear Scrutinizer and MadeInGermany ,

both solutions are working fine for thank you so much.





Thanks rbatte1,

I have tried your approach also, it works fine, but issue is that sed -n "$start_line,$end_line"p $filename is not working
calling variable inside sed is not working, sed -n "3,15"p $filename is working fine but sed -n "$start_line,$end_line"p is not, is there any alternate solution.
As per your suggestion i have tried this

Code:

# input of sample.gmf is like this, without quotes, "DOCSTART_2 |" for start
  #input of sample.gmf is like this, without quotes,"DOCEND |" for end 

# input for third matching pattern is "343"

 
# after grep -n the output is like this "1:DOCSTART_2 |" or "520:DOCEND |"


grep -n "DOCSTART_2" /home/testing/sagar/sample.GMF | awk -F ":" '{print $1}'  >  cat /home/testing/sagar/DOCSTART_2    ## start line numbers for entire file 

grep -n "DOCEND" /home/testing/sagar/sample.GMF | awk -F ":" '{print $1}'  >  cat /home/testing/sagar/DOCEND   ## end line numbers for entire file

input=`grep -n "12345" /home/testing/sagar/sample.GMF | awk -F ":" '{print $1}'`     # matching pattern (343)
  
> /home/clarity/sagar/less_DOCSTART_2
> /home/clarity/sagar/great_DOCEND

for file in `cat /home/testing/sagar/DOCSTART_2`
do
a=`echo $file`
if [ $a -lt $input ]
then
echo $a >> /home/clarity/sagar/less_DOCSTART_2
else
echo hi >> /dev/null
fi
done
 DOCSTART=`sort -n  /home/clarity/sagar/less_DOCSTART_2 | tail -1`  ## greatest start number


 for file1 in `cat /home/clarity/sagar/DOCEND`
do
b=`echo $file1 | awk -F ":" '{print $1}'`
if [ $b -gt $input ]
then
echo $a >> /home/clarity/sagar/great_DOCEND
else
echo hi >> /dev/null
fi
done
DOCEND=`sort -n  /home/clarity/sagar/great_DOCEND | head -1`   ## lowest end number
cat /home/testing/sagar/sample.GMF | sed -n "$DOCSTART,$DOCEND"'p  > /home/testing/sagar/sample.GMF_new   ##### not working

any suggestions or any changes in approach.
Thanks in advance.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Grep only words containing specific string

Hello, I have two files. All urls are space seperated. source http://xx.yy.zz http://df.ss.sd.xz http://09.09.090.01 http://11.22.33 http://canada.xx.yy http://01.02.03.04 http://33.44.55 http://98.87.76.65 http://russia.xx.zz http://aa.tt.xx.zz http://1w.2e.3r.4t http://china.rr.tt ... (4 Replies)
Discussion started by: baris35
4 Replies

2. UNIX for Dummies Questions & Answers

Grep -v lines starting with pattern 1 and not matching pattern 2

Hi all! Thanks for taking the time to view this! I want to grep out all lines of a file that starts with pattern 1 but also does not match with the second pattern. Example: Drink a soda Eat a banana Eat multiple bananas Drink an apple juice Eat an apple Eat multiple apples I... (8 Replies)
Discussion started by: demmel
8 Replies

3. Shell Programming and Scripting

Sed: printing lines AFTER pattern matching EXCLUDING the line containing the pattern

'Hi I'm using the following code to extract the lines(and redirect them to a txt file) after the pattern match. But the output is inclusive of the line with pattern match. Which option is to be used to exclude the line containing the pattern? sed -n '/Conn.*User/,$p' > consumers.txt (11 Replies)
Discussion started by: essem
11 Replies

4. Shell Programming and Scripting

Find all matching words in text according to pattern

Hello dear Unix shell professionals, I am desperately trying to get a seemingly simple logic to work. I need to extract words from a text line and save them in an array. The text can look anything like that: aaaaaaa${important}xxxxxxxx${important2}ooooooo${importantstring3}...I am handicapped... (5 Replies)
Discussion started by: Grünspanix
5 Replies

5. UNIX for Dummies Questions & Answers

Extract lines with specific words with addition 2 lines before and after

Dear all, Greetings. I would like to ask for your help to extract lines with specific words in addition 2 lines before and after these lines by using awk or sed. For example, the input file is: 1 ak1 abc1.0 1 ak2 abc1.0 1 ak3 abc1.0 1 ak4 abc1.0 1 ak5 abc1.1 1 ak6 abc1.1 1 ak7... (7 Replies)
Discussion started by: Amanda Low
7 Replies

6. Shell Programming and Scripting

Finding lines matching the Pattern and their previous lines in a file

Hi, I am trying to locate the occurences of certain pattern like 'Possible network disconnect' in a text file. I can get the actual lines matching the pttern using: grep -w 'Possible network disconnect' file_name. But I am more interested in getting the timing of these events which are... (7 Replies)
Discussion started by: sagarparadkar
7 Replies

7. Shell Programming and Scripting

Keep lines with specific words up in an order

I hava a file with following data: number|CREDIT_ID|NULL date|SYS_CREATION_DATE|NULL varchar2|GGS_COMMIT_CHAR|NULL varchar2|GGS_OP_TYPE|NULL number|GGS_SCN|NULL| number|GGS_LOG_SEQ|NULL number|GGS_LOG_POS|NULL number|GGS_ORACREC_SCN|NULL varchar2|BATCH_ID|NULL char|GGS_IMAGE_TYPE|NULL ... (6 Replies)
Discussion started by: kolesunil
6 Replies

8. Shell Programming and Scripting

counting the lines matching a pattern, in between two pattern, and generate a tab

Hi all, I'm looking for some help. I have a file (very long) that is organized like below: >Cluster 0 0 283nt, >01_FRYJ6ZM12HMXZS... at +/99% 1 279nt, >01_FRYJ6ZM12HN12A... at +/99% 2 281nt, >01_FRYJ6ZM12HM4TS... at +/99% 3 283nt, >01_FRYJ6ZM12HM946... at +/99% 4 279nt,... (4 Replies)
Discussion started by: d.chauliac
4 Replies

9. Shell Programming and Scripting

How to from grep command from a file which contains matching words?

Hi all I have a file with below content (content is variable whenever new product is launched). I need form a grep command like this egrep "Unknown product|Invalid symboland so on" How to do it using a script? Unknown product Invalid symbol No ILX exch found exceeds maximum size AFX... (4 Replies)
Discussion started by: johnl
4 Replies

10. Programming

getting file words as pattern matching

Sir, I want to check for the repation of a user address in a file i used || as my delimiter and want to check repetaip0n of the address that is mailid and then i have to use IMAP and all. How can i do this... I am in linux ...and my file is linux file. ... (5 Replies)
Discussion started by: arunkumar_mca
5 Replies
Login or Register to Ask a Question