Grep couple of consecutive lines if each lines contains certain string

05-30-2012

Registered User

81, 5

Join Date: Oct 2011

Last Activity: 16 June 2016, 4:29 AM EDT

Location: Bucharest

Posts: 81

Thanks Given: 22

Thanked 5 Times in 5 Posts

Grep couple of consecutive lines if each lines contains certain string

Hello,

I want to extract from a file like :

Code:

20120530025502914 | REQUEST | whatever
20120530025502968 | RESPONSE | whatever
20120530025502985 | RESPONSE | whatever
20120530025502996 | REQUEST | whatever
20120530025503013 | REQUEST | whatever
20120530025503045 | RESPONSE | whatever

I want to extract all groups of 2 lines in which the first line contanis 'REQUEST' and the next line below contains 'RESPONSE'

Basically from the above file I would like to have extracted the following :

Code:

20120530025502914 | REQUEST | whatever
20120530025502968 | RESPONSE | whatever
20120530025503013 | REQUEST | whatever
20120530025503045 | RESPONSE | whatever

(Please note those numbers from the first fields to be able to identify which line I've extracted from the initial file - basically the lines in red from the initial file ).
I'm not completely emty-handed, I have this snippet as a starting point ('stolen' a while ago from internet :P ) :

Code:

nawk 'c-->0;$0~s{if(b)for(c=b+1;c>1;c--)print r[(NR-c+1)%b];print;c=a}b{r[NR%b]=$0}' b=0 a=1 s="string" file

Which takes the line I that contains the "string" (in my case "REQUEST" ) and the next line after, but i don;t know where to put the condition that the line after to contain the string "RESPONSE" and if so to extract the respective group of 2 lines.

Last edited by Franklin52; 05-30-2012 at 08:03 AM.. Reason: Please use code tags for data and code samples

black_fender

View Public Profile for black_fender

Find all posts by black_fender

05-30-2012

Registered User

7,747, 559

Join Date: Feb 2007

Last Activity: 20 April 2020, 11:28 AM EDT

Location: The Netherlands

Posts: 7,747

Thanks Given: 139

Thanked 559 Times in 520 Posts

Try this:

Code:

awk -F"|" '$2 ~ "REQUEST" {s=$0;f=1;next} f && $2 ~ "RESPONSE" {print s RS $0;f=0}' file

These 2 Users Gave Thanks to Franklin52 For This Post:

Franklin52

View Public Profile for Franklin52

Find all posts by Franklin52

05-30-2012

Registered User

81, 5

Join Date: Oct 2011

Last Activity: 16 June 2016, 4:29 AM EDT

Location: Bucharest

Posts: 81

Thanks Given: 22

Thanked 5 Times in 5 Posts

Quote:

Originally Posted by Franklin52

Try this:

Code:

awk -F"|" '$2 ~ "REQUEST" {s=$0;f=1;next} f && $2 ~ "RESPONSE" {print s RS $0;f=0}' file

Thanks for the response, but I think there must be a syntax error, because I get this :

Code:

 echo kkk | awk -F"|" '$2 ~ "REQUEST" {s=$0;f=1;next} f && $2 ~ "RESPONSE" {print s RS $0;f=0}'
awk: syntax error near line 1
awk: bailing out near line 1

black_fender

View Public Profile for black_fender

Find all posts by black_fender

05-30-2012

Registered User

7,747, 559

Join Date: Feb 2007

Last Activity: 20 April 2020, 11:28 AM EDT

Location: The Netherlands

Posts: 7,747

Thanks Given: 139

Thanked 559 Times in 520 Posts

On Solaris use nawk or /usr/xpg4/bin/awk rather than awk

Franklin52

View Public Profile for Franklin52

Find all posts by Franklin52

05-30-2012

Registered User

1,801, 116

Join Date: Oct 2003

Last Activity: 15 May 2015, 11:55 AM EDT

Location: 54.23, -4.53

Posts: 1,801

Thanks Given: 1

Thanked 116 Times in 101 Posts

Quote:

Originally Posted by black_fender

I'm not completely emty-handed, I have this snippet as a starting point ('stolen' a while ago from internet :P ) :

Code:

nawk 'c-->0;$0~s{if(b)for(c=b+1;c>1;c--)print r[(NR-c+1)%b];print;c=a}b{r[NR%b]=$0}' b=0 a=1 s="string" file

I recognise my own code from this post: https://www.unix.com/302098992-post2.html

Since I wrote that in 2006, I notice it has propogated over the internet in other forums and blogs, and now seems to have taken a life of its own.

It's not applicable in this case.

Ygor

View Public Profile for Ygor

Find all posts by Ygor

05-30-2012

Registered User

2,288, 480

Join Date: Apr 2007

Last Activity: 3 May 2020, 8:28 AM EDT

Location: Saint Paul, MN USA / BSD, CentOS, Debian, OS X, Solaris

Posts: 2,288

Thanks Given: 430

Thanked 480 Times in 395 Posts

Hi.

I often use cgrep for complex matching and manipulation. It extends some of the features of GNU/grep and is comparable in speed. The heart of the following script is the cgrep. The surrounding code displays the environment under which it was run, as well as comparing results:

Code:

#!/usr/bin/env bash

# @(#) s1	Demonstrate matching on successive lines, cgrep.
# See: http://sourceforge.net/projects/cgrep/

# Section 1, setup, pre-solution, $Revision: 1.25 $".
# Infrastructure details, environment, debug commands for forum posts. 
# Uncomment export command to run script as external user.
# export PATH="/usr/local/bin:/usr/bin:/bin" HOME=""
set +o nounset
pe() { for _i;do printf "%s" "$_i";done; printf "\n"; }
pl() { pe;pe "-----" ;pe "$*"; }
edges() { local _f _n _l;: ${1?"edges: need file"}; _f=$1;_l=$(wc -l $_f);
  head -${_n:=3} $_f ; pe "--- ( $_l: lines total )" ; tail -$_n $_f ; }
db() { : ; }
db() { ( printf " db, ";for _i;do printf "%s" "$_i";done;printf "\n" ) >&2 ; }
C=$HOME/bin/context && [ -f $C ] && $C cgrep

set -o nounset
pe

FILE=${1-data1}

# Display sample of data file, with edges or head & tail as a last resort.
db " Section 1: display of input data and expected output."
pe " || start sample [ specimen first:middle:last ] $FILE"
specimen $FILE expected-output.txt 2>/dev/null \
|| { pe "(head/tail)"; head -n 5 $FILE; pe " ||"; tail -n 5 $FILE; }
pe " || end"

# Section 2, solution.
pl " Results:"
db " Section 2: solution."
cgrep -a 'REQUEST.*\n.*RESPONSE' $FILE |
tee f1

# Section 3, post-solution, check results, clean-up, etc.
v1=$(wc -l <expected-output.txt)
v2=$(wc -l < f1)
pl " Comparison of $v2 created lines with $v1 lines of desired results:"
db " Section 3: validate generated calculations with desired results."

pl " Comparison with desired results:"
if [ ! -f expected-output.txt -o ! -s expected-output.txt ]
then
  pe " Comparison file \"expected-output.txt\" zero-length or missing."
  exit
fi
if cmp expected-output.txt f1
then
  pe " Succeeded -- files have same content."
else
  pe " Failed -- files not identical -- detailed comparison follows."
  if diff -b expected-output.txt f1
  then
    pe " Succeeded by ignoring whitespace differences."
  fi
fi

exit 0

producing:

Code:

% ./s1

Environment: LC_ALL = C, LANG = C
(Versions displayed with local utility "version")
OS, ker|rel, machine: Linux, 2.6.26-2-amd64, x86_64
Distribution        : Debian GNU/Linux 5.0.8 (lenny) 
bash GNU bash 3.2.39
cgrep ATT cgrep 8.15

 db,  Section 1: display of input data and expected output.
 || start sample [ specimen first:middle:last ] data1
Whole: 5:0:5 of 6 lines in file "data1"
20120530025502914 | REQUEST | whatever
20120530025502968 | RESPONSE | whatever
20120530025502985 | RESPONSE | whatever
20120530025502996 | REQUEST | whatever
20120530025503013 | REQUEST | whatever
20120530025503045 | RESPONSE | whatever

Whole: 5:0:5 of 4 lines in file "expected-output.txt"
20120530025502914 | REQUEST | whatever
20120530025502968 | RESPONSE | whatever
20120530025503013 | REQUEST | whatever
20120530025503045 | RESPONSE | whatever
 || end

-----
 Results:
 db,  Section 2: solution.
20120530025502914 | REQUEST | whatever
20120530025502968 | RESPONSE | whatever
20120530025503013 | REQUEST | whatever
20120530025503045 | RESPONSE | whatever

-----
 Comparison of 4 created lines with 4 lines of desired results:
 db,  Section 3: validate generated calculations with desired results.

-----
 Comparison with desired results:
 Succeeded -- files have same content.

I like awk for its flexilbility (and especially in readability compared to sed for compilcated jobs), but I don't like one-off (nonce) scripts, as well as the fact that my measurements indicate that awk uses about 5 times as much CPU and 5 times as much wall clock time as most members of the grep family for the similar tasks (however, cgrep does use more system time, about twice as much).

See the sourceforge link for the compilable source if it is not in an available repository.

Best wishes ... cheers, drl

drl

View Public Profile for drl

Find all posts by drl

05-30-2012

Moderator

12,296, 3,792

Join Date: Nov 2008

Last Activity: 1 January 2021, 1:47 AM EST

Location: Amsterdam

Posts: 12,296

Thanks Given: 679

Thanked 3,792 Times in 3,282 Posts

@drl, grep cannot do this and I do not think cgrep is present on Solaris, is it? cgrep looks nice though and it is fast indeed. I presume cgrep was tested against gawk, which is one of the slowest awks. Perhaps you could compare it to the fastest awk, which is mawk..

Last edited by Scrutinizer; 05-30-2012 at 11:26 AM..

Scrutinizer

View Public Profile for Scrutinizer

Find all posts by Scrutinizer

Shell Programming and Scripting

Grep couple of consecutive lines if each lines contains certain string

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Remove duplicate consecutive lines with specific string

Discussion started by: Mannu2525

2. Shell Programming and Scripting

Grep three consecutive lines if each lines contains certain string

Discussion started by: Saumitra Pandey

3. Shell Programming and Scripting

Grep a string and count following lines starting with another string

Discussion started by: Syeda Sumayya

4. Shell Programming and Scripting

Grep 2 consecutive lines and replace the second line in a file

Discussion started by: Dhoni

5. Shell Programming and Scripting

Grep a string from input file and delete next three lines including the line contains string in xml

Discussion started by: greet_sed

6. Shell Programming and Scripting

Merge two non-consecutive lines based on line number or string

Discussion started by: munkee

7. Shell Programming and Scripting

Print lines between two lines after grep for a text string

Discussion started by: jbruce

8. Shell Programming and Scripting

grep string & a few lines after

Discussion started by: ashterix

9. Shell Programming and Scripting

Grep string but also it will show the next 5 lines

Discussion started by: thepurple

10. Shell Programming and Scripting

grep string & next n lines

Discussion started by: ashterix