Searching backwards using regular expressions Post: 302389010

Forum: Shell Programming and Scripting. Post 302389010 by drl on Friday 22nd of January 2010 06:48:36 AM

Hi.

I like the cgrep utility for situations like this. It lets you conveniently specify regular expressions for the preceding and following boundaries of a match, the "windows":

Code:

#!/usr/bin/env bash

# @(#) s1	Demonstrate previous boundary match with cgrep.
# http://www.bell-labs.com/project/wwexptools/cgrep/

echo
set +o nounset
LC_ALL=C ; LANG=C ; export LC_ALL LANG
echo "Environment: LC_ALL = $LC_ALL, LANG = $LANG"
echo "(Versions displayed with local utility \"version\")"
version >/dev/null 2>&1 && version "=o" $(_eat $0 $1)
set -o nounset
echo

FILE=${1-data1}

echo " Data file $FILE:"
cat "$FILE"

echo
echo " Results:"
cgrep -D -w "else if" "find me" "$FILE"

exit 0

producing:

Code:

% ./s1

Environment: LC_ALL = C, LANG = C
(Versions displayed with local utility "version")
OS, ker|rel, machine: Linux, 2.6.26-2-amd64, x86_64
Distribution        : Debian GNU/Linux 5.0 
GNU bash 3.2.39

 Data file data1:
if(condition)
	multiline
	statement
else if(condition)
	multiline
	statement
else if(condition)
	multiline
	statement
else if(condition)
	multiline
	statement
else if(condition)
	multiline
	statement
	find me
else if(condition)
	multiline
	statement

 Results:
else if(condition)
	multiline
	statement
	find me

You will need to obtain, compile, and install cgrep yourself. See the URL mentioned in the script for that.
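If cgrep is not at hand, the same backward search can be approximated with awk. This is only a sketch under stated assumptions: block_before is a hypothetical helper name, it is not how cgrep works internally, and it simply buffers lines from the most recent line matching the boundary pattern until the target pattern is seen.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of cgrep): print the lines from the nearest
# preceding line matching "start" through the line matching "target".
# Both patterns are treated as awk extended regular expressions.
block_before() {
  local start=$1 target=$2 file=$3
  awk -v start="$start" -v target="$target" '
    $0 ~ start  { buf = "" }                    # new boundary: restart the buffer
                { buf = buf $0 "\n" }           # remember every line since then
    $0 ~ target { printf "%s", buf; buf = "" }  # hit: emit boundary through target
  ' "$file"
}

# Usage with the data file from the post:
#   block_before "else if" "find me" data1
```

If the target appears before any boundary line, this sketch prints everything from the start of the file up to the target.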

Best wishes ... cheers, drl

LEARN ABOUT PLAN9

regexp

REGEXP(6)							   Games Manual 							 REGEXP(6)

NAME

       regexp - regular expression notation

DESCRIPTION

       A  regular  expression  specifies  a  set  of  strings of characters.  A member of this set of strings is said to be matched by the regular
       expression.  In many applications a delimiter character, commonly /, bounds a regular expression.  In the following specification for  regular
       expressions the word `character' means any character (rune) but newline.

       The syntax for a regular expression e0 is

	      e3:  literal | charclass | '.' | '^' | '$' | '(' e0 ')'

	      e2:  e3
		|  e2 REP

	      REP: '*' | '+' | '?'

	      e1:  e2
		|  e1 e2

	      e0:  e1
		|  e0 '|' e1

       A literal is any non-metacharacter, or a metacharacter (one of .*+?[]()|\^$), or the delimiter preceded by \.

       A  charclass  is  a  nonempty string s bracketed [s] (or [^s]); it matches any character in (or not in) s.  A negated character class never
       matches newline.  A substring a-b, with a and b in ascending order, stands for the inclusive range of characters between a and  b.   In  s,
       the  metacharacters -, ], an initial ^, and the regular expression delimiter must be preceded by a \; other metacharacters have no special
       meaning and may appear unescaped.
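
       A small sketch of character classes and ranges, using grep -E as a stand-in. This is an assumption: grep -E implements POSIX extended regular expressions, which agree with this notation for simple ranges and negation but differ in how backslash is treated inside brackets, so only the common subset is shown.

```shell
# LC_ALL=C keeps the range a-c to the plain ASCII letters a, b, c.
in_class=$(printf 'a1\nb2\nz9\nA1\n' | LC_ALL=C grep -E '^[a-c][0-9]$')
not_in_class=$(printf 'a1\nb2\nz9\nA1\n' | LC_ALL=C grep -E '^[^a-c][0-9]$')

printf '%s\n' "$in_class"       # a1 and b2: first character is in [a-c]
printf '%s\n' "$not_in_class"   # z9 and A1: first character is not in [a-c]
```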

       A . matches any character.

       A ^ matches the beginning of a line; $ matches the end of the line.

       The REP operators match zero or more (*), one or more (+), zero or one (?), instances respectively of the preceding regular expression e2.
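
       The three repetition operators can be compared side by side. The sketch below uses grep -E, on the assumption that its *, +, and ? behave like the REP operators described here; the sample words are made up for illustration.

```shell
words='ac
abc
abbc'

star=$(printf '%s\n' "$words" | LC_ALL=C grep -E '^ab*c$')   # zero or more b
plus=$(printf '%s\n' "$words" | LC_ALL=C grep -E '^ab+c$')   # one or more b
opt=$(printf '%s\n' "$words" | LC_ALL=C grep -E '^ab?c$')    # zero or one b

printf '%s\n' "$star"   # ac, abc, abbc
printf '%s\n' "$plus"   # abc, abbc
printf '%s\n' "$opt"    # ac, abc
```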

       A concatenated regular expression, e1e2, matches a match to e1 followed by a match to e2.

       An alternative regular expression, e0|e1, matches either a match to e0 or a match to e1.
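
       In the grammar, concatenation (e1) binds more tightly than alternation (e0), so parentheses are needed to anchor a whole alternative. A rough illustration with grep -E, assumed to follow the same precedence; the word list is invented for the example.

```shell
input='cat
catalog
hotdog
cow'

# Without parentheses: (^cat) or (dog$).
loose=$(printf '%s\n' "$input" | LC_ALL=C grep -E '^cat|dog$')
# With parentheses: the whole line is exactly cat or dog.
strict=$(printf '%s\n' "$input" | LC_ALL=C grep -E '^(cat|dog)$')

printf '%s\n' "$loose"    # cat, catalog, hotdog
printf '%s\n' "$strict"   # cat
```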

       A match to any part of a regular expression extends as far as possible without preventing a match to the remainder of the regular
       expression.
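
       The "as far as possible" rule is easiest to see in a substitution. The sketch below uses sed, assuming its matching is equally greedy; the second command narrows the match with a negated character class.

```shell
# .* runs to the last X that still lets the rest of the pattern match.
greedy=$(printf 'aXbXc\n' | sed 's/a.*X/[&]/')
# [^X]* cannot cross an X, so the match stops at the first one.
narrow=$(printf 'aXbXc\n' | sed 's/a[^X]*X/[&]/')

printf '%s\n' "$greedy"   # [aXbX]c
printf '%s\n' "$narrow"   # [aX]bXc
```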

SEE ALSO

       awk(1), ed(1), sam(1), sed(1), regexp(2)

																	 REGEXP(6)
