Regular expression to match multiple lines?


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Regular expression to match multiple lines?
# 1  
Old 01-18-2014
Question Regular expression to match multiple lines?

Using a regular expression, I would like multiple lines to be matched.

By default, a period (.) matches any character except newline. However, (?s) and /s modifiers are supposed to force . to accept a newline and to match any character including a newline.

However, the following two perl statements that use (?s) and /s failed to find a pattern spanning multiple lines.

Code:
perl -p -e 's/a(?s).*f/z/' srcfile > dstfile

and
Code:
perl -p -e 's/a.*f/z/s' srcfile > dstfile

where the content of srcfile is
Code:
abc
def
ghi
jkl

which can be created by

Code:
cat > srcfile << EOF
abc
def
ghi
jkl
EOF


I wanted the regular expression to match the string consisting of two lines that starts with "a" and ends with "f".

In other words, I wanted to replace
Code:
abc
def
with z.

So, I wanted dstfile to become
Code:
z
ghi
jkl

However, the above perl statements failed. The regular expressions in the above perl statements matched nothing, making dstfile identical to srcfile.

What went wrong?

What regular expression would match multiple lines?

How can a perl or bash command line find a pattern spanning multiple lines in srcfile, replace it with another, and save the modified text into dstfile?

Many thanks, in advance.

Last edited by Scott; 01-18-2014 at 10:35 AM.. Reason: More code tags
# 2  
Old 01-18-2014
perl -pe operates on a line-by-line basis, so it will not match a multiline pattern
You could try something like (within a paragraph):
Code:
perl -00 -pe 's/a.*f/z/s' file

or (within a file)
Code:
perl -0777 -pe 's/a.*f/z/s' file


Last edited by Scrutinizer; 01-18-2014 at 11:08 AM..
This User Gave Thanks to Scrutinizer For This Post:
# 3  
Old 01-18-2014
Hi.

Similarly, given file data1:
Code:
abc
def
ghi
jkl

and perl code p1:
Code:
#!/usr/bin/env perl

# @(#) p1	Demonstrate slurp and single-string match.

use strict;
use warnings;

my $a = slurp();
$a =~ s/abc.*f/z/s;
print $a;

exit(0);

# Best practices, p213 for a file.
sub slurp {
  my $scalar = do { local $/; <> };
  return $scalar;
}

then:
Code:
$ ./p1 data1
z
ghi
jkl

See perldoc perlre for details.

Best wishes ... cheers, drl
This User Gave Thanks to drl For This Post:
# 4  
Old 01-18-2014
Question

Thank you for your replies, Scrutinizer and drl.


drl wrote:

Quote:
Originally Posted by drl
# @(#) p1 Demonstrate slurp and single-string match.
# Best practices, p213 for a file.

I do not understand what are meant by "@(#)", "p1" and "p213".

Does "@(#)" have anything to do with an array variable?

Do p1 and p213 mean page 1 and page 213 of a book or pdf document?

My copy of perlre does not have any page numbers printed.

Many thanks, in advance.
# 5  
Old 01-19-2014
Hi.
Quote:
Originally Posted by LessNux
... I do not understand what are meant by "@(#)", "p1" and "p213" ...
The shell, perl, awk, etc. all ignore anything after an unquoted "#". The string "@(#)" is a special key so that a one-line description of the script can be extracted. For example using script p1 as input: $ what ./p1 wil produce this on standatrd output:
Code:
p1	Demonstrate slurp and single-string match.

This is an old convention, but we have found it useful to generate local indices of scripts. We have written a script to do this as well as create the indices for our shop. You might find an heirloom man page for command what. See an example of the string at bash - shell script templates - Stack Overflow

The string "p1" is the name of the file in which the perl script resides.

The string "p213" refers to the page number in the book Amazon.com: Perl Best Practices eBook: Damian Conway: Books

Best wishes ... cheers, drl

Last edited by drl; 01-19-2014 at 12:47 AM..
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Regular expression match

echo 20110101 | awk '{ print match($0,/^((17||18||19||20)|)-*(|0|1)-*(|0||3)$/)) I am getting a match for the above, where as it shouldn't, as there is no hyphen in the echoed date. Another question is what is the difference between || and | in the above statement (4 Replies)
Discussion started by: tostay2003
4 Replies

2. Homework & Coursework Questions

Regular Expression to match files in Perl

Hi Everybody! I need some help with a regular expression in Perl that will match files named messages, but also files named message.1, message.2 and so on. So really I need one that will find messages and messages that might be followed by a period and a digit without matching other files like... (2 Replies)
Discussion started by: Hax0rc1ph3r
2 Replies

3. Shell Programming and Scripting

regular expression grouping across multiple lines

cat book.txt book1 price 23 sku 1234 auth Bill book2 sku 1233 price 22 auth John book3 auth Frank price 24 book4 price 25 sku 129 auth Tod import re f = open('book.txt', 'r') text = f.read() f.close() m =... (2 Replies)
Discussion started by: chirish
2 Replies

4. Shell Programming and Scripting

regular expression exact match

hi everyone suppose we have two scenario echo ABCD | grep \{4\} DATE echo SYSDATE | grep \{4\} SYSDATE i want to match the string of four length only please help (5 Replies)
Discussion started by: aishsimplesweet
5 Replies

5. Shell Programming and Scripting

regular expression match

I am trying to match a similar line using grep with regular expression the line is /remote/mac/pbbbb/abc/def/hij/hop/include/abc/tif/element/test/testfiles/Office.cpp:57: const OfficeType& getType().get() const; I just need to extract the bold characters using grep with regular expression.... (5 Replies)
Discussion started by: prasbala
5 Replies

6. Shell Programming and Scripting

regular expression to match repeated appearance

Hi all, I am looking for a regex syntax to match repeated appearance. Likes, ']+]+' matches for string '65A SOME MORE AND 78B' Now, this gets messy if I need to extract all such repeated appearance. I don't want to write ] four or five times for matching repeated appearance. Thanks in... (2 Replies)
Discussion started by: guruparan18
2 Replies

7. Shell Programming and Scripting

Regular Expression to match repeated characters

Hello All I have file which contain sample data like below - test.txt ---------------------------------------------- jambesh aaa india trxxx sdasd mentor asss light train bbblah --------------------------------------------- I want to write a regX which would print only those... (4 Replies)
Discussion started by: jambesh
4 Replies

8. Shell Programming and Scripting

Regular expression match

Hi all, any idea how to match the following: char*<no or any string or space> buf and char *<no or any string or space> buf i need to capture the buf characters too. currently i need two checks to cover this: #search char* <any string> buf or char *<any string> buf @noarray =... (2 Replies)
Discussion started by: ChaMeN
2 Replies

9. UNIX for Dummies Questions & Answers

Regular Expression - match 'b' that follows 'a' and is at the end of a string

Hi, I'm struggling with a regex that would match a 'b' that follows an 'a' and is at the end of a string of non-white characters. For example: Line 1: aba abab b abb aab bab baa I can find the right strings but I'm lacking knowledge of how to "discard" the bits that precede bs.... (2 Replies)
Discussion started by: machinogodzilla
2 Replies

10. UNIX for Dummies Questions & Answers

Exact match with regular expression

Hi I have a file with data arranged into columns. The first column is the chromosome name. When I use grep to subset only rows with chr1, I get chr1 but also chr10, chr11,.. How do I get only rows with chr1? grep chr1 filein > fileout head fileout chr1 59757841 chr11 108258691 ... (2 Replies)
Discussion started by: jdhahbi
2 Replies
Login or Register to Ask a Question