Perl Pattern Match

11-18-2009

Registered User

45, 0

Join Date: Sep 2009

Last Activity: 21 November 2013, 11:12 PM EST

Posts: 45

Thanks Given: 0

Thanked 0 Times in 0 Posts

Perl Pattern Match

Hi Friends, I have a tuff time with regular expressionss. Please let me know how to make this happen as it consumed lots of my time but in vain. Here is the sample text file i need to match for. I need to search for pattern1 removed, if it matches then search for pattern types either SE\ or dcfm-derby-dataload.sql not both. Similarly i need to search for pattern2 added and if it matches then search for either dcm-postgres-schema.sql or migrate\. Here i need to print files(dcm-postgres-schema.sql) and directories(migrate\) separately if it matches for removed and if it matches for added separately. Please help me.

---------[ removed ]---------------|----------------------------------
SE\ 2008-11-01 vbhupati |-
---------[ removed ]---------------|----------------------------------
dcfm-derby-dataload.sql vo+|-
---------[ removed ]---------------|----------------------------------
dcfm-derby-schema.sql voba+|-
---------[ removed ]---------------|----------------------------------
dcfm-nms-sybase-dataload.sql 27T+|-
-----------------------------------|---------------[ added ]----------
-| dcm-inm-postgres-schema.sql T0+
-----------------------------------|---------------[ added ]-----------
-| dcm-postgres-dataload.sql 23T08:+
----------------------------------|---------------[ added ]-----------
-| dcm-postgres-schema.sql -T01:58+
-----------------------------------|---------------[ added ]-----------
-| migrate\ --10-13T06:31 ycho

Output must be like, i should be able to print both the lines:
---------[ removed ]---------------|----------------------------------
SE\ 2008-11-01 vbhupati |-
---------[ removed ]---------------|----------------------------------
dcfm-derby-dataload.sql vo+|-
.
.
.
-----------------------------------|---------------[ added ]----------
-| dcm-inm-postgres-schema.sql T0+
-----------------------------------|---------------[ added ]-----------
-| dcm-postgres-dataload.sql 23T08:+
.
.
.
Please help me. Thanks, nmattam

nmattam

View Public Profile for nmattam

Find all posts by nmattam

11-18-2009

Registered User

2,288, 480

Join Date: Apr 2007

Last Activity: 3 May 2020, 8:28 AM EDT

Location: Saint Paul, MN USA / BSD, CentOS, Debian, OS X, Solaris

Posts: 2,288

Thanks Given: 430

Thanked 480 Times in 395 Posts

Hi.

If you have access to command glark, you can do this from the command line. Here is a short example of glark on your data:

Code:

#!/usr/bin/env bash

# @(#) s1	Demonstrate complex matching using "glark".

echo
set +o nounset
LC_ALL=C ; LANG=C ; export LC_ALL LANG
echo "Environment: LC_ALL = $LC_ALL, LANG = $LANG"
echo "(Versions displayed with local utility \"version\")"
version >/dev/null 2>&1 && version "=o" $(_eat $0 $1) glark
set -o nounset
echo

FILE=${1-data1}

echo " Data file $FILE:"
cat $FILE

echo
echo " Results with \"removed\":"
glark -U -a 1 "removed" -o 'SE\\' "dcfm-derby-dataload.sql" $FILE

echo
echo " Results with \"added\":"
glark -U -a 1 "added" -o 'migrate\\' "dcm-postgres-schema.sql" $FILE

exit 0

producing:

Code:

% ./s1

Environment: LC_ALL = C, LANG = C
(Versions displayed with local utility "version")
OS, ker|rel, machine: Linux, 2.6.26-2-amd64, x86_64
Distribution        : Debian GNU/Linux 5.0 
GNU bash 3.2.39
glark, version 1.8.0

 Data file data1:
---------[ removed ]---------------|----------------------------------
SE\ 2008-11-01 vbhupati |-
---------[ removed ]---------------|----------------------------------
dcfm-derby-dataload.sql vo+|-
---------[ removed ]---------------|----------------------------------
dcfm-derby-schema.sql voba+|-
---------[ removed ]---------------|----------------------------------
dcfm-nms-sybase-dataload.sql 27T+|-
-----------------------------------|---------------[ added ]----------
-| dcm-inm-postgres-schema.sql T0+
-----------------------------------|---------------[ added ]-----------
-| dcm-postgres-dataload.sql 23T08:+
----------------------------------|---------------[ added ]-----------
-| dcm-postgres-schema.sql -T01:58+
-----------------------------------|---------------[ added ]-----------
-| migrate\ --10-13T06:31 ycho

 Results with "removed":
    1 ---------[ removed ]---------------|----------------------------------
    2 SE\ 2008-11-01 vbhupati |-
    3 ---------[ removed ]---------------|----------------------------------
    4 dcfm-derby-dataload.sql vo+|-
    5 ---------[ removed ]---------------|----------------------------------

 Results with "added":
   13 ----------------------------------|---------------[ added ]-----------
   14 -| dcm-postgres-schema.sql -T01:58+
   15 -----------------------------------|---------------[ added ]-----------
   16 -| migrate\ --10-13T06:31 ycho

Briefly, this says that matches must occur within one line of each other. The -a means "and", the -o means "or". Line 5 is printed because it is within one line of matched line number 4.

The two cases are separated here to avoid (some) confusion, but I think a master pattern could be created to handle it all in one pass over the data file. It would, however, not be easy to read. The glark code is written in ruby.

I installed it from the Debian (5, "lenny") repository. More information can be found at glark and glark | freshmeat.net

I don't see this as complicated from the viewpoint of regular expressions, but it is complex from the viewpoint of structuring decisions.

If you don't have access to glark, then you will probably need to use perl or awk. There are many experts on both here, so someone may be along shortly to help with that ... cheers, drl

-----

Busy server seemed to post this response twice ... cheers, drl

Last edited by drl; 11-18-2009 at 11:51 AM..

drl

View Public Profile for drl

Find all posts by drl

11-19-2009

Registered User

45, 0

Join Date: Sep 2009

Last Activity: 21 November 2013, 11:12 PM EST

Posts: 45

Thanks Given: 0

Thanked 0 Times in 0 Posts

Perl Pattern Match

Dear drl,
I appreciate your help, but i do not have access to glark. I appreciate if any one can help me with another possible solution.
Thanks, nmattam

nmattam

View Public Profile for nmattam

Find all posts by nmattam

Shell Programming and Scripting

Perl Pattern Match

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Perl removing line match with pattern in column

Discussion started by: justbow

2. Shell Programming and Scripting

Perl removing line match with pattern in column

Discussion started by: justbow

3. Shell Programming and Scripting

PERL - Use of uninitialized value in pattern match (m//)

Discussion started by: chris01010

4. Shell Programming and Scripting

Perl match pattern

Discussion started by: arrals_vl

5. Shell Programming and Scripting

How to replace with pattern match using Perl

Discussion started by: sol_nov

6. Shell Programming and Scripting

perl pattern match on xml

Discussion started by: satnamx

7. Shell Programming and Scripting

Perl Array / pattern match large CPU usage

Discussion started by: Donkey25

8. Shell Programming and Scripting

pattern match url in string / PERL

Discussion started by: mrealty

9. Shell Programming and Scripting

Perl: Printing Multiple Lines after pattern match

Discussion started by: Deep9000

10. Shell Programming and Scripting

Perl script to match a pattern and print lines

Discussion started by: ammu