Perl -- Script to re-format data Post: 302903387

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Shell script to format a .CSV data

Hi There I needed to write a Unix shell script which will pick up the data from a .CSV file and reformat it as per the requirement and write it to another .CSV file. Currently I am in the proess of Data Import to "Remedy System" (A one kind of incident mangement Application) and this...

2. UNIX for Advanced & Expert Users

shell script to format .CSV data

Hi all, I have written a shell script to search a specified directory (e.g. /home/user) for a list of specific words (shown as ${TMPDIR}/wordlist below). The script works well enough, but I was wondering if there was a way to display the line number that the word is found on? Thanks! cat...

3. Shell Programming and Scripting

Need script to format data specifically

Hi all, I need a script specially using loops to print below output... variables x, a and b will be read from user... x a b x+1 a b+1 x+2 a b+2 x+3 a+1 b x+4 a+1 b+1 x+5 a+1 b+2 for example.... 1 ...

4. Shell Programming and Scripting

Perl Script for reading table format data from file.

Hi, i need a perl script which reads the file, content is given below. and output in new file. TARGET DRIVE IO1 IO2 IO3 IO4 IO5 ------------ --------- --------- --------- --------- --------- 0a.1.8 266 236 ...

5. Shell Programming and Scripting

Help with perl script to output data in table format...

Hello, I need help with a perl script that will process a text file and match virtual server name to profile(s). the rest will be ignored. Virtual server name follows the word "virtual" in the begging of the line. There could be multiple profiles assigned to one virtual server. For example, ...

6. Shell Programming and Scripting

awk - script help: column to row format of data allignment?

Experts Good day, I have the following data, file1 BRAAGRP1 A2X B2X C2X D2X BRBGRP12 A3X B3X Z10 D09 BRC1GRP2 LO01

7. UNIX for Dummies Questions & Answers

How to retrive data from DB(Aqua studio) in CVS format using UNIX script?

I am using aqua studio DB. I need to retrive the data from my database using uxin script in .csv format. i am using select query along with the joins. my o/p in the DB is of the below format. Cycle IDCycle StatusRecord 98N-0000ACV23-3636FCliet Level (Af)Success1689393HF-J7879-09090RCliet Level...

8. Shell Programming and Scripting

Script to generate Excel file or to SQL output data to Excel format/tabular format

Hi , i am generating some data by firing sql query with connecting to the database by my solaris box. The below one should be the header line of my excel ,here its coming in separate row. TO_CHAR(C. CURR_EMP_NO ---------- --------------- LST_NM...

9. Shell Programming and Scripting

A script to format a file (ideally PERL)

Hi forum members. It has been several years since my last post. Currently I am using fairly large datasets on a day to day basis for handling immigration cases at a law firm. Our Input file is filled out by our secretary staff. The first column is the case ID-sample ID then the second column is...

10. Shell Programming and Scripting

In PErl script: need to read the data one file and generate multiple files based on the data

We have the data looks like below in a log file. I want to generat files based on the string between two hash(#) symbol like below Source: #ext1#test1.tale2 drop #ext1#test11.tale21 drop #ext1#test123.tale21 drop #ext2#test1.tale21 drop #ext2#test12.tale21 drop #ext3#test11.tale21 drop...

LEARN ABOUT DEBIAN

regexp::list

Regexp::List(3pm)					User Contributed Perl Documentation					 Regexp::List(3pm)

NAME

       Regexp::List - builds regular expressions out of a list of words

SYNOPSIS

	 use Regexp::List;
	 my $l	= Regexp::List->new;
	 my $re = $l->list2re(qw/foobar fooxar foozap fooza/);
	 # $re is now qr/foo(?:[bx]ar|zap?)/

ABSTRACT

       This module offers "list2re" method that turns a list of words into an optimized regular expression which matches all words therein.

       The optimized regular expression is much more efficient than a simple-minded '|'-concatenation thereof.

DESCRIPTION

       This module use Object-Oriented approach so you can use this module as a base and tweak its features.  This module is a base class of
       Regexp::Optimizer.

   EXPORT
       Since this is an OO module there is no symbol exported.

METHODS

       This module offers methods below;

       $l = Regexp::List->new(key=>value, ...)
	   Constructor.  When arguments are fed in key => value, manner, it sets attributes.  See "$l->set" for details

       $re = $l->list2re(list of words ...)
	   Does the job.  Takes a list of words and turn it into an optimal regular expresson.	See "IMPLEMENTATION" to find out how it is
	   achieved.  If you want to know the underlying black magic even further, see the source.

       $l->set(key => value, ...)
	   Sets attributes.  There are many attributes supported but let me mention just a few that you may be interested.

	   lookahead
	       Whether you prepend a lookahead assertion or not.  Default value is 1.  This module is smart enough to omit the assertion when you
	       don't need one.

		 $re = $l->list2re(qw/1 2 3 infinity/);
		 # qr/(?=[123i])(?:[123]|infinity)/
		 $re = $l->set(lookahead=>0)->list2re(qw/1 2 3 infinity/);
		 # qr/(?:[123]|infinity)/

	   quotemeta
	       Whether you quote metacharacters or not.  Default is 1.	If you really need this feature try Regexp::Optimizer instead.

		 @list = qw/3 3.14 3.14159265358979/;
		 $re = $l->list2re(@list);
		 # qr/3(?:.14(?:159265358979)?)?)/
		 $re = $l->set(lookahead=>0)->list2re(@list);
		 # qr/3(?:.14(?:159265358979)?)?)/
		 # which does match 3.14 but also "11+3=14"

	   modifies
	       Currently it accepts 'i', 'm', 's', and 'x', the same as regular expression modifiers.

		 @list = qw/Perl perl BASIC basic/;
		 $re = $l->list2re(@list);
		 # qr/(?=[BPbp])(?:[Pp]erl|BASIC|basic)/
		 $re = $l->set(modifiers => 'i')->list2re(@list);
		 # qr/(?=[bp])(?:perl|basic)/i
		 print $l->set(modifiers => 'x')->list2re(@list);
		 # Try for yourself;  Isn't itcute ?

       $l->expand($re);
	   Utility methods to expand a regular expression.  Handy when you want to check the complex regexes.

       $l->unexpand($re);
	   Utility methods to unexpand a regular expression.

IMPLEMENTATION

       This module optimizes the regular expression as follows.  Let's see what happens when qw/foobar fooxar foozap fooza/ is fed

       trie building (common prefix aggregation)
	   first the corresponding trie structure is built

		  +- bar
	     foo -+- xar
		  +- za -+- p
			 +- ''

       common suffix aggregation
	   it checks if any leaf node can be optimized for common suffix.  In this case we can do so to "bar" and "xar".

		  +- b -+-ar
	     foo -+- x -+
		  +- za -+- p
			 +- ''

       character class conversion
	   If a branch contains more than two single characters, it turns it into a character class.

	     foo -+- [bx] --- ar
		  +- za -+-p
			 +- ''

       empty leaf to "?"
	   Empty leaf is converted to a '?' quantifier

	     foo -+- [bx] --- ar
		  +- za -+-p?

       join all leafs into a group
	   And the final result is reached.

	     foo(?:[bx]ar|zap?)

BENCHMARKS

       This module is faily robust.  You can practically use this module to find a regular expression that matches all words in a dictionary.
       Here is a result by on perl 5.8.0, FreeBSD 4-Stable, Pentium III 800 Mhz with 512 MB RAM.

	# Sat May 31 09:11:06 2003 (	 0.000000 s) Reading /usr/share/dict/words
	# Sat May 31 09:11:07 2003 (	 0.847797 s) 235881 lines read.
	# Sat May 31 09:11:07 2003 (	 0.000000 s) Making regexp.
	# Sat May 31 09:13:09 2003 (   121.596928 s) Done.
	# Sat May 31 09:13:09 2003 (	 0.000000 s) Saving to t/words.rx
	# Sat May 31 09:13:09 2003 (	 0.000000 s) Reading t/words.rx
	# Sat May 31 09:13:13 2003 (	 3.679176 s) Done.
	# Sat May 31 09:13:13 2003 (	 0.000000 s) Opening /usr/share/dict/words for comparison.
	# Sat May 31 09:13:13 2003 (	 0.255222 s) /usr/share/dict/words:235881 lines found.
	# Sat May 31 09:13:13 2003 (	 0.000000 s) Showtime!
	# 235881/235881
	# Sat May 31 10:44:17 2003 (  5464.370409 s) Done.
	# Sat May 31 10:44:17 2003 (  5464.370624 s) 43.167 matches/s

       The result of optimization is obvious as the number of alteration increases.  Here is a result of a benchmark which matches randomly picked
       words against "/usr/share/dict/words".

	 ====	     2 words
		 Rate naive optim
	 naive 1.79/s	 --  -28%
	 optim 2.49/s	39%    --

	 ====	   256 words
	       s/iter naive optim
	 naive	 31.7	 --  -81%
	 optim	 5.95  433%    --

SEE ALSO

       Regexp::Optimizer -- uses this module as its base

       "eg/" directory in this package contains example scripts.

       Perl standard documents
	   perltodo, perlre

       CPAN Modules
	   Regexp::Presuf, Text::Trie

       Books
	   Mastering Regular Expressions  <http://www.oreilly.com/catalog/regex2/>

AUTHOR

       Dan Kogai <dankogai@dan.co.jp>

COPYRIGHT AND LICENSE

       Copyright 2003 by Dan Kogai, All Rights Reserved.

       This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

perl v5.14.2							    2011-11-16							 Regexp::List(3pm)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Shell script to format a .CSV data

Discussion started by: Uday1982

2. UNIX for Advanced & Expert Users

shell script to format .CSV data

Discussion started by: tmcmurtr

3. Shell Programming and Scripting

Need script to format data specifically

Discussion started by: swapniltathe

4. Shell Programming and Scripting

Perl Script for reading table format data from file.

Discussion started by: asak