Full Discussion: Regex to identify pattern
Post 303004861 by Scott on Tuesday 10th of October 2017 08:10:45 AM
Good, then RudiC just gave you one :) It's not pretty, but they often aren't!
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

To identify filename in which having match PATTERN

Hi, Any idea to identify bunch of files( gz format) in which having match PATTERN wanted and print out those files ? :) Regards, (14 Replies)
Discussion started by: cedrichiu
14 Replies

2. Shell Programming and Scripting

identify specific pattern

hello I want to create a folder under all partitions that are : /hostname1 /hostname2..... how to exclude /hostname (i.e start directly with /hostname1) thank you (1 Reply)
Discussion started by: melanie_pfefer
1 Reply

3. Shell Programming and Scripting

How to identify the occurence of a pattern between a unique character?

hi, is it possible to find the number of occurrences of a pattern between two parentheses. for e.g i have a file as below. >>{ >>hi >>GoodMorning >>how are you? >>} >>is it good, >>tell me yes, if it is good In the above file, it's clear the occurrence of word "Good"... (17 Replies)
Discussion started by: divak
17 Replies

4. UNIX for Dummies Questions & Answers

Use Regex to identify / format a complex string

First of all, please have mercy on me. I am not a noob to programming, but I am about as noob as you can get with regex. That being said, I have a problem. I've got a string that looks something like this: Publication - Bob M. Jones, Tony X. Stark, and Fred D. Man, \"Really Awesome Article... (1 Reply)
Discussion started by: egill
1 Reply

5. Shell Programming and Scripting

Regex to identify a full-stop as a sentence delimiter

Hello, Splitting a sentence using the full-stop/question-mark/exclamation is a common device. Whereas the question-mark / exclamation do not pose too much of a problem; the full-stop as a sentence delimiter raises certain issues because of its varied use: just to name a few. Standard parsers... (9 Replies)
Discussion started by: gimley
9 Replies

6. Shell Programming and Scripting

Regex to identify word in second position on a line

I am interested in finding a regex to find a word in second position on a line. The word in question is या I tried the following PERL EXPRESSION but it did not work: ] या or ^\W या But both gave Null results I am giving below a Sample file: देना या सौंपना=delegate तह जमना या... (8 Replies)
Discussion started by: gimley
8 Replies

7. Shell Programming and Scripting

PHP - Regex for matching string containing pattern but without pattern itself

The sample file: dept1: user1,user2,user3 dept2: user4,user5,user6 dept3: user7,user8,user9 I want to match by '/^dept2.*/' but don't want to have substring 'dept2:' in output. How to compose such regex? (8 Replies)
Discussion started by: urello
8 Replies

8. Shell Programming and Scripting

Identify file pattern, take count of pattern, then act

Guys - Need your ideas on a section of code to finish something up. To make a long story short, I'm parsing a print output file that goes to pre-printed forms. I'm intercepting it, parsing it, formatting it, cutting it up into individual pages, grabbing the text I want in zones, building an... (3 Replies)
Discussion started by: ampsys
3 Replies

9. Shell Programming and Scripting

Regex to identify unique words in a dictionary database

Hello, I have a dictionary which I am building for the Open Source Community. The data structure is as under HEADWORD=PARTOFSPEECH=ENGLISH MEANING as shown in the example below अ=m=Prefix signifying negation. अँहँ=ind=Interjection expressing disapprobation. अं=int=An interjection... (2 Replies)
Discussion started by: gimley
2 Replies

10. Shell Programming and Scripting

Regex to identify illegal characters in a perso-arabic database

I am working on Sindhi: a perso-Arabic script and since it shares the Unicode-block with over 400 other languages, quite often the database contains characters which are not wanted: illegal characters. I have identified the character set of Sindhi which is given below: For clarity's sake, each... (8 Replies)
Discussion started by: gimley
8 Replies
XMLPRETTY(1p)          User Contributed Perl Documentation          XMLPRETTY(1p)

NAME
       xmlpretty - XML pretty printer

SYNOPSIS
       xmlpretty [--options] [filename]

DESCRIPTION
       xmlpretty is the command-line interface to XML::Handler::YAWriter, acting as a tool to add and remove pretty printing to XML files.

       xmlpretty has several methods to add human readability.

       If you want to add readability without adding so-called ignorable whitespace, use it in the following way:

           $ xmlpretty --AddHiddenNewline --AddHiddenAttrTab --CatchEmptyElement uglyfile.xml > prettyfile.xml

       If you do not want to process the file further, but only want it human readable, add visible whitespace to the file as follows:

           $ xmlpretty --PrettyWhiteNewline --PrettyWhiteIndent --CatchEmptyElement uglyfile.xml > prettyfile.xml

       You may use YAWriter to clean whitespace from XML documents. This may work in 99% of the cases where you want to get rid of ignorable whitespace caused by the various forms of pretty printing.

           $ xmlpretty --NoWhiteSpace --NoComments --AddHiddenNewline --AddHiddenAttrTab --CatchEmptyElement prettyfile.xml > cleanfile.xml

   Options
       Options are given in a GNU-like --option idiom.

       AddHiddenNewline boolean
           Add hidden newline before ">"

       AddHiddenAttrTab boolean
           Add hidden tabulation for attributes

       CatchEmptyElement boolean
           Catch empty Elements, apply "/>" compression

       CatchWhiteSpace boolean
           Catch whitespace with comments

       IsSGML boolean
           This option will cause start_document, processing_instruction and doctype_decl to appear as SGML. The SGML is still well-formed of course, if your SAX events are well-formed.

       NoComments boolean
           Suppress Comments

       NoDTD boolean
           Suppress DTD

       NoPI boolean
           Suppress Processing Instructions

       NoProlog boolean
           Suppress <?xml ... ?> Prolog

       NoWhiteSpace boolean
           Suppress WhiteSpace to clean documents from prior pretty printing.

       PrettyWhiteIndent boolean
           Add visible indent before any eventstring

       PrettyWhiteNewline boolean
           Add visible newlines before any eventstring

       SAX1 boolean (not yet implemented)
           Output only SAX1 compliant eventstrings

   Bugs
       Automatic recoding between 8bit and 16bit does not yet work correctly!

       I have Perl-5.6 at home and here I can specify "use utf8;" in the right places to make recoding work. But I dislike saying "use 5.00555;" because many systems run 5.00503.

AUTHOR
       Michael Koehne, Kraehe@Copyleft.De

   Thanks
       "Derksen, Eduard (Enno), CSCIO" <enno@att.com> helped me with the Escape hash and gave quite a lot of useful comments.

SEE ALSO
       perl and XML::Parser::PerlSAX

perl v5.10.1                          2010-02-18                          XMLPRETTY(1p)
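
For readers without xmlpretty installed, the two ideas in the man page above (adding visible whitespace, and cleaning whitespace left by earlier pretty printing) can be sketched in Python. This is only an illustration using the standard library's xml.dom.minidom, not XML::Handler::YAWriter or its options; the function names prettify and compact are hypothetical, and the "hidden" newline modes have no equivalent here.

```python
from xml.dom import minidom


def _drop_whitespace_text(node):
    """Recursively remove text nodes that contain only whitespace."""
    for child in list(node.childNodes):
        if child.nodeType == child.TEXT_NODE and not child.data.strip():
            node.removeChild(child)
        elif child.hasChildNodes():
            _drop_whitespace_text(child)


def prettify(xml_text):
    """Return the document with visible newlines and indentation added."""
    doc = minidom.parseString(xml_text)
    # Drop old layout whitespace first so indentation is not doubled.
    _drop_whitespace_text(doc)
    return doc.toprettyxml(indent="  ")


def compact(xml_text):
    """Return the root element with whitespace-only text nodes removed."""
    doc = minidom.parseString(xml_text)
    _drop_whitespace_text(doc)
    return doc.documentElement.toxml()


if __name__ == "__main__":
    ugly = "<a><b>x</b><c/></a>"
    pretty = prettify(ugly)
    print(pretty)
    print(compact(pretty))  # <a><b>x</b><c/></a>
```

Like the man page's "99% of the cases" caveat, this sketch treats every whitespace-only text node as ignorable, so it would also drop whitespace that a document actually relies on.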
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.