Sponsored Content
Full Discussion: Regex help needed
Top Forums Shell Programming and Scripting Regex help needed Post 302609071 by Bloomy on Monday 19th of March 2012 09:16:04 AM
Old 03-19-2012
Regex help needed

Hello,

I'd like to write a regex that transforms a German base form of a noun into one of its inflected forms, namely

I want to translate "Haus" to "Häuser"

This is what I've got:
Code:
/^(.+)$/_Umlaut( $1 )_er/

where _Umlaut( x )_ is a function operating on the noun stem captured by $1 The function performs the following transformations on the noun stem to model the effect of a typical vowel change in the stem:

Replace last occurrence of vowel [x] according to (vowel change):
Code:
/au/äu/
/a/ä/
/o/ö/
/u/ü/

Return the modified stem

However, this function doesn't work with "au", it only works for words like Topf -> Töpfe

In the case of "Haus", it wrongly outputs "Häüser", so it makes an Umlaut out of both vowels, which I don't want.

Any suggestions on how to tell the function how to perform this step only on the first vowel are appreciated :-)

Thanks and kind regards,
Kat

Last edited by jim mcnamara; 03-19-2012 at 12:11 PM..
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Sed and regex help needed

Hi all, I'm writing a script that replaces a value in a file. The file is formatted as follows: So, for this example, I'd like to replace the value for param_two. The value for param_two can be a one, or two-digit number. It replaces the value in file.cfg, and directs the... (9 Replies)
Discussion started by: marknu1
9 Replies

2. Shell Programming and Scripting

Converting perl regex to sed regex

I am having trouble parsing rpm filenames in a shell script.. I found a snippet of perl code that will perform the task but I really don't have time to rewrite the entire script in perl. I cannot for the life of me convert this code into something sed-friendly: if ($rpm =~ /(*)-(*)-(*)\.(.*)/)... (1 Reply)
Discussion started by: suntzu
1 Replies

3. Shell Programming and Scripting

regex in ls

Hi Experts, I am using ls with regex in the below manner: VAR="*.txt *.TXT" ls -l $VAR This is working fine if I have both txt and TXT extension files in my directory. But if any of them is not present, its throwing errors, that *.TXT file not found in the directory. So what am i missing... (6 Replies)
Discussion started by: sugarcane
6 Replies

4. Shell Programming and Scripting

perl regex help needed

Hi, I want to validate strings in perl, the string may contains characters from a-zA-Z0-9 and symbols +-_.:/\ To validate such a string I computed a regex if ($string =~ m/^/) { print "valid"; } else { print "invalid"; } but this regex also validates strings that contain... (8 Replies)
Discussion started by: zing_foru
8 Replies

5. UNIX for Dummies Questions & Answers

read regex from ID file, print regex and line below from source file

I have a file of protein sequences with headers (my source file). Based on a list of IDs (which are included in some of the headers), I'd like to print out only the specified sequences, with only the ID as header. In other words, I'd like to search source.txt for the terms in IDs.txt, and print... (3 Replies)
Discussion started by: pathunkathunk
3 Replies

6. Shell Programming and Scripting

Help needed in regex

Hi, Could you please help me in writing a regex for the following requirement? Let following be the string format: abc.cdef.ghij.lm I need to check between dots, there is atleast one character{a-z,A-Z,*}. Eg: abc1.gt2.345j is valid, but not 123.abc.vff.gth because 123 should not be... (2 Replies)
Discussion started by: lorzinian
2 Replies

7. UNIX for Dummies Questions & Answers

Regex Needed:(

I am looking for the proper regex to match the hostname "areagc11" of this log.... Any help would be awsome:) Oct 25 11:08:18 areagc11 961: Oct 25 18:08:17.536 GMT: %SYS-5-CONFIG_I: Configured from console by someone onvty1 (10.156.72.97) (6 Replies)
Discussion started by: jlaigo2
6 Replies

8. UNIX for Dummies Questions & Answers

RegEX help needed

Hi, Have to filter out string before the last underscore in the following input: UNIX_Solaris_59_KSH output: UNIX_Solaris_59 dummy one but :mad: Thanks & Regards, Sourabh Singh Khichi (4 Replies)
Discussion started by: skhichi
4 Replies

9. Shell Programming and Scripting

Perl, RegEx - Help me to understand the regex!

I am not a big expert in regex and have just little understanding of that language. Could you help me to understand the regular Perl expression: ^(?!if\b|else\b|while\b|)(?:+?\s+){1,6}(+\s*)\(*\) *?(?:^*;?+){0,10}\{ ------ This is regex to select functions from a C/C++ source and defined in... (2 Replies)
Discussion started by: alex_5161
2 Replies

10. Shell Programming and Scripting

Sendmail K command regex: adding exclusion/negative lookahead to regex -a@MATCH

I'm trying to get some exclusions into our sendmail regular expression for the K command. The following configuration & regex works: LOCAL_CONFIG # Kcheckaddress regex -a@MATCH +<@+?\.++?\.(us|info|to|br|bid|cn|ru) LOCAL_RULESETS SLocal_check_mail # check address against various regex... (0 Replies)
Discussion started by: RobbieTheK
0 Replies
thai/thwctype.h(3)						      libthai							thai/thwctype.h(3)

NAME
thai/thwctype.h - Thai wide-char character classifications. SYNOPSIS
Functions int th_wcistis (thwchar_t wc) Is the wide character convertible to a valid TIS-620 code? int th_wcisthai (thwchar_t wc) Is the wide character a Thai character? int th_wciseng (thwchar_t wc) Is the wide character an English character? int th_wcisthcons (thwchar_t wc) Is the wide character a Thai consonant? int th_wcisthvowel (thwchar_t wc) Is the wide character a Thai vowel? int th_wcisthtone (thwchar_t wc) Is the wide character a Thai tone mark? int th_wcisthdiac (thwchar_t wc) Is the wide character a Thai diacritic? int th_wcisthdigit (thwchar_t wc) Is the character a Thai digit? int th_wcisthpunct (thwchar_t wc) Is the character a Thai punctuation? int th_wcistaillesscons (thwchar_t wc) Is the wide character a Thai consonant that fits the x-height? int th_wcisovershootcons (thwchar_t wc) Is the wide character a Thai consonant with stem above ascender? int th_wcisundershootcons (thwchar_t wc) Is the wide character a Thai consonant with stem below baseline? int th_wcisundersplitcons (thwchar_t wc) Is the wide character a Thai consonant with split part below baseline? int th_wcisldvowel (thwchar_t wc) Is the wide character a Thai leading vowel? int th_wcisflvowel (thwchar_t wc) Is the wide character a Thai following vowel? int th_wcisupvowel (thwchar_t wc) Is the wide character a Thai upper vowel? int th_wcisblvowel (thwchar_t wc) Is the wide character a Thai below vowel? int th_wcchlevel (thwchar_t wc) Position for rendering: Detailed Description Thai wide-char character classifications. Function Documentation int th_wcchlevel (thwchar_twc) Position for rendering: o 3 = above/top o 2 = top o 1 = above o 0 = base o -1 = below int th_wcistis (thwchar_twc) Is the wide character convertible to a valid TIS-620 code? TIS-620 here means US-ASCII plus TIS-620 extension. Author Generated automatically by Doxygen for libthai from the source code. Version 0.1.14 Tue Jun 17 2014 thai/thwctype.h(3)
All times are GMT -4. The time now is 12:51 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy