I'd like to write a regex that transforms a German base form of a noun into one of its inflected forms, namely
I want to translate "Haus" to "Häuser"
This is what I've got:
where _Umlaut( x )_ is a function operating on the noun stem captured by $1 The function performs the following transformations on the noun stem to model the effect of a typical vowel change in the stem:
Replace last occurrence of vowel [x] according to (vowel change):
Return the modified stem
However, this function doesn't work with "au", it only works for words like Topf -> Töpfe
In the case of "Haus", it wrongly outputs "Häüser", so it makes an Umlaut out of both vowels, which I don't want.
Any suggestions on how to tell the function how to perform this step only on the first vowel are appreciated :-)
Thanks and kind regards,
Kat
Last edited by jim mcnamara; 03-19-2012 at 12:11 PM..
Hi all,
I'm writing a script that replaces a value in a file. The file is formatted as follows:
So, for this example, I'd like to replace the value for param_two. The value for param_two can be a one, or two-digit number. It replaces the value in file.cfg, and directs the... (9 Replies)
I am having trouble parsing rpm filenames in a shell script.. I found a snippet of perl code that will perform the task but I really don't have time to rewrite the entire script in perl. I cannot for the life of me convert this code into something sed-friendly:
if ($rpm =~ /(*)-(*)-(*)\.(.*)/)... (1 Reply)
Hi Experts,
I am using ls with regex in the below manner:
VAR="*.txt *.TXT"
ls -l $VAR
This is working fine if I have both txt and TXT extension files in my directory. But if any of them is not present, its throwing errors, that *.TXT file not found in the directory. So what am i missing... (6 Replies)
Hi,
I want to validate strings in perl, the string may contains characters from a-zA-Z0-9 and symbols +-_.:/\
To validate such a string I computed a regex
if ($string =~ m/^/) {
print "valid";
} else {
print "invalid";
}
but this regex also validates strings that contain... (8 Replies)
I have a file of protein sequences with headers (my source file). Based on a list of IDs (which are included in some of the headers), I'd like to print out only the specified sequences, with only the ID as header.
In other words, I'd like to search source.txt for the terms in IDs.txt, and print... (3 Replies)
Hi,
Could you please help me in writing a regex for the following requirement?
Let following be the string format:
abc.cdef.ghij.lm
I need to check between dots, there is atleast one character{a-z,A-Z,*}.
Eg: abc1.gt2.345j is valid, but not 123.abc.vff.gth because 123 should not be... (2 Replies)
I am looking for the proper regex to match the hostname "areagc11" of this log.... Any help would be awsome:)
Oct 25 11:08:18 areagc11 961: Oct 25 18:08:17.536 GMT: %SYS-5-CONFIG_I: Configured from console by someone onvty1 (10.156.72.97) (6 Replies)
Hi,
Have to filter out string before the last underscore in the following
input: UNIX_Solaris_59_KSH
output: UNIX_Solaris_59
dummy one but :mad:
Thanks & Regards,
Sourabh Singh Khichi (4 Replies)
I am not a big expert in regex and have just little understanding of that language.
Could you help me to understand the regular Perl expression:
^(?!if\b|else\b|while\b|)(?:+?\s+){1,6}(+\s*)\(*\) *?(?:^*;?+){0,10}\{
------
This is regex to select functions from a C/C++ source and defined in... (2 Replies)
I'm trying to get some exclusions into our sendmail regular expression for the K command. The following configuration & regex works:
LOCAL_CONFIG
#
Kcheckaddress regex -a@MATCH
+<@+?\.++?\.(us|info|to|br|bid|cn|ru)
LOCAL_RULESETS
SLocal_check_mail
# check address against various regex... (0 Replies)
Discussion started by: RobbieTheK
0 Replies
LEARN ABOUT CENTOS
th_wcisthdiac
thai/thwctype.h(3) libthai thai/thwctype.h(3)NAME
thai/thwctype.h -
Thai wide-char character classifications.
SYNOPSIS
Functions
int th_wcistis (thwchar_t wc)
Is the wide character convertible to a valid TIS-620 code?
int th_wcisthai (thwchar_t wc)
Is the wide character a Thai character?
int th_wciseng (thwchar_t wc)
Is the wide character an English character?
int th_wcisthcons (thwchar_t wc)
Is the wide character a Thai consonant?
int th_wcisthvowel (thwchar_t wc)
Is the wide character a Thai vowel?
int th_wcisthtone (thwchar_t wc)
Is the wide character a Thai tone mark?
int th_wcisthdiac (thwchar_t wc)
Is the wide character a Thai diacritic?
int th_wcisthdigit (thwchar_t wc)
Is the character a Thai digit?
int th_wcisthpunct (thwchar_t wc)
Is the character a Thai punctuation?
int th_wcistaillesscons (thwchar_t wc)
Is the wide character a Thai consonant that fits the x-height?
int th_wcisovershootcons (thwchar_t wc)
Is the wide character a Thai consonant with stem above ascender?
int th_wcisundershootcons (thwchar_t wc)
Is the wide character a Thai consonant with stem below baseline?
int th_wcisundersplitcons (thwchar_t wc)
Is the wide character a Thai consonant with split part below baseline?
int th_wcisldvowel (thwchar_t wc)
Is the wide character a Thai leading vowel?
int th_wcisflvowel (thwchar_t wc)
Is the wide character a Thai following vowel?
int th_wcisupvowel (thwchar_t wc)
Is the wide character a Thai upper vowel?
int th_wcisblvowel (thwchar_t wc)
Is the wide character a Thai below vowel?
int th_wcchlevel (thwchar_t wc)
Position for rendering:
Detailed Description
Thai wide-char character classifications.
Function Documentation
int th_wcchlevel (thwchar_twc)
Position for rendering:
o 3 = above/top
o 2 = top
o 1 = above
o 0 = base
o -1 = below
int th_wcistis (thwchar_twc)
Is the wide character convertible to a valid TIS-620 code? TIS-620 here means US-ASCII plus TIS-620 extension.
Author
Generated automatically by Doxygen for libthai from the source code.
Version 0.1.14 Tue Jun 17 2014 thai/thwctype.h(3)