Hi, I'm working on gathering information stored in .txt files. The format of the data within the .txt files is shown in the picture uploaded with this post. Sections like the one pictured are repeated (with different data, same format) many times within each .txt file but each section is of data... (4 Replies)
Hi All,
I have the following scenario in my environment.
Developers copy files from Windows to a Linux box. Sometimes they forget to run the dos2unix utility on the copied file, and the script then fails during execution. Most of the failures are due to the dos2unix format... (7 Replies)
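A minimal sketch of how such files could be flagged before a script runs, assuming the problem is Windows-style CRLF line endings. The function names `has_crlf` and `file_needs_dos2unix` are made up for illustration and are not taken from the thread.

```python
def has_crlf(data: bytes) -> bool:
    """Return True if the byte string contains a Windows (CRLF) line ending."""
    return b"\r\n" in data


def file_needs_dos2unix(path: str) -> bool:
    """Check a file on disk; binary mode keeps Python from translating line endings."""
    with open(path, "rb") as handle:
        return has_crlf(handle.read())
```

A wrapper could call `file_needs_dos2unix` on each copied script and run dos2unix only on the files it flags.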
Hi, the Perl snippet below replaces any three-letter string at the beginning with a specified two-letter string, but what if I want to modify only certain characters? For example:
ABC - AB
CAB - AB
AAA - No Modifications
1AB - AB
AB8 - AB
Whatever coming before or after of AB only have... (2 Replies)
Hello,
Splitting a sentence on the full stop, question mark, or exclamation mark is a common device. The question mark and exclamation mark do not pose much of a problem, but the full stop as a sentence delimiter raises certain issues because of its varied use:
just to name a few.
Standard parsers... (9 Replies)
I am interested in finding a regex that matches a word in second position on a line. The word in question is या
I tried the following Perl expressions, but they did not work:
] या
or
^\W या
But both gave null results.
A sample file is given below:
देना या सौंपना=delegate
तह जमना या... (8 Replies)
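One possible way to test for या as the second word, sketched in Python rather than Perl. It assumes that words are separated by whitespace and that the first word starts at the beginning of the line. The name `has_ya_second` is hypothetical.

```python
import re

# First word (\S+), whitespace, then या followed by whitespace or end of line.
SECOND_WORD = re.compile(r"^\S+\s+या(?=\s|$)")


def has_ya_second(line: str) -> bool:
    """Return True when या is the second whitespace-separated word on the line."""
    return SECOND_WORD.search(line) is not None
```

The lookahead keeps the pattern from matching longer words that merely begin with या.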
Gurus,
I have a data file with a fixed number of columns, say 101. One description column contains foreign characters, and sometimes those special characters are translated into newline characters, which causes the process to fail.
I am using the following awk... (4 Replies)
Hello,
I have a dictionary which I am building for the Open Source Community. The data structure is as follows:
HEADWORD=PARTOFSPEECH=ENGLISH MEANING
as shown in the example below
अ=m=Prefix signifying negation.
अँहँ=ind=Interjection expressing disapprobation.
अं=int=An interjection... (2 Replies)
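A small sketch of reading one entry in the HEADWORD=PARTOFSPEECH=ENGLISH MEANING layout. It assumes the first two `=` signs are the field separators and that any later `=` belongs to the meaning. `Entry` and `parse_entry` are illustrative names, not part of the original post.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    headword: str
    part_of_speech: str
    meaning: str


def parse_entry(line: str) -> Entry:
    """Split a dictionary line into its three fields."""
    # maxsplit=2 keeps any '=' inside the English meaning intact.
    parts = line.rstrip("\n").split("=", 2)
    if len(parts) != 3:
        raise ValueError(f"expected 3 fields, got {len(parts)}: {line!r}")
    return Entry(*parts)
```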
I am working on Sindhi, which uses a Perso-Arabic script. Since it shares its Unicode block with over 400 other languages, the database quite often contains unwanted (illegal) characters.
I have identified the character set of Sindhi which is given below:
For clarity's sake, each... (8 Replies)
Hi
In a file I have strings on multiple lines, like below:
<?=test.getObjectName("L", "testTBL","D") ?>
<?=test.getObjectName("L", "testTBL","testDB", "D") ?>
I want to use regex to search for the pattern "<?=test.getObjectName...?>"
If the parenthesis has 3 parameters then return 2nd... (5 Replies)
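A rough sketch of the stated case only, written in Python. It assumes every parameter is a double-quoted string with no escaped quotes inside. Because the post is cut off, the sketch handles just the 3-parameter rule and returns `None` for any other count. The function names are hypothetical.

```python
import re
from typing import List, Optional

# Capture everything between the parentheses of the getObjectName call.
CALL = re.compile(r'<\?=test\.getObjectName\((.*?)\)\s*\?>')
# Each parameter is assumed to be a plain double-quoted string.
PARAM = re.compile(r'"([^"]*)"')


def parameters(line: str) -> Optional[List[str]]:
    """Return the quoted parameters of the call, or None if the line has no call."""
    match = CALL.search(line)
    if match is None:
        return None
    return PARAM.findall(match.group(1))


def second_of_three(line: str) -> Optional[str]:
    """Return the 2nd parameter when the call has exactly 3 parameters."""
    params = parameters(line)
    if params is not None and len(params) == 3:
        return params[1]
    return None
```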
Hi,
I need some guidance with understanding the Perl script below. I am not the author of the script, and the author has not left any documentation. I suppose it is meant to be 'easy' if you're a Perl or regex guru. I am having a problem understanding what regex to use. The script does... (3 Replies)
Discussion started by: newbie_01
LEARN ABOUT PHP
mb_regex_set_options
MB_REGEX_SET_OPTIONS(3) 1 MB_REGEX_SET_OPTIONS(3)
mb_regex_set_options - Set/Get the default options for mbregex functions
SYNOPSIS
string mb_regex_set_options ([string $options = mb_regex_set_options()])
DESCRIPTION
Sets the default options described by $options for multibyte regex functions.
PARAMETERS
o $options
- The options to set. This is a string where each character is an option. To set a mode, the mode character must be the last
character in the string. Only one mode can be set, but it can be combined with multiple options.
Regex options
+-------+----------------------------------+
|Option | Meaning                          |
+-------+----------------------------------+
|   i   | Ambiguity match on               |
|   x   | Enables extended pattern form    |
|   m   | '.' matches with newlines        |
|   s   | '^' -> '\A', '$' -> '\Z'         |
|   p   | Same as both the m and s options |
|   l   | Finds longest matches            |
|   n   | Ignores empty matches            |
|   e   | eval(3) resulting code           |
+-------+----------------------------------+
Regex syntax modes
+-----+----------------------------+
|Mode | Meaning                    |
+-----+----------------------------+
|  j  | Java (Sun java.util.regex) |
|  u  | GNU regex                  |
|  g  | grep                       |
|  c  | Emacs                      |
|  r  | Ruby                       |
|  z  | Perl                       |
|  b  | POSIX Basic regex          |
|  d  | POSIX Extended regex       |
+-----+----------------------------+
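The option-string rule described under PARAMETERS can be pictured with a short Python sketch. This is one reading of the documented rule (any number of option characters, at most one mode character, mode last), not PHP's actual parser, which may be more lenient. The helper `split_option_string` is hypothetical.

```python
from typing import List, Optional, Tuple

OPTION_CHARS = set("ixmsplne")  # characters listed in the "Regex options" table
MODE_CHARS = set("jugcrzbd")    # characters listed in the "Regex syntax modes" table


def split_option_string(spec: str) -> Tuple[List[str], Optional[str]]:
    """Split an option string into (option characters, mode character or None)."""
    options: List[str] = []
    mode: Optional[str] = None
    for position, char in enumerate(spec):
        if char in MODE_CHARS:
            # Documented rule: a single mode character, placed last.
            if position != len(spec) - 1:
                raise ValueError("the mode character must come last")
            mode = char
        elif char in OPTION_CHARS:
            options.append(char)
        else:
            raise ValueError(f"unknown option character: {char!r}")
    return options, mode
```

Under this reading, a string such as "msr" combines the m and s options with the Ruby syntax mode, while "ix" sets two options and leaves the mode unchanged.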
RETURN VALUES
The previous options. If $options is omitted, it returns the string that describes the current options.
SEE ALSO
mb_split(3), mb_ereg(3), mb_eregi(3).
PHP Documentation Group MB_REGEX_SET_OPTIONS(3)