You are right: non-greedy matching takes care of a few cases. Still, what i said about regexps being unable to parse still stands. For instance, your perl-program would have problems with:
I admit, this (and some more cases i could cite) are fringe. Maybe the thread-O/P will never encounter any of these. But again, it is - in a strict sense - impossible to overcome all of these problems (some of them, yes, but not all) because a regexp engine cannot work as a parser.
I hope this helps.
bakunin
These 2 Users Gave Thanks to bakunin For This Post:
hi, i am on aix. i used smitty to remove a user..
but then found that its directory still exists....
so i have to remove the directory manually...
am i doing it the right way? (2 Replies)
hello,
Sometimes I need to remove all the files except one or more.I mean, there are 90 files and I want to remove 88 of them. how can i do that?is it possible to tell the "rm" command not to remove specified files? (4 Replies)
My file has varied width references:
width=10%
style=width:5%
width:1506%
width:99.58%
so I'm trying clear all the width calls with one procedure:
's/width= *%//' and 's/width=*%//'but neither is working. (6 Replies)
not sure what is this but any can help me delete this ^I
cat -A file.txt
CLAS^I^I|890^I|7,10,12,341,305,308,29,54^M$
LCLS^I^I|891^I|7,10,12,341,305,308,29,54^M$
MURB^I^I|892^I|7,10,12,341,305,308,29,54^M$
LATI^I^I|893^I|7,10,12,341,305,308,29,54^M$
i want to remove the ^I^I... (2 Replies)
Hi all,
I want to remove the remove bracket sign ( ) and put in the separate column I also want to remove the repeated entry like in first row in below input (PA156) is repeated
ESR1 (PA156) leflunomide (PA450192) (PA156) leflunomide (PA450192)
CHST3 (PA26503) docetaxel... (2 Replies)
Hi guys,
I need to write a script so that when i execute the "rm" command, the file mentioned need to be copied to other folder and then be deleted. this should be done in back ground. can you please help me out?? (1 Reply)
Had increased FS system size (sample_lv) on particular disks hdisk189 hdisk190 in a shared FS
but unfortunately given addnl size occupies the space on other disks hdisk78 hdisk40 too
In case, need to remove the addnl lv size occupied on hdisk78 hdisk40. How to achieve it. Pls advice.
... (3 Replies)
Discussion started by: ksgnathan
3 Replies
9. Post Here to Contact Site Administrators and Moderators
In this thread: /shell-programming-and-scripting/255687-organizing-text-file-capital-names-capital-word-capital-word.html (sorry i cant use links)
that is not an example, those are real students names with real student login id's for the college i am attending and i am on that list. Please... (3 Replies)
The bash below executes and does find all the .bam files in each R_2019 folder. However set -x shows that the .bam extension only gets removed from one .bam file in each folder (appears to be the last in each). Why is it not removing the extension from each (this is $SAMPLE)? Thank you :).
set... (4 Replies)
Discussion started by: cmccabe
4 Replies
LEARN ABOUT DEBIAN
marc::charset::code
MARC::Charset::Code(3pm) User Contributed Perl Documentation MARC::Charset::Code(3pm)NAME
MARC::Charset::Code - represents a MARC-8/UTF-8 mapping
SYNOPSIS DESCRIPTION
Each mapping from a MARC-8 value to a UTF-8 value is represented by a MARC::Charset::Code object in a MARC::Charset::Table.
METHODS
new()
The constructor.
name()
A descriptive name for the code point.
marc()
A string representing the MARC-8 bytes codes.
ucs()
A string representing the UCS code point in hex.
charset_code()
The MARC-8 character set code.
is_combining()
Returns true/false to tell if the character is a combining character.
to_string()
A stringified version of the object suitable for pretty printing.
char_value()
Returns the unicode character. Essentially just a helper around ucs().
marc_value()
The string representing the MARC-8 encoding.
charset_name()
Returns the name of the character set, instead of the code.
to_string()
Returns a stringified version of the object.
marc8_hash_code()
Returns a hash code for this Code object for looking up the object using MARC8. First portion is the character set code and the second is
the MARC-8 value.
utf8_hash_code()
Returns a hash code for uniquely identifying a Code by it's UCS value.
default_charset_group
Returns 'G0' or 'G1' indicating where the character is typicalling used in the MARC-8 environment.
get_marc8_escape
Returns an escape sequence to move to the Code from another marc-8 character set.
charset_value
Returns the charset value, not the hex sequence.
perl v5.12.4 2010-03-29 MARC::Charset::Code(3pm)