03-30-2011
Sybase says en_GB is like Sybase iso_1:
New Functionality in Adaptive Server Enterprise Version 12.5.x
and similar to iso 8859-1, often called or similar to latin-1, an one-character code page or font like ASCII but with the upper 128 loaded with western european support, umlau-A, diaresis-o and such.
UTF-8 is an variable width encoding of wide unicode such that the ASCII characters match in one character.
So, until the high bit comes on, you are pretty good, perhaps give or take a few symbol shifts like # versus pound-sterling L with slashes.
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi,
I need to split a string, either using awk or cut or basic unix commands (no programming) , with a multibyte charectar as a delimeter.
Ex:
abcd-efgh-ijkl
split by -efgh- to get two segments abcd & ijkl
Is it possible?
Thanks
A.H.S (1 Reply)
Discussion started by: azmathshaikh
1 Replies
2. Shell Programming and Scripting
Hello,
Is there any UNIX utility/command/executable that will convert mutlibyte characters to standard single byte ASCII characters in a given file?
and
Is there any UNIX utility/command/executable that will recognize multibyte characters in a given file name?
The typical multibyte... (8 Replies)
Discussion started by: jerardfjay
8 Replies
3. Shell Programming and Scripting
Hi! I'm trying to separate text into sentences, like this:
$pattern = "/(|]|,)**/";
preg_match_all($pattern, $text, $matches);
This works fine unless the text contains multibyte characters, like "åäö". How can I make this work with these exotic characters? (2 Replies)
Discussion started by: Ilja
2 Replies
4. Shell Programming and Scripting
Hi! I'm trying to separate text into sentences, like this:
$pattern = "/(|]|,)**/";
preg_match_all($pattern, $text, $matches);
This works fine unless the text contains multibyte characters, like "åäö". How can I make this work with these exotic characters?
An example phrase that doesn't match:... (1 Reply)
Discussion started by: Ilja
1 Replies
5. Shell Programming and Scripting
Hi,
I have one doubt over sftp.
I am trnasferring a file from server1 to server2 using sftp.
The size of the file shows different in file 1 and file2 after sftp even though it shows same number of byte transferred. I don't understand the problem.
For example:
I have file1 having size... (3 Replies)
Discussion started by: siba.s.nayak
3 Replies
6. Programming
Compiling xpp (The X Printing Panel) on SL6 (RHEL6 essentially):
xpp.cxx: In constructor ‘printFiles::printFiles(int, char**, int&)’:
xpp.cxx:200: error: invalid conversion from ‘const char*’ to ‘char*’
The same error with all c++ constructors - gcc 4.4.4.
If anyone can throw any light on... (8 Replies)
Discussion started by: GSO
8 Replies
7. Shell Programming and Scripting
Hello all
Can anyone help me to solve the below issue
I want to take user input with space separated .The number of inputs can be variable
like if user inputs
1 2 3 4
ouput will stored in as array a where i=4 and I can retreive the value like a =3
any thoughts how to do it
... (2 Replies)
Discussion started by: Pratik4891
2 Replies
8. Shell Programming and Scripting
Hello,
I have two directories - prev and current . They both have same multiple subdirectories and files. Now the current directory can have some updated files and some new files added that is not in prev. I want to find the list of file names that differ. I am doing this because i can not... (2 Replies)
Discussion started by: jakSun8
2 Replies
9. Shell Programming and Scripting
I am using Solaris8, userA's shell '/usr/ace/prog/sdshell', AppuserB's shell '/bin/ksh'.
serverT:/home/userA>LC_CTYPE=iso_8859_1; export LC_CTYPE; vtemp='userA variable'; export vtemp
serverT:/home/userA>echo "LC_CTYPE=$LC_CTYPE, vtemp=$vtemp";
LC_CTYPE=iso_8859_1, vtemp=userA... (4 Replies)
Discussion started by: kchinnam
4 Replies
10. Shell Programming and Scripting
Hi
I have a requirement to insert a dot "." after a position in each line, say 110th position.
For which, I have written the below command.
cat filename | sed 's/./&\./110' > new_filename
The code is working fine, but when we have multi byte (2 or 3) characters in the input file, the... (3 Replies)
Discussion started by: tostay2003
3 Replies
LEARN ABOUT REDHAT
mbrlen
MBRLEN(3) Linux Programmer's Manual MBRLEN(3)
NAME
mbrlen - determine number of bytes in next multibyte character
SYNOPSIS
#include <wchar.h>
size_t mbrlen(const char *s, size_t n, mbstate_t *ps);
DESCRIPTION
The mbrlen function inspects at most n bytes of the multibyte string starting at s and extracts the next complete multibyte character. It
updates the shift state *ps. If the multibyte character is not the null wide character, it returns the number of bytes that were consumed
from s. If the multibyte character is the null wide character, it resets the shift state *ps to the initial state and returns 0.
If the n bytes starting at s do not contain a complete multibyte character, mbrlen returns (size_t)(-2). This can happen even if n >=
MB_CUR_MAX, if the multibyte string contains redundant shift sequences.
If the multibyte string starting at s contains an invalid multibyte sequence before the next complete character, mbrlen returns
(size_t)(-1) and sets errno to EILSEQ. In this case, the effects on *ps are undefined.
If ps is a NULL pointer, a static anonymous state only known to the mbrlen function is used instead.
RETURN VALUE
The mbrlen function returns the number of bytes parsed from the multibyte sequence starting at s, if a non-null wide character was recog-
nized. It returns 0, if a null wide character was recognized. It returns (size_t)(-1) and sets errno to EILSEQ, if an invalid multibyte
sequence was encountered. It returns (size_t)(-2) if it couldn't parse a complete multibyte character, meaning that n should be increased.
CONFORMING TO
ISO/ANSI C, UNIX98
SEE ALSO
mbrtowc(3)
NOTES
The behaviour of mbrlen depends on the LC_CTYPE category of the current locale.
GNU
1999-07-25 MBRLEN(3)