Encode::CN(3pm) Perl Programmers Reference Guide Encode::CN(3pm)NAME
Encode::CN - China-based Chinese Encodings
SYNOPSIS
use Encode qw/encode decode/;
$euc_cn = encode("euc-cn", $utf8); # loads Encode::CN implicitly
$utf8 = decode("euc-cn", $euc_cn); # ditto
DESCRIPTION
This module implements China-based Chinese charset encodings. Encodings supported are as follows.
Canonical Alias Description
--------------------------------------------------------------------
euc-cn /euc.*cn$/i EUC (Extended Unix Character)
/cn.*euc$/i
/GB[-_ ]?2312(?:D.*$|$)/i (see below)
gb2312-raw The raw (low-bit) GB2312 character map
gb12345-raw Traditional chinese counterpart to
GB2312 (raw)
iso-ir-165 GB2312 + GB6345 + GB8565 + additions
MacChineseSimp GB2312 + Apple Additions
cp936 Code Page 936, also known as GBK
(Extended GuoBiao)
hz 7-bit escaped GB2312 encoding
--------------------------------------------------------------------
To find how to use this module in detail, see Encode.
NOTES
Due to size concerns, "GB 18030" (an extension to "GBK") is distributed separately on CPAN, under the name Encode::HanExtra. That module
also contains extra Taiwan-based encodings.
BUGS
When you see "charset=gb2312" on mails and web pages, they really mean "euc-cn" encodings. To fix that, "gb2312" is aliased to "euc-cn".
Use "gb2312-raw" when you really mean it.
The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium.
SEE ALSO
Encode
perl v5.16.2 2012-08-26 Encode::CN(3pm)
Check Out this Related Man Page
Encode::CN(3perl) Perl Programmers Reference Guide Encode::CN(3perl)NAME
Encode::CN - China-based Chinese Encodings
SYNOPSIS
use Encode qw/encode decode/;
$euc_cn = encode("euc-cn", $utf8); # loads Encode::CN implicitly
$utf8 = decode("euc-cn", $euc_cn); # ditto
DESCRIPTION
This module implements China-based Chinese charset encodings. Encodings supported are as follows.
Canonical Alias Description
--------------------------------------------------------------------
euc-cn /euc.*cn$/i EUC (Extended Unix Character)
/cn.*euc$/i
/GB[-_ ]?2312(?:D.*$|$)/i (see below)
gb2312-raw The raw (low-bit) GB2312 character map
gb12345-raw Traditional chinese counterpart to
GB2312 (raw)
iso-ir-165 GB2312 + GB6345 + GB8565 + additions
MacChineseSimp GB2312 + Apple Additions
cp936 Code Page 936, also known as GBK
(Extended GuoBiao)
hz 7-bit escaped GB2312 encoding
--------------------------------------------------------------------
To find how to use this module in detail, see Encode.
NOTES
Due to size concerns, "GB 18030" (an extension to "GBK") is distributed separately on CPAN, under the name Encode::HanExtra. That module
also contains extra Taiwan-based encodings.
BUGS
When you see "charset=gb2312" on mails and web pages, they really mean "euc-cn" encodings. To fix that, "gb2312" is aliased to "euc-cn".
Use "gb2312-raw" when you really mean it.
The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium.
SEE ALSO
Encode
perl v5.14.2 2010-12-30 Encode::CN(3perl)
Hi,
I need to read one chinese char using lex. I tried using "." ( period ) for pattern matching but in vain.
Could anyone suggest me how do i proceeed.
Sample pgm: to read a chinese char in single quotes.
%{
#include <locale.h>
%}
%%
\'.\' printf("SUCCESS\n");
. ... (0 Replies)
Hello,everyone.I"m from China and a newbie to unix.
I'm glad to know you and now I have installed the linux system.But you know,in China,most of people are using Windows , to some extent , some people who use linux are learning computer science or other something .
And so do I,I seldom use the... (2 Replies)
I have a burning desire to learn how to type in chinese (and after, probably with success, in another languages). How to do this, all symbols are needed, and methods of input also. And a map of chinese keyboar layout will be helpful.
I want very much study program. (3 Replies)
Hello,
I'm trying to figure out how to change the Character code of a text file in a script. Such as the ability to render an out put to Chinese Simple (GB2312) or other language character encodes. (1 Reply)
Trying to load a specific Traditional Chinese Character on an HP UX platform for a Taiwan database. Data is converting to ?. Database has existed since 2008 and this is the first issue I've seen where a user is unable to save the multi-byte character in the database. I'm thinking this is may be... (0 Replies)
At present we are using one application , in which they are loading some files.
the files are some times a mix of chinese and arabic.
Is there any way to encode these literals and do the loading.
Rgds
Rj
---------- Post updated at 04:54 AM ---------- Previous update was at 04:47 AM... (0 Replies)