GB2312(5) BSD File Formats Manual GB2312(5)NAME
gb2312 -- GB2312 encoding method for Chinese text
SYNOPSIS
ENCODING "GB2312"
DESCRIPTION
The GB2312 encoding implements GB 2312-1980, a PRC national standard for the encoding of simplified Chinese characters.
Multibyte characters in the GB2312 encoding can be one byte or two bytes long. GB 11383-1981 (ASCII) characters are represented by single
bytes in the range 0x00 to 0x7F. Simplified Chinese characters are represented by two bytes, both in the range 0xA1-0xFE.
SEE ALSO euc(5), gb18030(5), gbk(5)STANDARDS
The GB2312 encoding is believed to be compatible with GB 2312-1980. This standard has been superseded by GB 18030-2000, but is still in wide
use.
BSD November 7, 2003 BSD
Check Out this Related Man Page
Encode::CN(3pm) Perl Programmers Reference Guide Encode::CN(3pm)NAME
Encode::CN - China-based Chinese Encodings
SYNOPSIS
use Encode qw/encode decode/;
$euc_cn = encode("euc-cn", $utf8); # loads Encode::CN implicitly
$utf8 = decode("euc-cn", $euc_cn); # ditto
DESCRIPTION
This module implements China-based Chinese charset encodings. Encodings supported are as follows.
Canonical Alias Description
--------------------------------------------------------------------
euc-cn /euc.*cn$/i EUC (Extended Unix Character)
/cn.*euc$/i
/GB[-_ ]?2312(?:D.*$|$)/i (see below)
gb2312-raw The raw (low-bit) GB2312 character map
gb12345-raw Traditional chinese counterpart to
GB2312 (raw)
iso-ir-165 GB2312 + GB6345 + GB8565 + additions
MacChineseSimp GB2312 + Apple Additions
cp936 Code Page 936, also known as GBK
(Extended GuoBiao)
hz 7-bit escaped GB2312 encoding
--------------------------------------------------------------------
To find how to use this module in detail, see Encode.
NOTES
Due to size concerns, "GB 18030" (an extension to "GBK") is distributed separately on CPAN, under the name Encode::HanExtra. That module
also contains extra Taiwan-based encodings.
BUGS
When you see "charset=gb2312" on mails and web pages, they really mean "euc-cn" encodings. To fix that, "gb2312" is aliased to "euc-cn".
Use "gb2312-raw" when you really mean it.
The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium.
SEE ALSO
Encode
perl v5.16.2 2012-08-26 Encode::CN(3pm)
Hello,
I have installed Nagios 2.0b3 and running on Enterprise Linux ES4. I can view the Nagios home page and view the documentation but cannot get into any of the CGI pages. I am getting two different error messages. The first error message is that of Forbidden access. This is simply telling... (7 Replies)
Hi, new here and need some help. Sometimes my site is extremely slow, if when there aren't too many people on, whereas when there are over 300 online members the site may be very fast. We use CentOS, PHP 5.26. The server has 4GB and Plesk usually shows about 2 or 3 GB free.
I believe I can see... (4 Replies)
Hello,
I'm trying to figure out how to change the Character code of a text file in a script. Such as the ability to render an out put to Chinese Simple (GB2312) or other language character encodes. (1 Reply)
I had just added Japanese language as my secondary language with yast and I am still not able to view Japanese characters (they appear mangled)
I've copied two text files from windows to the SUSE Linux Enterprise Server 10 (i586), one with UTF and the other with ANSI. When viewed with cat and... (4 Replies)
Hello,
I am stuck... i dunno why does my test fail... any idea ?
#!/bin/bash
dos2unix info.txt
Distor=Distributeur
LINE=$(cat info.txt | sed -n 1p)
echo $LINE
echo $Distor
echo ""
echo "123-$LINE-123"
echo "123-$Distor-123"
if ; then
LINE2=$(cat info.txt | sed -n 2p)
echo $Distor... (14 Replies)
I am planning to run an automation , Could anyone try to help me to how to write an html in unix scripting so when I try to send email it should work especially with Bold and colors (6 Replies)
I have a file in my Unix ( SOLARIS ) with EBCDIC format...I want this file to read in ASCII OR unicode...Is it possible with UNIX to convert this file on ASCII OR UNICODE format from EBCDIC format?
I was searching through web and found only conversion table :(
Request Rejected
Below is... (16 Replies)
Hi all,
am running the following code on a RHEL 6.6 box to list which charsets are loaded and which are available:
#!/usr/bin/perl -w
use strict;
use Encode;
my @list = Encode->encodings();
my @all_encodings = Encode->encodings(":all");
print "@list\n\n";
print "@all_encodings\n";
... (3 Replies)
Hi,
I have been using ksh till now but was forced to use tcsh from last few days. I am getting a gvimrc error which says "set: Syntax error". What makes this pesky is that it is not showing which line number is the error from. How do I get the line number of the error?
Also, I am using the... (11 Replies)
I have some data like this:
4258092
TRXCODE a 19
CARDNBR a 10
PINFLAG a 6
FUISSUER a 12
PRODUCT a 24
STATE n 1
I want out put this format:
<?xml version="1.0" encoding="GB2312"?>
<convGrp>
<grpid>4258092</grpid>
<script>
<!]>
... (4 Replies)
We have a user who has about a 50% missed rate on spam detection. I'm wondering if his user prefs or something is preventing scanning of all messages?
SpamAssassin version 3.4.1, running on Perl version 5.20.3, sendmail Version 8.15.2
The contents of the user_prefs file:
# How many points... (2 Replies)
Hi Forum.
I have a unique problem that I'm hoping someone can assist me.
I'm generating a fixed width file and one of the output column (person_name at col. pos.#483 defined as string(36) sometimes contains french characters in the name and it causes the next column of data to shift to the... (10 Replies)