GB2312(5) BSD File Formats Manual GB2312(5)NAME
gb2312 -- GB2312 encoding method for Chinese text
SYNOPSIS
ENCODING "GB2312"
DESCRIPTION
The GB2312 encoding implements GB 2312-1980, a PRC national standard for the encoding of simplified Chinese characters.
Multibyte characters in the GB2312 encoding can be one byte or two bytes long. GB 11383-1981 (ASCII) characters are represented by single
bytes in the range 0x00 to 0x7F. Simplified Chinese characters are represented by two bytes, both in the range 0xA1-0xFE.
SEE ALSO euc(5), gb18030(5), gbk(5)STANDARDS
The GB2312 encoding is believed to be compatible with GB 2312-1980. This standard has been superseded by GB 18030-2000, but is still in wide
use.
BSD November 7, 2003 BSD
Check Out this Related Man Page
GB18030(5) BSD File Formats Manual GB18030(5)NAME
gb18030 -- GB 18030 encoding method for Chinese text
SYNOPSIS
ENCODING "GB18030"
DESCRIPTION
The GB18030 encoding implements GB 18030-2000, a PRC national standard for the encoding of Chinese characters. It is a superset of the older
GB 2312-1980 and GBK encodings, and incorporates Unicode's Unihan Extension A completely. It also provides code space for all Unicode 3.0
code points.
Multibyte characters in the GB18030 encoding can be one byte, two bytes, or four bytes long. There are a total of over 1.5 million code
positions.
GB 11383-1981 (ASCII) characters are represented by single bytes in the range 0x00 to 0x7F.
Chinese characters are represented as either two bytes or four bytes. Characters that are represented by two bytes begin with a byte in the
range 0x81-0xFE and end with a byte either in the range 0x40-0x7E or 0x80-0xFE.
Characters that are represented by four bytes begin with a byte in the range 0x81-0xFE, have a second byte in the range 0x30-0x39, a third
byte in the range 0x81-0xFE and a fourth byte in the range 0x30-0x39.
SEE ALSO euc(5), gb2312(5), gbk(5), utf8(5)
Chinese National Standard GB 18030-2000: Information Technology -- Chinese ideograms coded character set for information interchange --
Extension for the basic set, March 2000.
The Unicode Standard, Version 3.0, The Unicode Consortium, 2000.
STANDARDS
The GB18030 encoding is believed to be compatible with GB 18030-2000.
BSD August 10, 2003 BSD
Hello,
I have installed Nagios 2.0b3 and running on Enterprise Linux ES4. I can view the Nagios home page and view the documentation but cannot get into any of the CGI pages. I am getting two different error messages. The first error message is that of Forbidden access. This is simply telling... (7 Replies)
Hi, new here and need some help. Sometimes my site is extremely slow, if when there aren't too many people on, whereas when there are over 300 online members the site may be very fast. We use CentOS, PHP 5.26. The server has 4GB and Plesk usually shows about 2 or 3 GB free.
I believe I can see... (4 Replies)
Hello,
I'm trying to figure out how to change the Character code of a text file in a script. Such as the ability to render an out put to Chinese Simple (GB2312) or other language character encodes. (1 Reply)
I had just added Japanese language as my secondary language with yast and I am still not able to view Japanese characters (they appear mangled)
I've copied two text files from windows to the SUSE Linux Enterprise Server 10 (i586), one with UTF and the other with ANSI. When viewed with cat and... (4 Replies)
Hello,
I am stuck... i dunno why does my test fail... any idea ?
#!/bin/bash
dos2unix info.txt
Distor=Distributeur
LINE=$(cat info.txt | sed -n 1p)
echo $LINE
echo $Distor
echo ""
echo "123-$LINE-123"
echo "123-$Distor-123"
if ; then
LINE2=$(cat info.txt | sed -n 2p)
echo $Distor... (14 Replies)
I am planning to run an automation , Could anyone try to help me to how to write an html in unix scripting so when I try to send email it should work especially with Bold and colors (6 Replies)
I have a file in my Unix ( SOLARIS ) with EBCDIC format...I want this file to read in ASCII OR unicode...Is it possible with UNIX to convert this file on ASCII OR UNICODE format from EBCDIC format?
I was searching through web and found only conversion table :(
Request Rejected
Below is... (16 Replies)
Hi all,
am running the following code on a RHEL 6.6 box to list which charsets are loaded and which are available:
#!/usr/bin/perl -w
use strict;
use Encode;
my @list = Encode->encodings();
my @all_encodings = Encode->encodings(":all");
print "@list\n\n";
print "@all_encodings\n";
... (3 Replies)
Hi,
I have been using ksh till now but was forced to use tcsh from last few days. I am getting a gvimrc error which says "set: Syntax error". What makes this pesky is that it is not showing which line number is the error from. How do I get the line number of the error?
Also, I am using the... (11 Replies)
I have some data like this:
4258092
TRXCODE a 19
CARDNBR a 10
PINFLAG a 6
FUISSUER a 12
PRODUCT a 24
STATE n 1
I want out put this format:
<?xml version="1.0" encoding="GB2312"?>
<convGrp>
<grpid>4258092</grpid>
<script>
<!]>
... (4 Replies)
We have a user who has about a 50% missed rate on spam detection. I'm wondering if his user prefs or something is preventing scanning of all messages?
SpamAssassin version 3.4.1, running on Perl version 5.20.3, sendmail Version 8.15.2
The contents of the user_prefs file:
# How many points... (2 Replies)
Hi Forum.
I have a unique problem that I'm hoping someone can assist me.
I'm generating a fixed width file and one of the output column (person_name at col. pos.#483 defined as string(36) sometimes contains french characters in the name and it causes the next column of data to shift to the... (10 Replies)