Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

gbk(5) [osf1 man page]

GBK(5)								File Formats Manual							    GBK(5)

NAME
GBK, gbk - A character encoding system (codeset) for Simplified Chinese DESCRIPTION
The GBK character set is an extension to the GB 2312-80 character set. (The "K" in "GBK" is the first sound in the Chinese word "Kuo Zhan," which means "extension.") GBK includes all the Hanzi characters specified by the ISO 10646-1:1993 standard (characters also known as the GB 13000.1.93 character set) that are not included in GB 2312-80. GBK is therefore defined as a normative annex of GB13000.1-93. GBK Value Ranges and Code Points The GBK codeset is divided into five levels, as follows: ------------------------------------------------------------ Level Encoding Range Code Points Characters ------------------------------------------------------------ GBK/1 0xA1A1-0xA9FE 846 717 GBK/2 0xB0A1-0xF7FE 6,768 6,763 GBK/3 0x8140-0xA0FE 6,080 6,080 GBK/4 0xAA40-0xFE40 8,160 8,160 GBK/5 0xA840-0xA9A0 192 166 ------------------------------------------------------------ In addition, GBK includes code points for user-defined characters, as follows: ----------------------------- Encoding Range Code Points ----------------------------- 0xAAA1-0xAFFE 564 0xF8A1-0xFEFE 658 0xA140-0xA7A0 672 ----------------------------- GBK therefore provides a total of 23,940 code points, 21,886 of which are assigned. Each row in the GBK code table consists of 190 characters. ASCII characters, which are single-byte characters, are defined in the range 0x21-0x7E. Encoding ranges for two-byte characters are as follows: Encoding range for the first byte: 0x81-0xFE Encoding ranges for the second byte: 0x40-0x7E and 0x80-0xFE Note In terms of character-to-code allocation, the sub-range for GB2321-80 characters (0xA1A1-0xFEFE) in GBK is the same encoding range defined for these characters in Extended UNIX Code (EUC). GBK is therefore backward compatible with Chinese EUC encoding as well as forward compat- ible with the encoding as defined by ISO 10646-1:1993. GBK is the standard character set and encoding used in the Simplified Chinese version of Windows 95. Codeset Converters for GBK The following codeset converters are available for GBK: GBK_UCS-2 GBK_UCS-4 GBK_UTF-8 UCS-2_GBK UCS-4_GBK UTF-8_GBK See iconv_intro(5) for more information about codeset converters and Unicode(5) for information about the UCS-2, UCS-4, and UTF-8 encoding formats. Fonts for GBK The operating system provides the following TrueType fonts for GBK: -huatian-fangsong-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0 -huatian- fangsong-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1 -huatian-fangsong-medium-r-normal--0-0-0-0-c-0-gbk-1 -huatian-fangsong-medium-r-nor- mal--0-0-0-0-m-0-iso8859-1 -huatian-heiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0 -huatian-heiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1 -huatian-heiti- medium-r-normal--0-0-0-0-c-0-gbk-1 -huatian-heiti-medium-r-normal--0-0-0-0-m-0-iso8859-1 -huatian-kaiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0 -huatian-kaiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1 -huatian-kaiti- medium-r-normal--0-0-0-0-c-0-gbk-1 -huatian-kaiti-medium-r-normal--0-0-0-0-m-0-iso8859-1 -huatian-songti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0 -huatian-songti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1 -huatian- songti-medium-r-normal--0-0-0-0-c-0-gbk-1 -huatian-songti-medium-r-normal--0-0-0-0-m-0-iso8859-1 These fonts can be used for printing only with Chinese text printers. The SongTi fonts are the default screen fonts for the GBK codeset. SEE ALSO
Commands: locale(1) Others: ascii(5), big5(5), Chinese(5), dechanyu(5), dechanzi(5), eucTW(5), i18n_intro(5), i18n_printing(5), l10n_intro(5), sbig5(5), tele- code(5) GBK(5)
Man Page