Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

sdeckanji(5) [osf1 man page]

sdeckanji(5)							File Formats Manual						      sdeckanji(5)

NAME
sdeckanji - A character encoding system (codeset) for Japanese DESCRIPTION
The Super DEC Kanji codeset extends the DEC Kanji codeset to support the CS2 (JIS Katakana) and CS3 (JIS X0212) character sets that are also included in the Japanese EUC codeset. Super DEC Kanji is therefore a superset of both DEC Kanji and Japanese EUC and can handle data encoded in either DEC Kanji or Japanese EUC. The codeset was implemented to ease the transition from DEC Kanji, which is proprietary encod- ing, to Japanese EUC, which is encoding specified by an industry standard. In addition to supporting standard Japanese character sets, Super DEC Kanji provides three areas for defining User-Defined Characters (UDC). The UDC areas are as follows: ------------------------------------------------------------ Area Usage Row Range Number of Char- Code Range acters ------------------------------------------------------------ JIS X0208 85-94 940 F5A1-FEFE JIS X0212 78-94 1598 SS3 [EEA1-FEFE] UDC 1-94 8836 A121-FE7E ------------------------------------------------------------ The representation of ASCII/JIS Roman, JIS X0208 and User-Defined Characters (UDC) in Super DEC Kanji is the same as that in DEC Kanji. The representation of CS2 and CS3 in Super DEC Kanji is the same as that in Japanese EUC. Codeset Conversion The following codeset converter pairs are available for converting Japanese characters between sdeckanji and other encoding formats. Refer to iconv_intro(5) for an introduction to codeset conversion. For more information about the other codeset for which sdeckanji is the input or output, see the reference page specified in the list item. deckanji_sdeckanji, sdeckanji_deckanji Converting from and to the DEC Kanji codeset: deckanji(5). eucJP_sdeckanji, sdeckanji_eucJP Converting from and to Japanese Extended UNIX Code: eucJP(5). ISO-2022-JP_sdeckanji, sdeckanji_ISO-2022JP Converting from and to the ISO 2022-JP codeset: iso2022jp(5). ISO-2022-JPext_sdeckanji, sdeckanji_ISO-2022JPext Converting from and to the ISO 2022-JP Extended codeset: iso2022jp(5). JIS7_sdeckanji or jiskanji7_sdeckanji, sdeckanji_JIS7 or sdeckanji_jiskanji7 Converting from and to 7-bit JIS Kanji code: jiskanji(5). SJIS_sdeckanji, sdeckanji_SJIS Converting from and to the Shift JIS codeset: SJIS(5). Shift JIS encoding is identical to encoding used in the Microsoft Japanese code page for PC systems. Therefore, you can use these converters to convert Japanese characters between Super DEC Kanji and PC code-page format. For information on how the operating sys- tem supports PC code pages, see code_page(5). UCS-2_sdeckanji, sdeckanji_UCS-2 Converting from and to UCS-2 format: Unicode(5). UCS-4_sdeckanji, sdeckanji_UCS-4 Converting from and to UCS-4 format: Unicode(5). UTF-8_sdeckanji, sdeckanji_UTF-8 Converting from and to UTF--8 format: Unicode(5). Font Support for Super DEC Kanji For display devices, the operating system supports sdeckanji code by converting it to deckanji and using fonts available for deckanji. Refer to i18n_printing(5) and Japanese(5) for information about supporting print jobs that contain Japanese characters. SEE ALSO
Commands: locale(1) Others: ascii(5), code_page(5), deckanji(5), eucJP(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), iso2022jp(5), Japanese(5), jiskanji(5), l10n_intro(5), shiftjis(5), Unicode(5) sdeckanji(5)

Check Out this Related Man Page

iso2022jp(5)							File Formats Manual						      iso2022jp(5)

NAME
iso2022jp, iso-2022-jp, ISO-2022-JP - A character encoding system (codeset) for Japanese DESCRIPTION
The ISO-2022-JP codeset consists of the following character sets: ASCII For information on the ASCII character set, refer to ascii(5). JIS X0201-1976 Only the Roman letters in this character set are included. For details, refer to deckanji(5). JIS X0208-1978 JIS X0208-1983 JIS X0208-1983 is a revised version of JIS X0208-1978 and remapped some characters of JIS X0208-1978 to other positions. Before a character is used, its corresponding character set must be designated. In ISO-2022-JP, the designation of a character set is done by using an escape sequence as follows: --------------------------------------------------------------- Escape Sequence Character Set --------------------------------------------------------------- ESC ( B ACSII ESC ( J JIS X0201-1976 (left-hand part) ESC $ @ JIS X0208-1978 ESC $ ( 0 User-defined characters (This range of char- acters is proprietary to Compaq.) ESC $ B JIS X0208-1983 --------------------------------------------------------------- It is assumed that the starting code of a line is ASCII (including CR alone and LF alone, but not including the combination CRLF). If there are JIS X0208 characters on a line, there must be a switch to ASCII or to the left-hand part of JIS X0201 (Roman letters) before the end of the line (in other words, before the CRLF, or carriage return and line feed). For example, if a line starts with the ASCII character 9, followed by the JIS X0208-1978 character at row 16 column 1, the line is encoded as follows: 39h ESC $ @ 30h 21h .... ESC ( B .... CRLF If a line starts with the JIS X0208-1978 character at row 16 column 1, followed by the ASCII character 9, then the line is encoded as fol- lows: ESC $ @ 30h 21h ESC ( B 39h .... CRLF Once a character set is designated, there is no need to redesignate the character set if the adjacent character belongs to the same charac- ter set. For example, the following practice is not recommended: ESC $ B .... ESC $ B .... Currently, the operating system supports the ISO 2022-JP codeset only through codeset converters; there is no direct support through locales or fonts. For your options in printing and displaying Japanese characters, refer to i18n_printing(5) and Japanese(5). Codeset Conversion The following codeset converter pairs are available for converting Japanese characters between ISO-2022-JP and other encoding formats. In converter names, the string ISO-2022-JP indicates that user-defined characters are not included in the conversion jwhile the string ISO-2022-JPext indicates that user-defined characters are included in the convertsion. Refer to iconv_intro(5) for an introduction to code- set conversion. For more information about the other codeset for which ISO-2022-JP or ISO-2022-JPext is the input or output, see the refer- ence page specified in the list item. deckanji_ISO-2022-JP or deckanji_ISO-2022-JPext, ISO-2022-JP_deckanji or ISO-2022-JPext_deckanji Converting from and to the DEC Kanji codeset: deckanji(5). eucJP_ISO-2022-JP or eucJP_ISO-2022-JPext, ISO-2022-JP_eucJP or ISO-2022-JPext_eucJP Converting from and to Japanese Extended UNIX Code: eucJP(5). sdeckanji_ISO-2022-JP or sdeckanji_ISO-2022-JPext, ISO-2022-JP_sdeck- anji or ISO-2022-JPext_sdeckanji Converting from and to the Super DEC Kanji codeset: sdeckanji(5). SJIS_ISO-2022-JP or SJIS_ISO-2022-JPext, ISO-2022-JP_SJIS or ISO-2022-JPext_SJIS Converting from and to the Shift JIS codeset: SJIS(5). Note that SJIS encoding is equivalent to the Microsoft code-page format used on PC systems. Therefore, you can use these converters to convert Japanese characters between the ISO 2022-JP and PC code-page formats. See code_page(5) for information on how the operat- ing system supports PC code pages. SEE ALSO
Commands: locale(1) Others: ascii(5), code_page(5), deckanji(5), eucJP(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), iso2022(5), Japanese(5), jiskanji(5), l10n_intro(5), sdeckanji(5), shiftjis(5) iso2022jp(5)
Man Page