Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

iso-2022-jp(5) [osf1 man page]

iso2022jp(5)							File Formats Manual						      iso2022jp(5)

NAME
iso2022jp, iso-2022-jp, ISO-2022-JP - A character encoding system (codeset) for Japanese DESCRIPTION
The ISO-2022-JP codeset consists of the following character sets: ASCII For information on the ASCII character set, refer to ascii(5). JIS X0201-1976 Only the Roman letters in this character set are included. For details, refer to deckanji(5). JIS X0208-1978 JIS X0208-1983 JIS X0208-1983 is a revised version of JIS X0208-1978 and remapped some characters of JIS X0208-1978 to other positions. Before a character is used, its corresponding character set must be designated. In ISO-2022-JP, the designation of a character set is done by using an escape sequence as follows: --------------------------------------------------------------- Escape Sequence Character Set --------------------------------------------------------------- ESC ( B ACSII ESC ( J JIS X0201-1976 (left-hand part) ESC $ @ JIS X0208-1978 ESC $ ( 0 User-defined characters (This range of char- acters is proprietary to Compaq.) ESC $ B JIS X0208-1983 --------------------------------------------------------------- It is assumed that the starting code of a line is ASCII (including CR alone and LF alone, but not including the combination CRLF). If there are JIS X0208 characters on a line, there must be a switch to ASCII or to the left-hand part of JIS X0201 (Roman letters) before the end of the line (in other words, before the CRLF, or carriage return and line feed). For example, if a line starts with the ASCII character 9, followed by the JIS X0208-1978 character at row 16 column 1, the line is encoded as follows: 39h ESC $ @ 30h 21h .... ESC ( B .... CRLF If a line starts with the JIS X0208-1978 character at row 16 column 1, followed by the ASCII character 9, then the line is encoded as fol- lows: ESC $ @ 30h 21h ESC ( B 39h .... CRLF Once a character set is designated, there is no need to redesignate the character set if the adjacent character belongs to the same charac- ter set. For example, the following practice is not recommended: ESC $ B .... ESC $ B .... Currently, the operating system supports the ISO 2022-JP codeset only through codeset converters; there is no direct support through locales or fonts. For your options in printing and displaying Japanese characters, refer to i18n_printing(5) and Japanese(5). Codeset Conversion The following codeset converter pairs are available for converting Japanese characters between ISO-2022-JP and other encoding formats. In converter names, the string ISO-2022-JP indicates that user-defined characters are not included in the conversion jwhile the string ISO-2022-JPext indicates that user-defined characters are included in the convertsion. Refer to iconv_intro(5) for an introduction to code- set conversion. For more information about the other codeset for which ISO-2022-JP or ISO-2022-JPext is the input or output, see the refer- ence page specified in the list item. deckanji_ISO-2022-JP or deckanji_ISO-2022-JPext, ISO-2022-JP_deckanji or ISO-2022-JPext_deckanji Converting from and to the DEC Kanji codeset: deckanji(5). eucJP_ISO-2022-JP or eucJP_ISO-2022-JPext, ISO-2022-JP_eucJP or ISO-2022-JPext_eucJP Converting from and to Japanese Extended UNIX Code: eucJP(5). sdeckanji_ISO-2022-JP or sdeckanji_ISO-2022-JPext, ISO-2022-JP_sdeck- anji or ISO-2022-JPext_sdeckanji Converting from and to the Super DEC Kanji codeset: sdeckanji(5). SJIS_ISO-2022-JP or SJIS_ISO-2022-JPext, ISO-2022-JP_SJIS or ISO-2022-JPext_SJIS Converting from and to the Shift JIS codeset: SJIS(5). Note that SJIS encoding is equivalent to the Microsoft code-page format used on PC systems. Therefore, you can use these converters to convert Japanese characters between the ISO 2022-JP and PC code-page formats. See code_page(5) for information on how the operat- ing system supports PC code pages. SEE ALSO
Commands: locale(1) Others: ascii(5), code_page(5), deckanji(5), eucJP(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), iso2022(5), Japanese(5), jiskanji(5), l10n_intro(5), sdeckanji(5), shiftjis(5) iso2022jp(5)
Man Page