Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

euckr(5) [osf1 man page]

eucKR(5)							File Formats Manual							  eucKR(5)

NAME
eucKR - A character encoding system (codeset) for Korean DESCRIPTION
EUC (Extended UNIX Code) is a codeset extended by AT&T Bell Laboratories for use in various countries in the world. Korean EUC is the EUC codeset for representing Korean data. The encoding of Korean EUC (eucKR) is currently identical to that of the DEC Korean codeset (see deckorean(5)). Codeset Conversion The following codeset converter pairs are available for converting Korean characters between eucKR and other encoding formats. Refer to iconv_intro(5) for an introduction to codeset conversion. For more information about the other codeset for which eucKR is the input or out- put, see the reference page specified in the list item. deckorean_eucKR, eucKR_deckorean Converting from and to the DEC Korean codeset: deckorean(5). Because DEC Korean is currently identical to Korean Extended UNIX Code, this converter is not useful. ISO-2022-KR_eucKR, eucKR_ISO-2022-KR Converting from and to the ISO 2022 Korean codeset: iso2022(5). Font Support for Korean EUC For both display devices and printers, the operating system supports the Korean EUC codeset through DEC Korean fonts. See deckorean(5) for information about Korean bitmap and PostScript fonts. SEE ALSO
Commands: locale(1) Others: ascii(5), deckorean(5), iconv_intro(5), i18n_intro(5), i18n_printing(5), l10n_intro(5), Korean(5) eucKR(5)

Check Out this Related Man Page

deckorean(5)							File Formats Manual						      deckorean(5)

NAME
deckorean - A character encoding system (codeset) for Korean DESCRIPTION
The DEC Korean (deckorean) codeset consists of the following character sets: ASCII KSC 5601-1987 For the symbols and ideographic characters defined in the KSC 5601-1987 character set, DEC Korean uses 2-byte data representation. For ASCII characters, DEC Korean uses single-byte 7-bit data representation; that is, the most significant bit (MSB) of the byte that repre- sents an ASCII character value is always set off. For more information on the ASCII character set, refer to ascii(5). KSC 5601-1987 Characters KSC 5601-1987 is a national standard that defines a primary set of graphic characters for Korean information interchange. The standard defines a character set with a total of 8224 characters that are arranged in a code table. The code table has 94 rows, numbered from 1 to 94. Each row has 94 columns, also numbered from 1 to 94. Different kinds of characters occupy different areas of the code table as follows: Special characters: 986 graphic symbols that reside in rows 1 to 12 Hangul characters: 2350 Korean (Hangul) characters that reside in rows 16 to 40 Hanja characters: 4888 Chinese characters that reside in rows 42 to 93 DEC Korean Encoding Values To differentiate KSC 5601-1987 codes from ASCII codes, the most significant bit (MSB) of both the first and the second byte of a KSC 5601 character value is always set on. The value of a KSC 5601 character can be determined from its row and column number as follows: 1st byte = A0 + Row number 2nd byte = A0 + Column number For example, if a character is positioned at the first column of the 36th row, its value is CA41, which is calulated as follows: 1st byte = A0(hex) + 36 = C4 (hex) 2nd byte = A0(hex) + 01 = A1 (hex) Codeset Conversion The following codeset converter pairs are available for converting Korean characters between deckorean and other encoding formats. Refer to iconv_intro(5) for an introduction to codeset conversion. For more information about the other codeset for which deckorean is the input or output, see the reference page specified in the list item. eucKR_deckorean, deckorean_eucKR Converting from and to Korean Extended UNIX Code: eucKR(5). UCS-2_deckorean, deckorean_UCS-2 Converting from and to UCS-2 format: Unicode(5). UCS-4_deckorean, deckorean_UCS-4 Converting from and to UCS-4 format: Unicode(5). UTF-8_deckorean, deckorean_UTF-8 Converting from and to UTF-8 format: Unicode(5). There are also codeset converters that convert between the Microsoft Korean code-page format (cp949) used on PC systems and UCS-2, UCS-4, and UTF-8 formats. Note that if the UCS-2, UCS-4, or UTF-8 output from these converters is then converted to DEC Korean, some Hangul char- acters may be lost. For more information, see code_page(5). DEC Korean Fonts The operating system provides Korean fonts for both screen display and printers. The following bitmap fonts are available in various sizes and typefaces for 75dpi and 100dpi display devices: Fonts in Gotic Family: -adecw-gotic-medium-r-normal--16-160-75-75-m-160-ksc5601.1987-1 -adecw-gotic-medium-r-normal--24-240-75-75-m-240-ksc5601.1987-1 -adecw-gotic-medium-r-normal--16-160-100-100-m-160-ksc5601.1987-1 -adecw-gotic-medium-r-normal--24-240-100-100-m-240-ksc5601.1987-1 Fonts in Myungcho Family: -adecw-myungcho-medium-r-normal--16-160-75-75-m-160-ksc5601.1987-1 -adecw-myungcho-medium-r-nor- mal--32-320-75-75-m-320-ksc5601.1987-1 -adecw-myungcho-medium-r-normal--24-240-75-75-m-240-ksc5601.1987-1 -adecw-myungcho-medium-r- normal--16-160-100-100-m-160-ksc5601.1987-1 -adecw-myungcho-medium-r-normal--24-240-100-100-m-240-ksc5601.1987-1 -adecw-myungcho- medium-r-normal--32-320-100-100-m-320-ksc5601.1987-1 Fonts in Screen Family: -adecw-screen-medium-r-normal--18-180-75-75-m-160-ksc5601.1987-1 -adecw-screen-medium-r-normal--24-240-75-75-m-240-ksc5601.1987-1 -adecw-screen-medium-r-normal--18-180-100-100-m-160-ksc5601.1987-1 -adecw-screen-medium-r-nor- mal--24-240-100-100-m-240-ksc5601.1987-1 -adecw-screen-medium-r-normal--18-180-100-100-m-160-ksc5601.1987-1 -adecw-screen-medium-r- normal--24-240-100-100-m-240-ksc5601.1987-1 For PostScript printers, the operating system provides only Munjo fonts. For general information on printing non-English text, refer to i18n_printing(5). SEE ALSO
Commands: locale(1) Others: ascii(5), code_page(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), l10n_intro(5), eucKR(5), Korean(5), Unicode(5) deckorean(5)
Man Page