eucJP(5)          Standards, Environments, and Macros          eucJP(5)
NAME
eucJP, eucjp - mapping between Japanese EUC and character sets
DESCRIPTION
In SunOS and JFP, Japanese EUC (Extended UNIX Code) is the character code system used in the "ja" locale to represent multi-byte
characters, including Japanese characters. This manual page shows the mapping between Japanese EUC and the character sets.
The following table maps Japanese EUC codes to character sets (SS2 stands for 0x8e, and SS3 stands for 0x8f).
Table 1
Japanese EUC              character set
0x00 - 0x1f               JIS X 0201-1976 function character set
0x20                      JIS X 0201-1976 space character
0x21 - 0x7e               JIS X 0201-1976 figure character set for roman character
0x7f                      JIS X 0201-1976 erase character
0x80 - 0x9f               ISO 6429 C1 control character (except for 0x8e, 0x8f)
SS2 0xa1 - 0xdf           JIS X 0201-1976 figure character set for katakana
                          (except for an area of undefined character E/0 - F/14)
SS3 0xa1a1 - 0xa1fe       JIS X 0212-1990 (1 ku 1 ten - 1 ku 94 ten)
SS3 0xa2a1 - 0xa2fe       JIS X 0212-1990 (2 ku 1 ten - 2 ku 94 ten)
    :                     :
    :                     :
SS3 0xf3a1 - 0xf3fe       JIS X 0212-1990 (83 ku 1 ten - 83 ku 94 ten)
SS3 0xf4a1 - 0xf4fe       JIS X 0212-1990 (84 ku 1 ten - 84 ku 94 ten)
0xa1a1 - 0xa1fe           JIS X 0208-1990 (1 ku 1 ten - 1 ku 94 ten)
0xa2a1 - 0xa2fe           JIS X 0208-1990 (2 ku 1 ten - 2 ku 94 ten)
    :                     :
    :                     :
0xf3a1 - 0xf3fe           JIS X 0208-1990 (83 ku 1 ten - 83 ku 94 ten)
0xf4a1 - 0xf4fe           JIS X 0208-1990 (84 ku 1 ten - 84 ku 94 ten)
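The two-byte rows of Table 1 follow a regular pattern: each byte is the ku (row) or ten (cell) number plus 0xa0, so 1 ku 1 ten is 0xa1a1 and 84 ku 94 ten is 0xf4fe. The following is a minimal Python sketch of that arithmetic. The function names euc_to_kuten and kuten_to_euc are hypothetical and are not part of SunOS or JFP.

```python
def euc_to_kuten(first, second):
    """Convert the two graphic bytes of an EUC code to (ku, ten).

    For JIS X 0212 codes, pass the two bytes that follow SS3 (0x8f).
    """
    if not (0xA1 <= first <= 0xFE and 0xA1 <= second <= 0xFE):
        raise ValueError("bytes are outside the 0xa1 - 0xfe range")
    return first - 0xA0, second - 0xA0


def kuten_to_euc(ku, ten):
    """Convert a (ku, ten) pair in the range 1 - 94 to two EUC bytes."""
    if not (1 <= ku <= 94 and 1 <= ten <= 94):
        raise ValueError("ku and ten must each be in 1 - 94")
    return bytes((ku + 0xA0, ten + 0xA0))


print(euc_to_kuten(0xF4, 0xFE))   # (84, 94)
print(kuten_to_euc(1, 1).hex())   # a1a1
```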
Each character set corresponds to a Japanese EUC code set number as follows.
Table 2
character set (coding character set)                     Japanese EUC extended code set no.
JIS X 0201 figure character set for roman character      0
JIS X 0208-1990                                          1
JIS X 0201 figure character set for katakana             2
JIS X 0212-1990                                          3
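As an illustration of how the code set numbers relate to the byte layout, the following Python sketch splits an EUC byte string into characters and labels each one with its code set number. It assumes that the lead byte alone selects the code set (SS2 for code set 2, SS3 for code set 3, 0xa1 - 0xfe for code set 1, bytes below 0x80 for code set 0). The name split_euc_jp is hypothetical, and C1 control bytes are reported with None because they are assigned to no code set in the table.

```python
SS2, SS3 = 0x8E, 0x8F


def split_euc_jp(data):
    """Yield (code_set, raw_bytes) for each character in an EUC byte string."""
    i = 0
    while i < len(data):
        lead = data[i]
        if lead < 0x80:
            code_set, length = 0, 1          # JIS X 0201 roman / controls
        elif lead == SS2:
            code_set, length = 2, 2          # SS2 + one katakana byte
        elif lead == SS3:
            code_set, length = 3, 3          # SS3 + two JIS X 0212 bytes
        elif 0xA1 <= lead <= 0xFE:
            code_set, length = 1, 2          # two JIS X 0208 bytes
        else:
            code_set, length = None, 1       # C1 control or unused byte
        chunk = data[i:i + length]
        if len(chunk) < length:
            raise ValueError("truncated character at offset %d" % i)
        # Bytes after SS2/SS3 or after a code set 1 lead byte must be graphic.
        if any(not 0xA1 <= b <= 0xFE for b in chunk[1:]):
            raise ValueError("invalid trailing byte at offset %d" % i)
        yield code_set, chunk
        i += length


for code_set, raw in split_euc_jp(b"A\x8e\xb1\xa4\xa2\x8f\xb0\xa1"):
    print(code_set, raw.hex())
```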
NOTES
In Japanese EUC, the areas below have special meanings.
Table 3
Japanese EUC             area of character set       meaning
SS3 0xf3a1 - 0xf4fe      JIS X 0212-1990             Vendor Defined Character
                         83 ku - 84 ku               (IBM extension character not
                                                     included in JIS X 0212-1990)
SS3 0xf5a1 - 0xfefe                                  User Defined Character
                                                     11 ku - 20 ku
0xada1 - 0xadfe          JIS X 0208-1990 13 ku       Vendor Defined Character
                                                     (special symbols)
0xf5a1 - 0xfefe                                      User Defined Character
                                                     1 ku - 10 ku
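The areas in Table 3 can be recognized from the first graphic byte alone, because each area covers whole ku (rows). The following Python sketch shows one way to do that, under the assumption that the input is a single, already separated EUC character. The function name special_area and the returned strings are hypothetical.

```python
def special_area(ch):
    """Return a description of the Table 3 area containing `ch`, or None.

    `ch` is one EUC character: two bytes, or SS3 (0x8f) plus two bytes.
    """
    if len(ch) == 3 and ch[0] == 0x8F:
        is_ss3, graphic = True, ch[1:]
    elif len(ch) == 2:
        is_ss3, graphic = False, ch
    else:
        return None
    if any(not 0xA1 <= b <= 0xFE for b in graphic):
        return None
    ku = graphic[0] - 0xA0                   # row number, 1 - 94
    if is_ss3:
        if 83 <= ku <= 84:
            return "vendor defined (IBM extension)"
        if 85 <= ku <= 94:
            return "user defined (11 ku - 20 ku)"
    else:
        if ku == 13:
            return "vendor defined (special symbols)"
        if 85 <= ku <= 94:
            return "user defined (1 ku - 10 ku)"
    return None


print(special_area(b"\xad\xa1"))       # vendor defined (special symbols)
print(special_area(b"\x8f\xf5\xa1"))   # user defined (11 ku - 20 ku)
```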
SEE ALSO
PCK(5)
SunOS 5.10                    12 June 2000                    eucJP(5)
jistoeuc(1)                    User Commands                    jistoeuc(1)
NAME
jistoeuc, jistosj, euctojis, euctosj, sjtojis, sjtoeuc - Code conversion between JIS, PC kanji, and Japanese EUC
SYNOPSIS
jistoeuc [-8] [-U] [filename...]
jistosj [-8] [-U] [filename...]
euctojis [-8] [-U] [filename...]
euctosj [-U] [filename...]
sjtojis [-8] [-U] [filename...]
sjtoeuc [-U] [filename...]
AVAILABILITY
SUNWjfpu
DESCRIPTION
For Japanese language handling, the jistoeuc family provides conversion between different code standards. Each command, invoked as
command [filename...], performs the specified conversion on the contents of the named input files and writes the result to the standard output.
If no filename is given, the command reads and converts characters from the standard input.
jistoeuc converts JIS to Japanese EUC
jistosj converts JIS to PC kanji
euctojis converts Japanese EUC to JIS
euctosj converts Japanese EUC to PC kanji
sjtojis converts PC kanji to JIS
sjtoeuc converts PC kanji to Japanese EUC
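For readers without access to these commands, the same kinds of conversion can be illustrated with Python's standard codecs euc_jp, shift_jis, and iso2022_jp. This is only a sketch of the idea: it is not the implementation of the jistoeuc family, the helper name convert is hypothetical, and Python's mapping tables may differ from the Solaris ones for vendor defined and user defined characters.

```python
def convert(data, src, dst):
    """Decode `data` from codec `src` and re-encode it in codec `dst`."""
    return data.decode(src).encode(dst)


euc = "日本語".encode("euc_jp")                 # Japanese EUC input

sjis = convert(euc, "euc_jp", "shift_jis")     # roughly what euctosj does
jis = convert(euc, "euc_jp", "iso2022_jp")     # roughly what euctojis does
back = convert(jis, "iso2022_jp", "euc_jp")    # roughly what jistoeuc does

print(euc.hex(), sjis.hex(), jis)
print(back == euc)
```

The round trip through JIS returns the original EUC bytes here because all three characters belong to JIS X 0208.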
OPTIONS
-8 With this option specified, the commands jistoeuc, jistosj, sjtojis, and sjtoeuc can support JIS X 0201 (Half-Size Katakana).
This 8-bit JIS code does not use ISO Shift-In and Shift-Out escape sequences.
-U The output is not buffered (the default is buffered output).
SEE ALSO
iconv(1), iconv_ja(5)
NOTES
In the escape sequences below, E denotes the escape character (ESC, 0x1b).
This command can handle shift-in escape sequences for the following character sets:
    JIS X 0208 shift-in escape - E$B, E$(B, E$@
    JIS X 0212 shift-in escape - E$(D
    JIS X 0201 Roman shift-in escape - E(J, E(H
    ASCII shift-in escape - E(B
euctojis and sjtojis can handle shift-in escape sequences for the following character sets:
    JIS X 0208 shift-in - E$B
    JIS X 0212 shift-in - E$(D (except when sjtojis command is specified)
    JIS X 0201 Roman shift-in - E(J
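To make the escape sequences concrete, the following Python sketch scans a JIS byte stream and reports which character set each recognized sequence selects. The table is taken from the first list above, with the escape character written as the byte 0x1b. The names DESIGNATIONS and find_designations are hypothetical, and the sketch does not reproduce how the commands themselves parse their input.

```python
ESC = b"\x1b"

# Escape sequences from the list above, keyed by their raw bytes.
DESIGNATIONS = {
    b"\x1b$B": "JIS X 0208",
    b"\x1b$(B": "JIS X 0208",
    b"\x1b$@": "JIS X 0208",
    b"\x1b$(D": "JIS X 0212",
    b"\x1b(J": "JIS X 0201 Roman",
    b"\x1b(H": "JIS X 0201 Roman",
    b"\x1b(B": "ASCII",
}


def find_designations(data):
    """Return [(offset, character_set)] for each recognized escape sequence."""
    found = []
    i = data.find(ESC)
    while i != -1:
        for seq, name in DESIGNATIONS.items():
            if data.startswith(seq, i):
                found.append((i, name))
                break
        i = data.find(ESC, i + 1)
    return found


sample = b"\x1b$BF|K\\8l\x1b(B abc \x1b(Jxyz"
for offset, name in find_designations(sample):
    print(offset, name)
```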
jistoeuc does not check whether or not each code in the input file is correct. Conversion with PC kanji is not based on the TOG Japanese
Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. The iconv(1) utility provides
these functions. See iconv(1) and iconv_ja(5) for more information.
BUGS
If the input contains JIS X 0212 characters, jistosj and euctosj cannot convert it correctly. euctosj, sjtoeuc, jistosj, and sjtojis
convert correctly only when the input is limited to JIS X 0208 1 ku - 84 ku.
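Given these limitations, it may be useful to check Japanese EUC input before passing it to euctosj. The following Python sketch reports whether a byte string avoids JIS X 0212 (SS3) characters and JIS X 0208 rows above 84 ku. The function name is_safe_for_euctosj is hypothetical, and the sketch assumes that ASCII and SS2 katakana bytes are not affected by the limitation, which this page does not state explicitly.

```python
def is_safe_for_euctosj(data):
    """Return True if EUC `data` stays within JIS X 0208 1 ku - 84 ku.

    Assumption: single-byte characters and SS2 (0x8e) katakana are accepted.
    """
    i = 0
    while i < len(data):
        lead = data[i]
        if lead == 0x8F:                 # SS3 introduces JIS X 0212
            return False
        if lead == 0x8E:                 # SS2 + one katakana byte
            i += 2
        elif 0xA1 <= lead <= 0xFE:       # JIS X 0208 lead byte
            if lead - 0xA0 > 84:         # rows 85 - 94 are user defined
                return False
            i += 2
        else:
            i += 1
    return True


print(is_safe_for_euctosj(b"\xc6\xfc\xcb\xdc"))   # True
print(is_safe_for_euctosj(b"\x8f\xb0\xa1"))       # False
```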
SunOS 5.10 10 Jan 2003 jistoeuc(1)