NAME
euctoibmj, ibmjtoeuc - code conversion between Japanese EUC and IBM-Japanese

SYNOPSIS
euctoibmj [-t] [-u code] [-U] [filename...]
ibmjtoeuc [-u code] [-U] [filename...]

AVAILABILITY
DESCRIPTION
euctoibmj converts the contents of the specified filenames from ASCII/Japanese EUC to EBCDIC/IBM-Japanese. ibmjtoeuc converts the contents of the specified filenames from EBCDIC/IBM-Japanese to ASCII/Japanese EUC. Both commands write the converted output to stdout. If no filename is given, input is read from the standard input.

For Japanese language handling, the euctoibmj/ibmjtoeuc pair of commands provides conversion only between these two code standards. Code conversion among Japanese EUC, JIS, and PC kanji is supported by another set of commands, the jistoeuc(1) family, or by iconv(1).

OPTIONS
-u code
     Characters in one code set that have no corresponding character in the other are mapped to code. For euctoibmj, code is a four-digit hexadecimal HOST CODE of IBM Japanese; for ibmjtoeuc, it is a four-digit JIS Ku-Ten code. Without this option, such characters are mapped to HOST CODE 4040 (for euctoibmj) or JIS Ku-Ten code 0101 (for ibmjtoeuc).

-U
     The output is not buffered. The default is buffered output.

-t
     euctoibmj translates Half-Size Katakana (Code Set 2) in Japanese EUC to the corresponding characters in Code Set 1 before conversion. Without this option, Code Set 2 characters in Japanese EUC are treated as illegal characters.

ENVIRONMENT VARIABLES
The environment variables LC_CTYPE and LANG control character classification throughout these commands. For euctoibmj and ibmjtoeuc to work correctly, one or both of these environment variables must be set to ja or an equivalent locale. On entry to these commands, the environment variables are checked in the following order: LC_CTYPE, then LANG. When a valid value is found, the remaining environment variables for character classification are ignored.

FILES
/usr/lib/jcodetables/ibmj-euc
     Code conversion table for IBM Japanese.

SEE ALSO
iconv(1), jistoeuc(1), iconv_ja(5)

DIAGNOSTICS
unexpected data encountered in input.
     An illegal character code was found in the input file.

BUGS
The ASCII/EBCDIC conversion tables are taken from the 256-character standard in the CACM, Nov. 1968. The conversion, while less blessed as a standard, corresponds better to certain IBM print train conventions. There is no universal solution.

The Japanese EUC/IBM Japanese conversion table is based on the IBM Kanji codebook (4th edition, September 1987), JIS X 0201, and JIS X 0208-1983.

If the JIS X 0212 character set is given as input, euctoibmj cannot convert it correctly.

SunOS 5.10                          10 Jan 2003                          euctoibmj(1)

eucJP(5)						Standards, Environments, and Macros						  eucJP(5)

NAME
eucJP, eucjp - map between Japanese EUC and character set

DESCRIPTION
In SunOS and JFP, Japanese EUC (Extended UNIX Code) is used in the "ja" locale as the character code system for multi-byte languages, including Japanese characters. This manual page shows the mapping between Japanese EUC and character sets.

The following table maps Japanese EUC to character sets (SS2 stands for 0x8e, and SS3 stands for 0x8f).

Table 1

Japanese EUC              Character set
------------------------  ----------------------------------------------------
0x00 - 0x1f               JIS X 0201-1976 function character set
0x20                      JIS X 0201-1976 space character
0x21 - 0x7e               JIS X 0201-1976 figure character set for roman
                          characters
0x7f                      JIS X 0201-1976 erase character
0x80 - 0x9f               ISO 6429 C1 control characters
                          (except for 0x8e, 0x8f)
SS2 0xa1 - 0xdf           JIS X 0201-1976 figure character set for katakana
                          (except for the undefined character area
                          E/0 - F/14)
SS3 0xa1a1 - 0xa1fe       JIS X 0212-1990 (1 ku 1 ten - 1 ku 94 ten)
SS3 0xa2a1 - 0xa2fe       JIS X 0212-1990 (2 ku 1 ten - 2 ku 94 ten)
        :                         :
SS3 0xf3a1 - 0xf3fe       JIS X 0212-1990 (83 ku 1 ten - 83 ku 94 ten)
SS3 0xf4a1 - 0xf4fe       JIS X 0212-1990 (84 ku 1 ten - 84 ku 94 ten)
0xa1a1 - 0xa1fe           JIS X 0208-1990 (1 ku 1 ten - 1 ku 94 ten)
0xa2a1 - 0xa2fe           JIS X 0208-1990 (2 ku 1 ten - 2 ku 94 ten)
        :                         :
0xf3a1 - 0xf3fe           JIS X 0208-1990 (83 ku 1 ten - 83 ku 94 ten)
0xf4a1 - 0xf4fe           JIS X 0208-1990 (84 ku 1 ten - 84 ku 94 ten)

Each character set corresponds to a Japanese EUC code set number as follows.

Table 2

Character set (coded character set)                    Japanese EUC extended
                                                       code set no.
-----------------------------------------------------  ---------------------
JIS X 0201 figure character set for roman characters   0
JIS X 0208-1990                                        1
JIS X 0201 figure character set for katakana           2
JIS X 0212-1990                                        3

NOTES
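The ku/ten ranges listed in Table 1 are consistent with adding 0xa0 to the ku number and to the ten number to form the two bytes of a code set 1 character, with SS3 prefixed for code set 3. The following Python sketch illustrates that arithmetic and the byte-range classification of Tables 1 and 2. The function names kuten_to_euc and classify are hypothetical and are not part of SunOS or JFP; the sketch checks byte ranges only and does not verify that a position is actually assigned a character. Treating every byte below 0x80 as code set 0 is a simplifying assumption.

```python
SS2 = 0x8E  # single shift 2: introduces code set 2 (katakana)
SS3 = 0x8F  # single shift 3: introduces code set 3 (JIS X 0212-1990)


def kuten_to_euc(ku, ten, code_set=1):
    """Return the Japanese EUC bytes for a ku/ten position (each 1..94)."""
    if not (1 <= ku <= 94 and 1 <= ten <= 94):
        raise ValueError("ku and ten must each be in 1..94")
    body = bytes([0xA0 + ku, 0xA0 + ten])
    if code_set == 1:
        return body
    if code_set == 3:
        return bytes([SS3]) + body
    raise ValueError("ku/ten addressing applies only to code sets 1 and 3")


def classify(seq):
    """Return (code_set, ku, ten) for the bytes of one character.

    ku and ten are None for code sets 0 and 2, which are single-byte sets.
    """
    first = seq[0]
    # Assumption: any single byte below 0x80 is reported as code set 0.
    if len(seq) == 1 and first < 0x80:
        return (0, None, None)
    if len(seq) == 2 and first == SS2 and 0xA1 <= seq[1] <= 0xDF:
        return (2, None, None)
    if len(seq) == 3 and first == SS3:
        code_set, body = 3, seq[1:]
    elif len(seq) == 2:
        code_set, body = 1, seq
    else:
        raise ValueError("not a single Japanese EUC character")
    if not all(0xA1 <= b <= 0xFE for b in body):
        raise ValueError("byte outside 0xa1 - 0xfe")
    return (code_set, body[0] - 0xA0, body[1] - 0xA0)
```

For example, kuten_to_euc(1, 1) yields the bytes 0xa1 0xa1, the first entry of the JIS X 0208-1990 rows in Table 1, and classify(b"\x8f\xf4\xfe") reports code set 3 at 84 ku 94 ten.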
The following Japanese EUC areas have special meanings.

Table 3

Japanese EUC              Area of character set     Meaning
------------------------  ------------------------  --------------------------
SS3 0xf3a1 - 0xf4fe       JIS X 0212-1990           Vendor Defined Character
                          83 ku - 84 ku             (IBM extension characters
                                                    not included in
                                                    JIS X 0212-1990)
SS3 0xf5a1 - 0xfefe                                 User Defined Character
                                                    11 ku - 20 ku
0xada1 - 0xadfe           JIS X 0208-1990           Vendor Defined Character
                          13 ku                     (special symbols)
0xf5a1 - 0xfefe                                     User Defined Character
                                                    1 ku - 10 ku

SEE ALSO
PCK(5)

SunOS 5.10                          12 June 2000                          eucJP(5)
