Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

sjtojis(1) [sunos man page]

jistoeuc(1)							   User Commands						       jistoeuc(1)

NAME
jistoeuc, jistosj, euctojis, euctosj, sjtojis, sjtoeuc - Code conversion between JIS, PC kanji, and Japanese EUC SYNOPSIS
jistoeuc [-8] [-U] [filename...] jistosj [-8] [-U] [filename...] euctojis [-8] [-U] [filename...] euctosj [-U] [filename...] sjtojis [-8] [-U] [filename...] sjtoeuc [-U] [filename...] AVAILABILITY
SUNWjfpu DESCRIPTION
For Japanese language handling, the jistoeuc family provides conversion between different code standards. command [ filename ...] does the specified conversion on the contents of the input filenames and writes it to stdout. If filename is not given, it reads and converts characters from the standard input. jistoeuc converts JIS to Japanese EUC jistosj converts JIS to PC kanji euctojis converts Japanese EUC to JIS euctosj converts Japanese EUC to PC kanji sjtojis converts PC kanji to JIS sjtoeuc converts PC kanji to Japanese EUC OPTIONS
-8 With this option specified, the commands jistoeuc, jistosj, sjtojis, and sjtoeuc, can support JIS X 0201 (Half-Size Katakana). This 8-bit JIS code does not use ISO Shift-In and Shift-Out escape sequences. -U The output is not buffered (The default is buffered output). SEE ALSO
iconv(1), iconv_ja(5) NOTES
This command can handle shift-in escape sequences for the following character sets: JIS X 0208 shift-in escape - E$B, E$(B, E$@ JIS X 0212 shift-in escape - E$(D JIS X 0201 Roman shift-in escape - E(J, E(H ASCII shift-in escape - E(B euctojis and sjtojis can handle shift-in escape sequences for the following character sets: JIS X 0208 shift-in - E$B JIS X 0212 shift-in - E$(D (except when sjtojis command is specified) JIS X 0201 Roman shift-in - E(J jistoeuc does not check whether or not each code in the input file is correct. Conversion with PC kanji is not based on TOG Japanese Ven- dors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. The iconv(1) utility provides these functions. See iconv(1) and iconv_ja(5) for more information. BUGS
If JIS X 0212 character set is specified as input, jistosj and euctosj can not support the conversion correctly. euctosj, sjtoeuc, jis- tosj, and sjtojis can support conversion correctly only if JIS X 0208 1 ku - 84 ku is specified as input. SunOS 5.10 10 Jan 2003 jistoeuc(1)

Check Out this Related Man Page

euctoibmj(1)							   User Commands						      euctoibmj(1)

NAME
euctoibmj, ibmjtoeuc - Code conversion between Japanese EUC and IBM-Japanese SYNOPSIS
euctoibmj [-t] [-u code] [-U] [filename...] ibmjtoeuc [-u code] [-U] [filename...] AVAILABILITY
SUNWjfpu DESCRIPTION
euctoibmj converts the contents of the specified filenames from ASCII/ Japanese EUC to EBCDIC/IBM-Japanese. ibmjtoeuc converts the con- tents of the specified filenames from EBCDIC/IBM-Japanese to ASCII/ Japanese EUC. The both commands write the resultant code to stdout. If filename is not given, input characters are read from the standard input. For Japanese language handling, the euctoibmj/ibmjtoeucj pair of commands provide conversion only between the two code standards. Code con- version among Japanese EUC, JIS, and PC kanji are supported by another set of commands, jistoeuc(1) family or iconv(1). OPTIONS
-u code With this option specified, characters in one code set that do not have corresponding characters in the other are mapped to the code given in four-digit hexadecimal HOST CODE of IBM Japanese (for euctoibmj) or in four-digit JIS Ku-Ten code (for ibmjtoeuc). Without this option, such characters are mapped to HOST CODE 4040 (for euctoibmj) or JIS Ku-Ten code 0101 (for ibmjtoeuc). -U The output is not buffered (The default is buffered output). -t With this option specified, euctoibmj translates Half-Size Katakana (Code Set 2) in Japanese EUC to the corresponding characters in Code Set 1 prior to conversion. Without this option, Code Set 2 characters in Japanese EUC are processed to the illegal charac- ter. ENVIRONMENT VARIABLES
The environment variables LC_CTYPE and LANG control the character classification throughout these commands. For euctoibmj and ibmjtoeuc to work correctly, one or both of the environment variables must be set to ja or an equivalent locale. On entry to these commands, these envi- ronment variables are checked in the following order: LC_CTYPE and LANG. When a valid value is found, remaining environment variables for character classification are ignored. FILES
/usr/lib/jcodetables/ibmj-euc Code conversion table for IBM Japanese. SEE ALSO
iconv(1), jistoeuc(1), iconv_ja(5) DIAGNOSTICS
unexpected data encountered in input. Illegal character code is found in input file. BUGS
The ASCII/EBCDIC conversion table are taken from the 256 character standard in the CACM Nov, 1968. The conversion, while less blessed as a standard, corresponds better to certain IBM print train convertions. There is no universal solution. The Japanese EUC/IBM Japanese conversion table is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If JIS X 0212 caracter set is specified as input, euctoibmj can not support the conversion correctly. SunOS 5.10 10 Jan 2003 euctoibmj(1)
Man Page