Sponsored Content
Operating Systems AIX Locales and conversion tables confusion - Some characters don't get translated Post 302503376 by Adamm on Thursday 10th of March 2011 10:37:12 AM
Old 03-10-2011
Locales and conversion tables confusion - Some characters don't get translated

Hello,

my program running on AIX receives a message (from MQ) which is in CCSID 870 (LATIN-2 EBCDIC). It needs to convert it to CCSID 912 (ISO-8859-2). For this conversion, it uses the Websphere MQ functionality, but which, according to the manual, uses AIX's system conversion tables. (Btw - it only works like this on AIX. On Linux or Windows, Websphere MQ uses its own coversion logic, ignoring system conversion tables - and it works there.)

The problem is, that some characters like š,č,ť,ž get translated to chr(26).

My checklist was:
1. The LANG environment variable is set to sk_SK
2. Conversion tables seem to be there
(root):/usr/lib/nls/loc/iconvTable# ls -l
...
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-1250_IBM-5346
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-1282_IBM-5346
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-1284_IBM-5346
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-1285_IBM-5346
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-5346_IBM-1250
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-5346_IBM-1282
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-5346_IBM-1284
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-5346_IBM-1285
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-5346_IBM-852
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-5346_IBM-870
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-5346_ISO8859-2
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-852_IBM-5346
-r--r--r-- 1 bin bin 288 Aug 12 2010 IBM-870_IBM-5346
-r--r--r-- 1 bin bin 288 Aug 12 2010 ISO8859-2_IBM-5346

Can you please help me find out what am I forgeting? Thanx.
 

8 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Locales - Solaris 8

Can anyone tell me the difference between the 2 Locales: North American Partial Locales en_US.ISO8859-1 en_US.ISO8859-15 Does it matter which one to use? What are the differences? Thanks. -I have found the below info but I don't know what it means(Also no RFC# for 8859-15): ... (2 Replies)
Discussion started by: finster
2 Replies

2. AIX

Locales in AIX

Currently we have the following locales installed on two of our servers SERVER-1 $ locale -a C POSIX en_US en_US.8859-15 en_US.ISO8859-1 EN_US EN_US.UTF-8 ZH_CN ZH_CN.UTF-8 HI_IN HI_IN.UTF-8 ko_KR ko_KR.IBM-eucKR zh_TW zh_TW.IBM-eucTW SERVER-2 (0 Replies)
Discussion started by: voorkey
0 Replies

3. Shell Programming and Scripting

Converting tables of row data into columns of tables

I am trying to transpose tables listed in the format into format. Any help would be greatly appreciated. Input: test_data_1 1 2 90% 4 3 91% 5 4 90% 6 5 90% 9 6 90% test_data_2 3 5 92% 5 4 92% 7 3 93% 9 2 92% 1 1 92% ... Output:... (7 Replies)
Discussion started by: justthisguy
7 Replies

4. AIX

SMTP and characters conversion

Hello, I'm working with an AIX 5.3 This server receive emails from an application (ServiceDesk management) and these emails are read and integrated in another application via an EAI. I work for a french company with french people that write in french : with accentued characters (é,ç,à,...). Sadly,... (2 Replies)
Discussion started by: Djizus
2 Replies

5. Shell Programming and Scripting

Setting locales

I'm having a problem with my locale setting. Essentially, when using Xmgrace or Awk, all . are replaced with , leading to wrong arithmetic. Here is an explanation of the problem: awk printing thousands with commas. see LC_ALL=en_US.UTF-8 How can I permanently set the locale... (1 Reply)
Discussion started by: chrisjorg
1 Replies

6. IP Networking

Does Translated IP Matter For Proxy Server (SQUID)

I am using Squid to create a proxy server for framework related to an application. My users sit on a private network in 191.xx.xx.xx space and my proxy sits in a different private network in 188.xx.xx.xx space. There will be a NAT in place to allow bidirectional communications but is there an... (5 Replies)
Discussion started by: BobSpero
5 Replies

7. Linux

File conversion and removing special characters from a file in Linux

I have a .CSV file when I check for the special characters in the file using the command cat -vet filename.csv, i get very lengthy lines with "^@", "^I^@" and "^@^M" characters in between each alphabet in all of the records. Using the code below file filename.csv I get the output as I have a... (2 Replies)
Discussion started by: dhruuv369
2 Replies

8. UNIX for Beginners Questions & Answers

Replacing string/special characters using a 'conversion' table

Hi, Does anyone know if there is a script or program available out there that uses a conversion table to replace special characters from a file? I am trying to remove some special characters from a file but there are several unprintable/control characters that some I need to remove but some I... (2 Replies)
Discussion started by: newbie_01
2 Replies
iconv_ja(5)						Standards, Environments, and Macros					       iconv_ja(5)

NAME
iconv_ja - code set conversions in ja locale DESCRIPTION
The following code set conversions are supported: +-----------------------------------------------------------+ | Code Set|Conversions Supported | | Source Code | Target Code | |eucJP | PCK | |eucJP | ISO-2022-JP | |eucJP | ISO-2022-JP.RFC1468 | |eucJP | JIS7 | |eucJP | SJIS | |eucJP | UTF-8 | |eucJP | UTF-8-Java | |eucJP | jis | |eucJP | ibmj | |eucJP | ibmj-EBCDIK | |SJIS | eucJP | |SJIS | ISO-2022-JP | |SJIS | UTF-8 | |SJIS | jis | |SJIS | ibmj | |PCK | eucJP | |PCK | UTF-8 | |PCK | UTF-8-Java | |PCK | ISO-2022-JP | |PCK | ISO-2022-JP.RFC1468 | |PCK | jis | |PCK | ibmj | |PCK | ibmj-EBCDIK | |ISO-2022-JP | eucJP | |ISO-2022-JP | PCK | |ISO-2022-JP | SJIS | |ISO-2022-JP | UTF-8 | |UTF-8 | eucJP | |UTF-8 | SJIS | |UTF-8 | PCK | |UTF-8 | ISO-2022-JP | |UTF-8 | ISO-2022-JP.RFC1468 | |UTF-8-Java | eucJP | |UTF-8-Java | PCK | |JIS7 | eucJP | |jis | eucJP | |jis | PCK | |jis | SJIS | |ibmj | eucJP | |ibmj | PCK | |ibmj | SJIS | |ibmj-EBCDIK | eucJP | |ibmj-EBCDIK | PCK | +----------------------+------------------------------------+ +-------------------------------------------------------------+ | Code Set Conversions Supported | | Source Code | Target Code | |eucJP | ibm930 | |eucJP | ibm931 | |eucJP | ibm939 | |eucJP | ibm5026 | |eucJP | ibm5035 | |eucJP | FujitsuJEF-ascii-code | |eucJP | FujitsuJEF-kana-code | |eucJP | FujitsuJEF-ascii-face | |eucJP | FujitsuJEF-kana-face | |eucJP | HitachiKEIS83 | |eucJP | HitachiKEIS90 | |eucJP | NECJIPS | |PCK | ibm930 | |PCK | ibm931 | |PCK | ibm939 | |PCK | ibm5026 | |PCK | ibm5035 | |PCK | FujitsuJEF-ascii-code | |PCK | FujitsuJEF-kana-code | |PCK | FujitsuJEF-ascii-face | |PCK | FujitsuJEF-kana-face | |PCK | HitachiKEIS83 | |PCK | HitachiKEIS90 | |PCK | NECJIPS | |UTF-8 | ibm930 | |UTF-8 | ibm931 | |UTF-8 | ibm939 | |UTF-8 | ibm5026 | |UTF-8 | ibm5035 | |UTF-8 | ms932 | |UTF-8 | UTF-8-ms932 | |UTF-8 | FujitsuJEF-ascii-code | |UTF-8 | FujitsuJEF-kana-code | |UTF-8 | FujitsuJEF-ascii-face | |UTF-8 | FujitsuJEF-kana-face | |UTF-8 | HitachiKEIS83 | |UTF-8 | HitachiKEIS90 | |UTF-8 | NECJIPS | |UTF-8-ms932 | UTF-8 | |ibm930 | eucJP | |ibm930 | PCK | |ibm930 | UTF-8 | |ibm931 | eucJP | |ibm931 | PCK | |ibm931 | UTF-8 | |ibm939 | eucJP | |ibm939 | PCK | |ibm939 | UTF-8 | |ibm5026 | eucJP | |ibm5026 | PCK | |ibm5026 | UTF-8 | |ibm5035 | eucJP | |ibm5035 | PCK | |ibm5035 | UTF-8 | |FujitsuJEF-ascii-code | eucJP | |FujitsuJEF-ascii-code | PCK | |FujitsuJEF-ascii-code | UTF-8 | |FujitsuJEF-kana-code | eucJP | |FujitsuJEF-kana-code | PCK | |FujitsuJEF-kana-code | UTF-8 | |FujitsuJEF-ascii-face | eucJP | |FujitsuJEF-ascii-face | PCK | |FujitsuJEF-ascii-face | UTF-8 | +----------------------+--------------------------------------+ +------------------------------------------------------------+ | Code Set Conversions Supported | | Source Code | Target Code | |FujitsuJEF-kana-face | eucJP | |FujitsuJEF-kana-face | PCK | |FujitsuJEF-kana-face | UTF-8 | |HitachiKEIS83 | eucJP | |HitachiKEIS83 | PCK | |HitachiKEIS83 | UTF-8 | |HitachiKEIS90 | eucJP | |HitachiKEIS90 | PCK | |HitachiKEIS90 | UTF-8 | |NECJIPS | eucJP | |NECJIPS | PCK | |NECJIPS | UTF-8 | |ms932 | UTF-8 | +---------------------+--------------------------------------+ The descriptions of each code sets in the above table are as follows: +-----------------------------------------------------------+ | Description of Supported Code Sets | | Codeset | Description | |eucJP |Japanese EUC | |PCK |PC kanji | |SJIS |the same as PC kanji (eol in future) | |ISO-2022-JP |Coded representation of the charac- | | |ter sets ISO 646 IRV or JIS X 0201, | | |JIS X 0208, and JIS X 0212 according | | |to UI/OSF Application Platform Pro- | | |file for Japanese Environment Ver- | | |sion 1.1 item 7.1 using the designa- | | |tion sequence to G0 specified by ISO | | |2022 | +---------------------+-------------------------------------+ +-----------------------------------------------------------+ | Description of Supported Code Sets | | Codeset | Description | |JIS7 |same as ISO-2022-JP | |ISO-2022-JP.RFC1468 |Coded representation of the charac- | | |ter sets ISO 646 IRV or JIS X | | |0201-1976 (except for figure charac- | | |ter set for katakana), and JIS X | | |0208-1983 according to RFC1468 | | |(Request for Comments: 1468 Japanese | | |Character Encoding for Internet Mes- | | |sages) using the designation | | |sequence to G0 specified by ISO 2022 | |jis |JIS 7bit code used in JLE, JFP 2.4 | | |and the preceding releases | |ibmj |IBM Kanji code | |ibmj-EBCDIK |Maps single-byte code set (SBCS) of | | |IBM host code to the character set | | |that is called the EBCDIK code set | | |in general. The character code set | | |includes the IBM code page 290 and | | |threee more characters '`' | | |(0x79),'{' (0xc0), and '}' (0xd0). | | |Japanese katakana characters are | | |included, but lowercase alphabet | | |letters are not. In case of double- | | |byte code set (DBCS), the descrip- | | |tion is the same as the code set | | |"ibmj." | |UTF-8 |UNI CODE | |UTF-8-Java |UNI CODE implemented in Java | +---------------------+-------------------------------------+ +-------------------------------------------------------------+ | Description of Supported Code Sets | | Codeset | Description | |ibm930 | IBM CCSID 930: SBSC code page 290 | | | (extended), character set 1172, DBCS | | | code page 300, character set 1001 | | | 4370 user defined characters | |ibm931 | IBM CCSID 931: SBSC code page 37, | | | character set 101, DBCS code page | | | 300, character set 1001 | | | 4370 user defined characters | |ibm939 | IBM CCSID 930: SBSC code page 1027, | | | character set 1172, DBCS code page | | | 300, character set 1001 4370 user | | | defined characters | |ibm5026 | IBM CCSID 5026: same as ibm930, | | | except this code set supports 1880 | | | user defined characters | |ibm5035 | IBM CCSID 5035: same as ibm939, | | | except this code set supports 1880 | | | user defined characters | |FujitsuJEF-ascii-code | Fujitsu JEF code which uses | | | EBCDIC(ASCII) as single byte charac- | | | ter set. Charatcters differently | | | handled between JIS C 6226:1978 and | | | JIS X 0208:1983 are mapped based on | | | code value. | |FujitsuJEF-ascii-face | Fujitsu JEF code which uses | | | EBCDIC(ASCII) as single byte charac- | | | ter set. Charatcters differently | | | handled between JIS C 6226:1978 and | | | JIS X 0208:1983 are mapped based on | | | character face. | |FujitsuJEF-kana-code | Fujitsu JEF code which uses | | | EBCDIC(Kana) as single byte charac- | | | ter set. Caratcters differently | | | handled between JIS C 6226:1978 and | | | JIS X 0208:1983 are mapped based on | | | code value. | |FujitsuJEF-kana-face | Fujitsu JEF code which uses | | | EBCDIC(Kana) as single byte charac- | | | ter set. Charatcters differently | | | handled between JIS C 6226:1978 and | | | JIS X 0208:1983 are mapped based on | | | character face. | +----------------------+--------------------------------------+ +-----------------------------------------------------------+ | Description of Supported Code Sets | | Codeset | Description | |HitachiKEIS8 |Hitachi KEIS83 | |HitachiKEIS90 |Hitachi KEIS83 | |NECJIPS |NEC JIPS code | |ms932 |Shift JIS codeset which is supported | | |by Windows NT 3.51. Conversion | | |betwenn this codeset and UTF-8 is | | |done in the same way Windows NT 3.51 | | |does. | |UTF-8-ms932 |UTF-8 encoded Unicode which was con- | | |verted from ms932 | +---------------------+-------------------------------------+ Conversions are performed as described below. For all conversions, if the source code set includes characters not included in the target code set, conversion and output for all such characters will be done using a substitute character. eucJP to PCK (SJIS) and PCK (SJIS) to eucJP Conversion between eucJP and PCK (SJIS) can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined and vendor-defined characters based on TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. If input data which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. eucJP to ISO-2022-JP(JIS7) and ISO-2022-JP(JIS7) to eucJP Conversion between eucJP and ISO-2022-JP(JIS7) can be used to convert JIS X 0201, JIS X 0208 and JIS X 0212. If input data which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of success- ful conversion. eucJP to ISO-2022-JP.RFC1468 Conversion from eucJP to ISO-2022-JP.RFC1468 can be used to convert JIS X 0201 (except for figure character set for katakana) and JIS X 0208. If JIS X 0201 (figure character set for katakana), JIS X 0212, a user-defined, or a vendor-defined character is encountered among input data, it will be replaced with the substitute character ` ? ' (0x3f). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. eucJP to jis and jis to eucJP Conversion between eucJP and jis is provided for the compatibility with ujtojis7() and jis7touj() libraries, and euctojis and jis- toeuc utilities. It is extended to handle JIS X 0212. See jistoeuc(1). eucJP to UTF-8 and UTF-8 to eucJP Conversion between eucJP and UTF-8 can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, a user-defined, and a vendor-defined character. If input data which does not have the corresponding character in the target code set is encountered, it will be replaced with the substitute character (eucJP: ` ? ' (0x3f), UTF-8: U+FFFD(0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. eucJP to UTF-8-Java and UTF-8-Java to eucJP Conversion between eucJP and UTF-8-Java can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212. If a user-defined or vendor- defined character is encountered among input data, it will be replaced with the substitute character (eucJP: ` ? ' (0x3f), UTF-8: U+FFFD(0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. eucJP to ibmj and ibmj to eucJP Conversion between eucJP and ibmj is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert eucJP to ibmj, JISX 0201 and JIS X 0201 are all converted to substitute character. eucJP to ibmj-EBCDIK and ibmj-EBCDIK to eucJP Conversion between eucJP and ibmj-EBCDIK is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert eucJP to ibmj-EBCDIK, JISX 0201 and JIS X 0201 that have not correspondence characters with ibmj-EBCDIKare all converted to substitute character. PCK (SJIS) to ISO-2022-JP and ISO-2022-JP to PCK (SJIS) Conversion between PCK (SJIS) and ISO-2022-JP can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined and vendor- defined characters based on TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. If input data which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. PCK (SJIS) to ISO-2022-JP.RFC1468 Conversion from PCK (SJIS) to ISO-2022-JP.RFC1468 can be used to convert JIS X 0201 (except for figure character set for katakana) and JIS X 0208. If JIS X 0201 (figure character set for katakana), a user-defined, or a vendor-defined character is encountered among input data, it will be replaced with the substitute character ` ? ' (0x3f). If input data which does not belong to these code sets is encoun- tered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. PCK (SJIS) to UTF-8 and UTF-8 to PCK (SJIS) Conversion between PCK (SJIS) and UTF-8 can be used to convert JIS X 0201, JIS X 0208, a user-defined, and a vendor-defined character. If input data which does not have the corresponding character in the target code set is encountered, it will be replaced with the sub- stitute character (PCK: ` ? ' (0x3f), UTF-8: U+FFFD(0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. PCK (SJIS) to UTF-8-Java and UTF-8-Java to PCK (SJIS) Conversion between PCK (SJIS) and UTF-8-Java can be used to convert JIS X 0201 and JIS X 0208. If a user-defined or vendor-defined character is encountered among input data, it will be replaced with the substitute character (PCK: ` ? ' (0x3f), UTF-8: U+FFFD (0xef- bfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. PCK (SJIS) to jis and jis to PCK (SJIS) Conversion between PCK (SJIS) and jis is provided for the compatibility with sjtojis7() and jis7tosj() libraries , and sjtojis jistosj utilities. It is extended besed on TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japa- nese EUC and Shift-JIS. See jistosj(1). PCK (SJIS) to ibmj and ibmj to PCK (SJIS) Conversion between PCK (SJIS) and ibmj is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert PCK (SJIS) to ibmj, all characters converted to JIS X 0212 by kana characters (0xa1 to 0xdf) and TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS are all converted to substi- tute character. PCK to ibmj-EBCDIK and ibmj-EBCDIK to PCK Conversion between PCK and ibmj-EBCDIK is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert PCK to ibmj-EBCDIK, all characters converted to JIS X 0212 by JIS X 0212 and TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS are all converted to substitute character. ISO-2022-JP to UTF-8 and UTF-8 to ISO-2022-JP Conversion between ISO-2022-JP and UTF-8 can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined and vendor-defined characters. If input data which does not have the corresponding character in the target code set is encountered, it will be replaced with the substitute character (ISO-2022-JP: ` ? ' (0x3f), UTF-8: U+FFFD(0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. UTF-8 to ISO-2022-JP.RFC1468 Conversion from UTF-8 to ISO-2022-JP.RFC1468 can be used to convert JIS X 0201 (except for figure character set for katakana) and JIS X 0208. If JIS X 0201 (figure character set for katakana), JIS X 0212, a user-defined, or a vendor-defined character is encountered among input data, it will be replaced with the substitute character ` ? ' (0x3f). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion. eucJP, PCK, UTF-8 to ibm930, ibm931, ibm939, ibm5026, ibm5035 Conversion from eucJP, PCK, or UTF-8 to ibm930,ibm931,ibm939, ibm5026,ibm5035 can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, IBM extension characters, and user defined character. Input data which does not have corresponding character in the target code set is replaced with the substitute character. Since ibm931 does not support Kana characters in its single byte code set (SBCS), JIS X 0201 Kana characters are replaced with substitute characters in conversion to ibm931. ibm930, ibm931, ibm939, ibm5026, or ibm5035 to eucJP, PCK, or UTF-8 Conversion from ibm930, ibm931, ibm939, ibm5026, or ibm5035 to eucJP, PCK, or UTF-8 can be used to convert SBCS/DBCS characters defined in input code set. Input data which does not have corresponding character in the target code set is replaced with the substitute char- acter. FujitsuJEF-ascii-code or FujitsuJEF-kana-code to eucJP, PCK, or UTF-8: eucJP, PCK, or UTF-8 to FujitsuJEF-ascii-code or FujitsuJEF-kana- code Conversion from FujitsuJEF-ascii-code or FujitsuJEF-kana-code to eucJP, PCK, or UTF-8 and eucJP, PCK, or UTF-8 to FujitsuJEF-ascii-code or FujitsuJEF-kana-code can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212 characters defined in input code set. Input data which does not have corresponding character in the target code set is replaced with the substitude character. Characters handled dif- ferently between JIS C 6226:1978 and JIS X 0208:1983 are converted based on code value. In conversion to FujitsuJEF-kana-code, JIS X 0201 Katakana characters are replaced with the substitute character. FujitsuJEF-ascii-face or FujitsuJEF-kana-face to eucJP, PCK, or UTF-8: eucJP, PCK, UTF-8 to FujitsuJEF-ascii-face or FujitsuJEF-kana-face Characters handled differently between JIS C 6226:1978 and JIS X 0208:1983 are converted based on character face. Other behavior is same as that of FujitsuJEF-ascii-code or FujitsuJEF-kana-code. HitachiKEIS83 to eucJP, PCK, or UTF-8: eucJP, PCK, UTF-8 to HitachiKEIS83 Conversion from HitachiKEIS83 to eucJP, PCK, or UTF-8 and from eucJP, PCK, UTF-8 to HitachiKEIS83 can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212 characters. Input data which does not have corresponding character in the target code set is replaced with the substitude character. HitachiKEIS90 to eucJP, PCK, or UTF-8: eucJP, PCK, or UTF-8 to HitachiKEIS90 Behavior of conversion from HitachiKEIS90 to eucJP, PCK, or UTF-8 and from eucJP, PCK, or UTF-8 to HitachiKEIS90 is same as that of KEIS83 except that this conversion uses KEIS90 instead of KEIS83. NECJIPS to eucJP, PCK, or UTF-8: eucJP, PCK, or UTF-8 to NECJIPS Conversion from NECJIPS to eucJP, PCK, or UTF-8 and eucJP, PCK, or UTF-8 to NECJIPS can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212 characters. Input data which does not have corresponding character in the target code set is replaced with the substitude character. EBCDIC Katakana (EK) code is used for single byte character set. ms932 to UTF-8 and UTF-8 to ms932 Conversion between ms932 and UTF-8 is done using same way of mapping characters between the two codesets as Windows NT 3.51 does. UTF-8 to UTF-8-ms932 and UTF-8-ms932 to UTF-8 This converts between "UTF-8" and "UTF-8-ms932", which are UTF-8 encoded Unicode converted from PCK, and that converted from ms932. SEE ALSO
iconv(1), jistoeuc(1), jistosj(1), iconv(3C), iconv(5), iconv_unicode(5) SunOS 5.10 10 Jan 2003 iconv_ja(5)
All times are GMT -4. The time now is 02:51 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy