Stopping Language Character Set Spam Post: 34324

8 More Discussions You Might Find Interesting

1. Solaris

latin 2 character-set with xterm

Hi, We have problems with the latin 2 Character-set with xterm. We have installed SunRay-Server with Solaris 8. Our Thinclients use hu- and cz-keyboards. I have set the right local-settings and xmodemaps. If I use the dtterm all is running fine. As soon as I use the xterm, it cannot display...

2. Programming

character set solaris

hi , i am trying to work on a script that transforms some special Dutch characters and send them to a Xerox printer .. the problem is that while doing so iam unable to identify th correct character set that is used by solaris , to transfer these characcters to Xerox character set . thanks...

3. UNIX for Advanced & Expert Users

iconv -l and ANSEL character set

I am forced to use the ANSEL character set for some GEDCOM documents but must convert them to a more modern set for another app which doesn't recognize ANSEL. I am unable to locate an ISO code for ANSEL in a search of the web. Would someone plese identify the ANSEL character set from the list given...

4. Shell Programming and Scripting

Unix character set problem

Hi All, We are getting file into our unix box with multibyte characters. When we tried to view the file the record looks like this Frédéric Actually the data sent to us is Fr�d�ric --> my locale charmap of unix is set to UTF8 only ... but still i am getting this problem. I...

5. Solaris

help me to change the character set

dears i am using solaris 10 i am facing a problem when i make setup for solaris i choose the country egypt and i select the language north america but i forget to do that the i found the date Jun written in arabic i want to change character set to written in english -rw-r--r-- 1 root ...

6. UNIX for Advanced & Expert Users

ASCII Character Set

I thought I would point this out. This has a lot of the non printing characters. ASCII Character Set

7. UNIX for Dummies Questions & Answers

Character set problem

Hi, I'm trying to edit a file with vi, but all special characters (�� etc) don't seem to show correctly. They don't seem to be supported by the OS (SunOS 5.10). I'm using MobaXterm as the terminal emulator, which is configured to use ISO-8859-1. The same charset is used on Solaris. If I open...

8. Shell Programming and Scripting

How to set character limit on READ?

Hello, I created the following (snippet from larger code): echo -n "A1: " read A1 VERIFY=$(echo -n $A1|wc -c) if ; then echo -e "TOO MANY CHARACTERS" fi echo -n "A2: " read A2 echo -n "A3: " read A3 echo -e "Concat: $B1/$B2/$B3" Basically what it does is it...

LEARN ABOUT SUNOS

iconv_zh_tw

iconv_zh_TW(5)							File Formats Manual						    iconv_zh_TW(5)

NAME

       iconv_zh_TW - code set conversion tables in traditional Chinese (zh_TW) locale

AVAILABILITY

       SUNWhleu

DESCRIPTION

       The following code set conversions are supported:

		  Code Set Conversions Supported
       Code	       Symbol		    TargetCode	    Symbol

       CNS 11643       zh_TW-euc	    Big-5	    zh_TW-big5
       CNS 11643       zh_TW-euc	    ISO 2022-7	    zh_TW-iso2022-7
       CNS 11643       zh_TW-euc	    UTF-8	    UTF-8
       CNS 11643       zh_TW-euc	    IS0 2022-CN-EXT zh_TW-iso2022-CN-EXT
       Big-5	       zh_TW-big5	    CNS 11643	    zh_TW-euc
       Big-5	       zh_TW-big5	    ISO 2022-7	    zh_TW-iso2022-7
       Big-5	       zh_TW-big5	    IS0 2022-CN-EXT zh_TW-is02022-CN-EXT
       Big-5	       zh_TW-big5	    UTF-8	    UTF-8
       ISO 2022-7      zh_TW-iso2022-7	    CNS 11643	    zh_TW-euc
       ISO 2022-7      zh_TW-iso2022-7	    Big-5	    zh_TW-big5
       IS0 2022-7      zh_TW-iso2022-7	    UTF-8	    UTF-8
       IS0 2022-CN-EXT zh_TW-iso2022-CN-EXT CNS 11643	    zh_TW-euc
       IS0 2022-CN-EXT zh_TW-iso2022-CN-EXT Big-5	    zh_TW-big5
       Code Page 937   zh_TW-cp937	    UTF-8	    UTF-8
       BIG5HK	       zh_HK-big5hk	    UTF-8	    UTF-8
       Big-5p	       zh_TW-big5p	    UTF-8	    UTF-8
       UTF-8	       UTF-8		    CNS 11643	    zh_TW-euc
       UTF-8	       UTF-8		    IS0 2022-7	    zh_TW-iso2022-7
       UTF-8	       UTF-8		    Big-5	    Big-5
       UTF-8	       UTF-8		    Code Page 937   zh_TW-cp937
       UTF-8	       UTF-8		    BIG5HK	    zh_HK-big5hk
       UTF-8	       UTF-8		    Big-5p	    zh_TW-big5p

       Conversions  are  performed  as described below. For all conversions, if the source code set includes characters not included in the target
       code set, conversion and output for all such characters will be done using a substitute characters.

zh_TW-euc to UTF-8 and UTF-8 to zh_TW-euc
       Conversion modules are provided to convert CNS 11643 plane 1, 2 and 3 characters between EUC-TW and UTF-8 encodings. If	input  data  which
       does  not  belong  to the above charset is encountered, it will be replaced with the substitute character (zh_TW-euc: '??' (0x3f3f), UTF-8:
       U+FFFD (0xefbfbd)).

zh_TW-euc to zh_TW-big5 and zh_TW-big5 to zh_TW-euc
       Conversion modules can be used to convert CNS 11643 plane 1 and 2 characters between EUC-TW and BIG5 encodings. If input  data  which  does
       not  belong  to	the  above charset is encountered, it will be replaced with the subsitute character (zh_TW-euc: '__' (0x5f5f), zh_TW-big5:
       '__' (0x5f5f)).

       Note that the seven additional popular characters from ETen extension have been supported, they belong to CNS 11643 plane 3.

zh_TW-euc to zh_TW-iso2022-7 and zh_TW-iso2022-7 to zh_TW-euc
       Conversion modules can be used to convert CNS 11643 characters between EUC-TW and ISO-2022-7 encodings.

zh_TW-euc to zh_TW-iso2022-CN-EXT and zh_TW-iso2022-CN-EXT to zh_TW-euc
       Conversion modules can be used to convert GB 2312-80, CNS 11643 plane 1, 2, 3, 4, 5, 6 and 7 characters between EUC-TW and  IS0-2022-CN-EXT
       encodings.

zh_TW-big5 to UTF-8 and UTF-8 to zh_TW-big5
       Conversion  modules  are  provided to convert Big-5 characters between BIG5 and UTF-8 encodings. If input data which does not belong to the
       above charset is encountered, it will be replaced with the substitute character (zh_TW-big5: '??' (0x3f3f), UTF-8: U+FFFD (0xefbfbd)).

       Note that the seven additional popular characters from ETen extension have been supported, they are 0xf9d6 -- 0xf9dc.

zh_TW-big5 to zh_TW-iso2022-7 and zh_TW-iso2022-7 to zh_TW-big5
       Conversion modules can be used to convert Big-5 characters between BIG5 and ISO-2022-7 encodings. If input Big-5 data which does  not  have
       corresponding CNS 11643 character is encountered, it will be replaced with the substitute character (zh_TW-iso2022-7: '__' (0x5f5f)).

zh_TW-big5 to zh_TW-iso2022-CN-EXT and zh_TW-iso2022-CN-EXT to zh_TW-big5
       Conversion  modules are provided to convert Big-5 characters between BIG5 and ISO-2022-CN-EXT encodings. If input Big-5 data which does not
       have corresponding CNS 11643 character is encountered, it will be replaced with the substitute character (zh_TW-iso2022-7: '__' (0x5f5f)).

zh_TW-big5p to UTF-8 and UTF-8 to zh_TW-big5p
       Conversion modules can be used to convert Big-5p characters between BIG5+ and UTF-8 encodings. If input data which doesn't  belong  to  the
       above charset is encountered, it will be replaced with the substitute character (zh_TW-big5p: '??' (0x3f3f), UTF-8: U+FFFD).

zh_HK-big5hk to UTF-8 and UTF-8 to zh_HK-big5hk
       Conversion modules can be used to convert HKSCS and Big-5 characters between BIG5HK and UTF-8 encoding. If input data which does not belong
       to the above charsets is encountered, it will be replaced with the substitute character (zh_HK-big5hk: '??' (0x3f3f), UTF-8: U+FFFD).

zh_TW-cp937 to UTF-8 and UTF-8 to zh_TW-cp937
       Conversion modules are provided to convert CNS 11643 characters between IBM Code Page 937 and UTF-8 encodings.

zh_TW-iso2022-7 to UTF-8 and UTF-8 to zh_TW-iso2022-7
       Conversion modules can be used to convert CNS 11643 characters between ISO-2022-7 and UTF-8 encodings.

SEE ALSO

       iconv(1), iconv(3), iconv(5), iconv_zh(5)

								    2 Nov 2001							    iconv_zh_TW(5)