Sponsored Content
Top Forums UNIX for Beginners Questions & Answers Answers to Frequently Asked Questions Email Antispam Techniques and Email Filtering Stopping Language Character Set Spam Post 34324 by Neo on Sunday 16th of February 2003 09:07:39 PM
Old 02-16-2003
Latest Set.....

Lastest set of these,, working great....

Code:
:0
* charset.*ks_c_5601|euc-kr|3Deuc-kr|euc-kr|big5|gb2312|koi8|iso-ir-111
charset_spam

:0
* charset.*iso-8859-[2-8]|euc-jp|iso-2022|windows-125
charset_spam


:0
* charset.*shift_jis|x-johab|x-unified-hangul|3Dgb2312
charset_spam


:0
* charset.*cn-gb|cn-big5|utf-8|x-euc-tw|iso_2022_cn
chinese_charset_spam

:0
* ^Subject:*ks_c_5601-1987|euc-kr|3Deuc-kr|euc-kr|big5|gb2312|utf-8
charset_spam

 

8 More Discussions You Might Find Interesting

1. Solaris

latin 2 character-set with xterm

Hi, We have problems with the latin 2 Character-set with xterm. We have installed SunRay-Server with Solaris 8. Our Thinclients use hu- and cz-keyboards. I have set the right local-settings and xmodemaps. If I use the dtterm all is running fine. As soon as I use the xterm, it cannot display... (0 Replies)
Discussion started by: paho
0 Replies

2. Programming

character set solaris

hi , i am trying to work on a script that transforms some special Dutch characters and send them to a Xerox printer .. the problem is that while doing so iam unable to identify th correct character set that is used by solaris , to transfer these characcters to Xerox character set . thanks... (2 Replies)
Discussion started by: ppass
2 Replies

3. UNIX for Advanced & Expert Users

iconv -l and ANSEL character set

I am forced to use the ANSEL character set for some GEDCOM documents but must convert them to a more modern set for another app which doesn't recognize ANSEL. I am unable to locate an ISO code for ANSEL in a search of the web. Would someone plese identify the ANSEL character set from the list given... (4 Replies)
Discussion started by: Whiterock
4 Replies

4. Shell Programming and Scripting

Unix character set problem

Hi All, We are getting file into our unix box with multibyte characters. When we tried to view the file the record looks like this Frédéric Actually the data sent to us is Frédéric --> my locale charmap of unix is set to UTF8 only ... but still i am getting this problem. I... (6 Replies)
Discussion started by: sandeeppvk
6 Replies

5. Solaris

help me to change the character set

dears i am using solaris 10 i am facing a problem when i make setup for solaris i choose the country egypt and i select the language north america but i forget to do that the i found the date Jun written in arabic i want to change character set to written in english -rw-r--r-- 1 root ... (4 Replies)
Discussion started by: hosney00ux
4 Replies

6. UNIX for Advanced & Expert Users

ASCII Character Set

I thought I would point this out. This has a lot of the non printing characters. ASCII Character Set (7 Replies)
Discussion started by: cokedude
7 Replies

7. UNIX for Dummies Questions & Answers

Character set problem

Hi, I'm trying to edit a file with vi, but all special characters (áéíóú etc) don't seem to show correctly. They don't seem to be supported by the OS (SunOS 5.10). I'm using MobaXterm as the terminal emulator, which is configured to use ISO-8859-1. The same charset is used on Solaris. If I open... (4 Replies)
Discussion started by: Subbeh
4 Replies

8. Shell Programming and Scripting

How to set character limit on READ?

Hello, I created the following (snippet from larger code): echo -n "A1: " read A1 VERIFY=$(echo -n $A1|wc -c) if ; then echo -e "TOO MANY CHARACTERS" fi echo -n "A2: " read A2 echo -n "A3: " read A3 echo -e "Concat: $B1/$B2/$B3" Basically what it does is it... (4 Replies)
Discussion started by: jl487
4 Replies
iconv_zh_TW(5)							File Formats Manual						    iconv_zh_TW(5)

NAME
iconv_zh_TW - code set conversion tables in traditional Chinese (zh_TW) locale AVAILABILITY
SUNWhleu DESCRIPTION
The following code set conversions are supported: Code Set Conversions Supported Code Symbol TargetCode Symbol CNS 11643 zh_TW-euc Big-5 zh_TW-big5 CNS 11643 zh_TW-euc ISO 2022-7 zh_TW-iso2022-7 CNS 11643 zh_TW-euc UTF-8 UTF-8 CNS 11643 zh_TW-euc IS0 2022-CN-EXT zh_TW-iso2022-CN-EXT Big-5 zh_TW-big5 CNS 11643 zh_TW-euc Big-5 zh_TW-big5 ISO 2022-7 zh_TW-iso2022-7 Big-5 zh_TW-big5 IS0 2022-CN-EXT zh_TW-is02022-CN-EXT Big-5 zh_TW-big5 UTF-8 UTF-8 ISO 2022-7 zh_TW-iso2022-7 CNS 11643 zh_TW-euc ISO 2022-7 zh_TW-iso2022-7 Big-5 zh_TW-big5 IS0 2022-7 zh_TW-iso2022-7 UTF-8 UTF-8 IS0 2022-CN-EXT zh_TW-iso2022-CN-EXT CNS 11643 zh_TW-euc IS0 2022-CN-EXT zh_TW-iso2022-CN-EXT Big-5 zh_TW-big5 Code Page 937 zh_TW-cp937 UTF-8 UTF-8 BIG5HK zh_HK-big5hk UTF-8 UTF-8 Big-5p zh_TW-big5p UTF-8 UTF-8 UTF-8 UTF-8 CNS 11643 zh_TW-euc UTF-8 UTF-8 IS0 2022-7 zh_TW-iso2022-7 UTF-8 UTF-8 Big-5 Big-5 UTF-8 UTF-8 Code Page 937 zh_TW-cp937 UTF-8 UTF-8 BIG5HK zh_HK-big5hk UTF-8 UTF-8 Big-5p zh_TW-big5p Conversions are performed as described below. For all conversions, if the source code set includes characters not included in the target code set, conversion and output for all such characters will be done using a substitute characters. zh_TW-euc to UTF-8 and UTF-8 to zh_TW-euc Conversion modules are provided to convert CNS 11643 plane 1, 2 and 3 characters between EUC-TW and UTF-8 encodings. If input data which does not belong to the above charset is encountered, it will be replaced with the substitute character (zh_TW-euc: '??' (0x3f3f), UTF-8: U+FFFD (0xefbfbd)). zh_TW-euc to zh_TW-big5 and zh_TW-big5 to zh_TW-euc Conversion modules can be used to convert CNS 11643 plane 1 and 2 characters between EUC-TW and BIG5 encodings. If input data which does not belong to the above charset is encountered, it will be replaced with the subsitute character (zh_TW-euc: '__' (0x5f5f), zh_TW-big5: '__' (0x5f5f)). Note that the seven additional popular characters from ETen extension have been supported, they belong to CNS 11643 plane 3. zh_TW-euc to zh_TW-iso2022-7 and zh_TW-iso2022-7 to zh_TW-euc Conversion modules can be used to convert CNS 11643 characters between EUC-TW and ISO-2022-7 encodings. zh_TW-euc to zh_TW-iso2022-CN-EXT and zh_TW-iso2022-CN-EXT to zh_TW-euc Conversion modules can be used to convert GB 2312-80, CNS 11643 plane 1, 2, 3, 4, 5, 6 and 7 characters between EUC-TW and IS0-2022-CN-EXT encodings. zh_TW-big5 to UTF-8 and UTF-8 to zh_TW-big5 Conversion modules are provided to convert Big-5 characters between BIG5 and UTF-8 encodings. If input data which does not belong to the above charset is encountered, it will be replaced with the substitute character (zh_TW-big5: '??' (0x3f3f), UTF-8: U+FFFD (0xefbfbd)). Note that the seven additional popular characters from ETen extension have been supported, they are 0xf9d6 -- 0xf9dc. zh_TW-big5 to zh_TW-iso2022-7 and zh_TW-iso2022-7 to zh_TW-big5 Conversion modules can be used to convert Big-5 characters between BIG5 and ISO-2022-7 encodings. If input Big-5 data which does not have corresponding CNS 11643 character is encountered, it will be replaced with the substitute character (zh_TW-iso2022-7: '__' (0x5f5f)). zh_TW-big5 to zh_TW-iso2022-CN-EXT and zh_TW-iso2022-CN-EXT to zh_TW-big5 Conversion modules are provided to convert Big-5 characters between BIG5 and ISO-2022-CN-EXT encodings. If input Big-5 data which does not have corresponding CNS 11643 character is encountered, it will be replaced with the substitute character (zh_TW-iso2022-7: '__' (0x5f5f)). zh_TW-big5p to UTF-8 and UTF-8 to zh_TW-big5p Conversion modules can be used to convert Big-5p characters between BIG5+ and UTF-8 encodings. If input data which doesn't belong to the above charset is encountered, it will be replaced with the substitute character (zh_TW-big5p: '??' (0x3f3f), UTF-8: U+FFFD). zh_HK-big5hk to UTF-8 and UTF-8 to zh_HK-big5hk Conversion modules can be used to convert HKSCS and Big-5 characters between BIG5HK and UTF-8 encoding. If input data which does not belong to the above charsets is encountered, it will be replaced with the substitute character (zh_HK-big5hk: '??' (0x3f3f), UTF-8: U+FFFD). zh_TW-cp937 to UTF-8 and UTF-8 to zh_TW-cp937 Conversion modules are provided to convert CNS 11643 characters between IBM Code Page 937 and UTF-8 encodings. zh_TW-iso2022-7 to UTF-8 and UTF-8 to zh_TW-iso2022-7 Conversion modules can be used to convert CNS 11643 characters between ISO-2022-7 and UTF-8 encodings. SEE ALSO
iconv(1), iconv(3), iconv(5), iconv_zh(5) 2 Nov 2001 iconv_zh_TW(5)
All times are GMT -4. The time now is 02:57 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy