Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

tacchtype(centos) [centos man page]

thai/wtt.h(3)							      libthai							     thai/wtt.h(3)

thai/wtt.h - WTT I/O implementation. SYNOPSIS
Enumerations enum WTTClass { CTRL = 0, NON = 1, CONS = 2, LV = 3, FV1 = 4, FV2 = 5, FV3 = 6, BV1 = 7, BV2 = 8, BD = 9, TONE = 10, AD1 = 11, AD2 = 12, AD3 = 13, AV1 = 14, AV2 = 15, AV3 = 16 } Classification of characters in TIS620 according to WTT. enum WTTOp { CP = 1, XC = 2, AC = 3, RJ = 4, SR = 5 } Composibility checking tables. Functions WTTClass TACchtype (thchar_t c) WTT character class. WTTOp TACio_op (thchar_t c1, thchar_t c2) WTT I/O operation. Detailed Description WTT I/O implementation. WTT stands for Wing Tuk Tee (in Thai, Runs everywhere). It was defined by TACTIS (Thai API Consortium/Thai Industrial Standard) in the NECTEC Thai Software Standard Project (1989-1991), and later endorsed by Thai Industrial Standard Institute (TISI) as TIS 1566-2541 in 1998. WTT classifies Thai chracter(TIS-620) into 17 types below. ITYPE VALUE SHORT_DESCRIPTION CTRL 0 control characters NON 1 non composible characters CONS 2 consonants LV 3 leading vowels FV1 4 following vowels 1 FV2 5 following vowels 2 FV3 6 following vowels 3 BV1 7 below vowels 1 BV2 8 below vowels 2 BD 9 below diacritics TONE 10 tonemarks AD1 11 above diacritics 1 AD2 12 above diacritics 2 AD3 13 above diacritics 3 AV1 14 above vowels 1 AV2 15 above vowels 2 AV3 16 above vowels 3 Functions in thctype.h do basic character classifications while wtt.h classifies a chracter in detail. Please refer to the reference. TACio_op() checks how to compose two given chracters. The possiblities are Composible (CP), Non-display (XC), Accept (AC), Reject (RJ) and Strict Reject (SR). The values of CP, XC, AC, RJ and SR are difined in wtt.h. And their meanings are: o CP : second character is displayed in the same cell as the first, also implies an acceptance. o XC : Do nothing. o AC : Display second character in the next cell. o RJ : Discard second character. o SR : Reject second character only in strict mode. Enumeration Type Documentation enum WTTClass Classification of characters in TIS620 according to WTT. Enumerator CTRL control chars NON non composibles CONS consonants LV leading vowels FV1 following vowels 1 FV2 following vowels 2 FV3 following vowels 3 BV1 below vowels 1 BV2 below vowels 2 BD below diacritics TONE tonemarks AD1 above diacritics 1 AD2 above diacritics 2 AD3 above diacritics 3 AV1 above vowels 1 AV2 above vowels 2 AV3 above vowels 3 enum WTTOp Composibility checking tables. Enumerator CP COMPOSIBLE - following char is displayed in the same cell as leading char, also implies ACCEPT XC Non-display AC ACCEPT - display the following char in the next cell RJ REJECT - discard that following char, ignore it SR STRICT REJECT - REJECT only if in strict mode Function Documentation WTTClass TACchtype (thchar_tc) WTT character class. Parameters: c : the character Returns: WTT character class { CTRL, ..., AV3 } WTTOp TACio_op (thchar_tc1, thchar_tc2) WTT I/O operation. Parameters: c1 : previous character c2 : next character Returns: WTT I/O operation { CP, XC, AC, RJ, SR } Author Generated automatically by Doxygen for libthai from the source code. Version 0.1.14 Tue Jun 17 2014 thai/wtt.h(3)

Check Out this Related Man Page

TACTIS(5)							File Formats Manual							 TACTIS(5)

TACTIS, tactis - A character encoding system (codeset) for Thai. DESCRIPTION
The TACTIS (Thai API Consortium/Thai Industrial Standard) codeset consists of the following two character sets: ASCII (ISO 646-1983) TIS 620-2533 These characters are 8-bit coded, ranging from 00 to FF. ASCII Characters In the TACTIS codeset, all ASCII characters are implemented in the form of single-byte, 7-bit characters; that is, the most significant bit (MSB) of ASCII characters is always set off. For more information, refer to ascii(5). TIS 620-2533 Characters The TIS 620-2533 character set includes 89 characters that are categorized as follows: Consonants: 44 Vowels: 18 total (5 leading vowels, 6 following vowels, 2 below vowels, and 5 above vowels) Tone marks: 4 Diacritics: 5 (4 above diacritics and 1 below diacritic) NonComposi- bles: 8 (1 nobreak space, 10 Thai digits, 6 Thai special characters, and 1 word separator) Note Thai digits are not recognized by the isdigit(), iswdigit(), isxdigit(), and iswxdigit(), isalnum(), and iswalnum() functions. Many applications make assumptions about how a digit character can be converted to its numeric equivalent. Changing the functions to rec- ognize Thai digits would break these applications. Code Ranges in the TACTIS Codeset In the TACTIS codeset, the most significant bit (MSB) of a byte is set on in codes for TIS 620-2533 characters. This differentiates TIS 620-2533 character code from ASCII character code. Following are the code ranges for each of the five categories of Thai characters in the codeset: -------------------------------------------------- Category Code Range (hex) -------------------------------------------------- Consonants A1 to CE Leading vowels E0 to E4 Normal following vowels D0, D2, D3, E5 Special following vowels C4, C6 Below vowels D8, D9 Above vowels D1, D4 to D7 Tone marks E8 to EB Above diacritics E7, EC to EE Below diacritics DA Nobreak space A0 Thai digits F0 to F9 Thai special characters CF, DF, E6, EF, FA, FB Word separator DC -------------------------------------------------- In TACTIS, the hexadecimal code points of TIS 620-2533 characters are as follows: A0 NO-BREAK SPACE C0 PO SAMPOW E0 SARA E A1 KO KAI C1 MO MA E1 SARA AE A2 KHO KHAI C2 YO YAK E2 SARA O A3 KHO KHUAT C3 RO RUA E3 SARA AI MAIMUAN A4 KHO KHWAI C4 RU E4 SARA AI MAIMALAI A5 KHO KHON C5 LO LING E5 LAKKHANGYAO A6 KHO RAKHANG C6 LU E6 MAIYAMOK A7 NGO NGU C7 WO WAEN E7 MAITAIKHU A8 CHO CHAN C8 SO SALA E8 MAI EK A9 CHO CHING C9 SO RUSI E9 MAI THO AA CHO CHANG CA SO SUA EA MAI TRIE AB SO SO CB HO HEEP EB MAI CHATTAWA AC CHO CHOE CC LO CHULA EC THANTHAKHAT AD YO YING CD O ANG ED NIKHANHIT AE DO CHADA CE HO NOKHUK EE YAMAKKAN AF TO PATAK CF PAIYANNOI EF FONGMAN B0 THO THO THAN D0 SARA A F0 THAI ZERO B1 THO NANGMONTHO D1 MAI HAN-AKAT F1 THAI ONE B2 THO PHOO THAO D2 SARA AA F2 THAI TWO B3 NOR NANE D3 SARA AM F3 THAI THREE B4 DOR DEK D4 SARA E F4 THAI FOUR B5 TO TAO D5 SARA EE F5 THAI FIVE B6 THO THUNG D6 SARA UR F6 THAI SIX B7 THO THAHAN D7 SARA UUR F7 THAI SEVEN B8 THO THONG D8 SARA U F8 THAI EIGHT B9 NO NU D9 SARA UU F9 THAI NINE BA BO BAIMAI DA PHINTHU FA ANGKHANKHU BB PO PLA DB FB KHOMUT BC PHO PERNG DC WORD SEPARATOR FC BD FO FA DD FD BE PO PAN DE FE BF FO FAN DF BAHT FF For more information on Thai characters, refer to Wototo(5). Fonts for TIS 620 2533 The operating system provides both screen and printer fonts for TIS 620 2533 characters. The following bitmap fonts reflect various sizes and typefaces for 75dpi and 100dpi display devices: -adecw-screen-medium-r-normal--14-140-75-75-p-70-tis620.2533-1 -adecw-screen-medium-r-normal--18-180-75-75-p-80-tis620.2533-1 -adecw- screen-medium-r-normal--24-240-75-75-p-120-tis620.2533-1 -adecw-screen-medium-r-normal--14-140-100-100-p-70-tis620.2533-1 -adecw-screen- medium-r-normal--18-180-100-100-p-80-tis620.2533-1 -adecw-screen-medium-r-normal--24-240-100-100-p-120-tis620.2533-1 The operating system provides the following Thai fonts for PostScript printers: AngsanaUPC-Bold AngsanaUPC-BoldItalic AngsanaUPC-Italic AngsanaUPC-Light CordiaUPC-Bold CordiaUPC-BoldItalic CordiaUPC-Italic CordiaUPC-Light EucrosiaUPC-Bold EucrosiaUPC-BoldItalic EucrosiaUPC- Italic EucrosiaUPC-Light FreesiaUPC-Bold FreesiaUPC-BoldItalic FreesiaUPC-Italic FreesiaUPC-Light IrisUPC-Bold IrisUPC-BoldItalic IrisUPC- Italic IrisUPC-Light JasmineUPC-Bold JasmineUPC-BoldItalic JasmineUPC-Italic JasmineUPC-Light KodchiangUPC-Bold KodchiangUPC-BoldItalic KodchiangUPC-Italic KodchiangUPC-Light LilyUPC-Bold LilyUPC-BoldItalic LilyUPC-Italic LilyUPC-Light WaterlilyUPC-Bold WaterlilyUPC- BoldItalic WaterlilyUPC-Italic WaterlilyUPC-Light YuccaUPC-Bold YuccaUPC-BoldItalic YuccaUPC-Italic YuccaUPC-Light For general information on printing Asian language text, refer to i18n_printing(5). Codeset Conversion The following converter pairs are available for converting data between TACTIS and other encoding formats. Refer to iconv_intro(5) for an introduction to codeset conversion. For more information about the other codeset for which TACTIS is the input or output, see the reference page specified in the list item. cp874_TACTIS, TACTIS_cp874 Converting from and to PC code page 874: code_page(5) UCS-2_TACTIS, TACTIS_UCS-2 Converting from and to UCS-2: Unicode(5) UCS-4_TACTIS, TACTIS_UCS-4 Converting from and to UCS-4: Unicode(5) UTF-8_TACTIS, TACTIS_UTF-8 Converting from and to UTF-8: Unicode(5) SEE ALSO
Commands: locale(1) Others: code_page(5), ascii(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), Thai(5), Unicode(5), Wototo(5) TACTIS(5)
Man Page