Sponsored Content
Full Discussion: Unix Codes for Diacritics
Top Forums Shell Programming and Scripting Unix Codes for Diacritics Post 302454422 by Krishna Birth on Saturday 18th of September 2010 05:31:35 AM
Old 09-18-2010
Bug Unix Codes for Diacritics

Could someone please correctly tell the codes to use on Unix operating systems to produce the below diacritics:


Code:
A 
Ā = Unicode Character 'LATIN CAPITAL LETTER A WITH MACRON' (U+0100) 
a 
ā = Unicode Character 'LATIN SMALL LETTER A WITH MACRON' (U+0101) 



D 
Ḍ = Unicode Character 'LATIN CAPITAL LETTER D WITH DOT BELOW' (U+1E0C) 
d 
ḍ = Unicode Character 'LATIN SMALL LETTER D WITH DOT BELOW' (U+1E0D) 



H 
Ḥ = Unicode Character 'LATIN CAPITAL LETTER H WITH DOT BELOW' (U+1E24) 
h 
ḥ = Unicode Character 'LATIN SMALL LETTER H WITH DOT BELOW' (U+1E25) 



I 
Ī = Unicode Character 'LATIN CAPITAL LETTER I WITH MACRON' (U+012A) 
i 
ī = Unicode Character 'LATIN SMALL LETTER I WITH MACRON' (U+012B) 



L 
Ḷ = Unicode Character 'LATIN CAPITAL LETTER L WITH DOT BELOW' (U+1E36) 

l 
ḷ = Unicode Character 'LATIN SMALL LETTER L WITH DOT BELOW' (U+1E37) 

L 
L̐ = 004C LATIN CAPITAL LETTER L 0310 COMBINING CANDRABINDU - UniView 
l 
l̐ = 006C LATIN SMALL LETTER L 0310 COMBINING CANDRABINDU  - String analyser 



M 
Ṁ ̇= Unicode Character 'LATIN CAPITAL LETTER M WITH DOT ABOVE' (U+1E40) 
m 
ṁ = Unicode Character 'LATIN SMALL LETTER M WITH DOT ABOVE' (U+1E41) 



N 
Ñ = Unicode Character 'LATIN CAPITAL LETTER N WITH TILDE' (U+00D1) 
n 
ñ = Unicode Character 'LATIN SMALL LETTER N WITH TILDE' (U+00F1) 

N 
Ṇ = Unicode Character 'LATIN CAPITAL LETTER N WITH DOT BELOW' (U+1E46) 
n 
ṇ = Unicode Character 'LATIN SMALL LETTER N WITH DOT BELOW' (U+1E47) 

N 
Ṅ = Unicode Character 'LATIN CAPITAL LETTER N WITH DOT ABOVE' (U+1E44) 
n 
ṅ = Unicode Character 'LATIN SMALL LETTER N WITH DOT ABOVE' (U+1E45) 



R 
Ṛ = Unicode Character 'LATIN CAPITAL LETTER R WITH DOT BELOW' (U+1E5A) 
r 
ṛ = Unicode Character 'LATIN SMALL LETTER R WITH DOT BELOW' (U+1E5B) 

R 
Ṝ = Unicode Character 'LATIN CAPITAL LETTER R WITH DOT BELOW AND MACRON' (U+1E5C) 
r 
ṝ = Unicode Character 'LATIN SMALL LETTER R WITH DOT BELOW AND MACRON' (U+1E5D) 



S 
Ṣ = Unicode Character 'LATIN CAPITAL LETTER S WITH DOT BELOW' (U+1E62) 
s 
ṣ = Unicode Character 'LATIN SMALL LETTER S WITH DOT BELOW' (U+1E63) 

S 
Ś = Unicode Character 'LATIN CAPITAL LETTER S WITH ACUTE' (U+015A) 
s 
ś = Unicode Character 'LATIN SMALL LETTER S WITH ACUTE' (U+015B) 



T 
Ṭ = Unicode Character 'LATIN CAPITAL LETTER T WITH DOT BELOW' (U+1E6C) 
t 
ṭ = Unicode Character 'LATIN SMALL LETTER T WITH DOT BELOW' (U+1E6D) 



U 
Ū = Unicode Character 'LATIN CAPITAL LETTER U WITH MACRON' (U+016A) 
u 
ū = Unicode Character 'LATIN SMALL LETTER U WITH MACRON' (U+016B)


I need to find this for a project/coder's question?



Best,



Meeकu
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

escape codes

I did a search and found the link for escape codes,- but I am not sure how to modify this script to set the margins. I started with a script that was already written on my system to set a printer to print landscape. I need to send an report that is in an ascii file to multiple printers from my... (1 Reply)
Discussion started by: MizzGail
1 Replies

2. UNIX for Dummies Questions & Answers

unix return codes

Suppose I have a script which is monitoring a directory whenever a file drops in that directory,it sends alert say I want to write a return code for the above script which on successful execution of script gives a return value Based on return code , I want to do initiate some jobs in other... (1 Reply)
Discussion started by: abhib45
1 Replies

3. Programming

how to write c codes in UNIX

Sorry for asking a very basic question. How do i compile my c codes in UNIX. That is how do i know whether any c compiler is there in my system? If nothing is present where from do i get one and load into my system? (12 Replies)
Discussion started by: rraajjiibb
12 Replies

4. UNIX for Dummies Questions & Answers

Return codes

Hi, Can anyone tell me if there are return codes for SFTP? If so how would you capture them? I've tried 'man sftp' but its not particularly helpful. Many thanks Helen :confused: (4 Replies)
Discussion started by: Bab00shka
4 Replies

5. UNIX for Advanced & Expert Users

Know any good codes?

Know any good codes? (2 Replies)
Discussion started by: fgjiu
2 Replies

6. UNIX for Dummies Questions & Answers

How do you Sum UNIX return codes

Hi, I know how to read a return code after executing a single command. "echo $?". But I do not know how to sum the return code for a group of commands. If I string 3 commands together and I do an echo $? all I get is the retunr code for the last command. Example below: ... (3 Replies)
Discussion started by: z1tt45uuprs7
3 Replies

7. UNIX and Linux Applications

Printing unix error codes

Hi Friends, I need your help in understanding this: :rolleyes: I want to print the error numbers returned by the system calls (not the actual integers but the strings that they match to). The error numbers like EACCESS, ENOMEM, ENOENT etc ... For eg: returnCode = open(somefile,... (2 Replies)
Discussion started by: vijaya2006
2 Replies

8. Linux

Exit codes

I am trying to run this SH on Linux and getting error at IF condition. I want to read the EXIT code and send the failure or success message. Please help me on this. This worked when i was running on Solaris. #!/bin/bash $ORACLE_HOME/bin/sqlplus abc/xyz@qwe @/home/test.sql if ;... (4 Replies)
Discussion started by: rlmadhav
4 Replies

9. UNIX for Dummies Questions & Answers

Return Codes...

Not sure if this is of any use but...... I was messing around with getting return codes greater than 255 for special usage... Of course the code could be made simple but in this code the new stored return code is generated as exit is progressing... #!/bin/sh # Real and imaginary return... (9 Replies)
Discussion started by: wisecracker
9 Replies

10. UNIX for Beginners Questions & Answers

How to remove unused html codes from the file using UNIX?

Hi All, We have a HTML source which will be processed using a informatica workflow. In between these two we have a Unix script which transforms the file. We are getting an error from past week in the informatica saying invalid format, because the file has unused html reference (0-8,14-31 etc)... (2 Replies)
Discussion started by: karthik adiga
2 Replies
thai/wtt.h(3)							      libthai							     thai/wtt.h(3)

NAME
thai/wtt.h - WTT I/O implementation. SYNOPSIS
Enumerations enum WTTClass { CTRL = 0, NON = 1, CONS = 2, LV = 3, FV1 = 4, FV2 = 5, FV3 = 6, BV1 = 7, BV2 = 8, BD = 9, TONE = 10, AD1 = 11, AD2 = 12, AD3 = 13, AV1 = 14, AV2 = 15, AV3 = 16 } Classification of characters in TIS620 according to WTT. enum WTTOp { CP = 1, XC = 2, AC = 3, RJ = 4, SR = 5 } Composibility checking tables. Functions WTTClass TACchtype (thchar_t c) WTT character class. WTTOp TACio_op (thchar_t c1, thchar_t c2) WTT I/O operation. Detailed Description WTT I/O implementation. WTT stands for Wing Tuk Tee (in Thai, Runs everywhere). It was defined by TACTIS (Thai API Consortium/Thai Industrial Standard) in the NECTEC Thai Software Standard Project (1989-1991), and later endorsed by Thai Industrial Standard Institute (TISI) as TIS 1566-2541 in 1998. WTT classifies Thai chracter(TIS-620) into 17 types below. ITYPE VALUE SHORT_DESCRIPTION CTRL 0 control characters NON 1 non composible characters CONS 2 consonants LV 3 leading vowels FV1 4 following vowels 1 FV2 5 following vowels 2 FV3 6 following vowels 3 BV1 7 below vowels 1 BV2 8 below vowels 2 BD 9 below diacritics TONE 10 tonemarks AD1 11 above diacritics 1 AD2 12 above diacritics 2 AD3 13 above diacritics 3 AV1 14 above vowels 1 AV2 15 above vowels 2 AV3 16 above vowels 3 Functions in thctype.h do basic character classifications while wtt.h classifies a chracter in detail. Please refer to the reference. TACio_op() checks how to compose two given chracters. The possiblities are Composible (CP), Non-display (XC), Accept (AC), Reject (RJ) and Strict Reject (SR). The values of CP, XC, AC, RJ and SR are difined in wtt.h. And their meanings are: o CP : second character is displayed in the same cell as the first, also implies an acceptance. o XC : Do nothing. o AC : Display second character in the next cell. o RJ : Discard second character. o SR : Reject second character only in strict mode. Enumeration Type Documentation enum WTTClass Classification of characters in TIS620 according to WTT. Enumerator CTRL control chars NON non composibles CONS consonants LV leading vowels FV1 following vowels 1 FV2 following vowels 2 FV3 following vowels 3 BV1 below vowels 1 BV2 below vowels 2 BD below diacritics TONE tonemarks AD1 above diacritics 1 AD2 above diacritics 2 AD3 above diacritics 3 AV1 above vowels 1 AV2 above vowels 2 AV3 above vowels 3 enum WTTOp Composibility checking tables. Enumerator CP COMPOSIBLE - following char is displayed in the same cell as leading char, also implies ACCEPT XC Non-display AC ACCEPT - display the following char in the next cell RJ REJECT - discard that following char, ignore it SR STRICT REJECT - REJECT only if in strict mode Function Documentation WTTClass TACchtype (thchar_tc) WTT character class. Parameters: c : the character Returns: WTT character class { CTRL, ..., AV3 } WTTOp TACio_op (thchar_tc1, thchar_tc2) WTT I/O operation. Parameters: c1 : previous character c2 : next character Returns: WTT I/O operation { CP, XC, AC, RJ, SR } Author Generated automatically by Doxygen for libthai from the source code. Version 0.1.14 Tue Jun 17 2014 thai/wtt.h(3)
All times are GMT -4. The time now is 07:41 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy