12-31-2012
Unicode help
is there any way to handle unicode such as ʃʰɐm̆
10 More Discussions You Might Find Interesting
1. UNIX for Advanced & Expert Users
Hi,
I have some software I need to install on HP-UX 11iv1 64bit but it must not be set up in unicode mode. I know unicode/ ASCII etc. I don't know how to get unix to switch between these. Is there an environment setting for that? I use the Korn shell.
Thanks. (2 Replies)
Discussion started by: rein
2 Replies
2. Programming
I have a stream of characters like "\u8BBE\u5907\u7BA1"
and i want to display it.
I tried following things already without any luck.
1) printf("%s",L("\u8BBE\u5907\u7BA1"));
2) printf("%lc",0x8BBE);
3) setlocale followed by fwide followed by wprintf
4) also changed the local manually... (3 Replies)
Discussion started by: jackdorso
3 Replies
3. UNIX for Advanced & Expert Users
Here at the agency I work for, a need has arisen for a subdomain that utilizes some unicode characters. It has something to do with our foreign clients getting "page could not be displayed" errors in their internationalized browsers. I am still investigating the issue, but I've been asked to find... (2 Replies)
Discussion started by: deckard
2 Replies
4. UNIX for Advanced & Expert Users
All,
I'm trying to grasp how to use Unicode with/in Unix. I've made progress on some fronts, for example, when uploading files to my server I can use the intermediary language to convert the file to UTF-8.
I'm having trouble getting Samba to do this (I'm using "unix charset" in smb.conf);... (4 Replies)
Discussion started by: effigy
4 Replies
5. Programming
Dear experts,
While developping a C UNICODE application under AIX 5.3, I encountered the following problem, and after days of investigations I still could not find any solution.
Please note that the application is full wchar_t based (not utf8) and that I could compile and run it without any... (4 Replies)
Discussion started by: tgilbert
4 Replies
6. Programming
on some distributions UTF-32 is the default and i need to change the size of wchar_t to 2 bytes. i tried to compile it with -fwide-exec-charset=UTF-16 but it didn't help. anyone have any ideas?
thanks,
Akos (3 Replies)
Discussion started by: Akimaki
3 Replies
7. Programming
im starting to go a little serious with c, woking in a personal project that will read a xml, which might contain Unicode characters (i know it will on my system, which is set to es_AR.UTF-8)
im using mxml, and the documentation says it uses utf8 internally (no worries here).
so i need to be... (4 Replies)
Discussion started by: broli
4 Replies
8. Programming
I'm trying to figure out how to support Unicode or atleast an unsigned char in the d_name of struct dirent
The problem i'm facing is that I'm checking file names for special characters and obviously the "char d_name" doesn't like it. I'm looping through the directory and getting the file... (3 Replies)
Discussion started by: james2432
3 Replies
9. Shell Programming and Scripting
hello
there's some stranges code symbols they looks like this:
\u0438 \u0247. unicode i think
this code can be viewed by javascript so i need it
i need to convert casual characters to this code with perl
atm stucked with ord, chr, pack, etc things but they giving other digits (7 Replies)
Discussion started by: tip78
7 Replies
10. Shell Programming and Scripting
I don't want HTML_CONTENT,RICH_CONTENT,TEXT_CONTENT columns data in the file and reset of data we need to extract.
Find the attached file.
Need to extract date in between DI_UX_ROW_END tag.
Can help me using unix command using AWK.
Thanks, (2 Replies)
Discussion started by: bmk
2 Replies
LEARN ABOUT DEBIAN
unicode
unicode(3tcl) Unicode normalization unicode(3tcl)
__________________________________________________________________________________________________________________________________________________
NAME
unicode - Implementation of Unicode normalization
SYNOPSIS
package require Tcl 8.3
package require unicode 1.0
::unicode::fromstring string
::unicode::tostring uclist
::unicode::normalize form uclist
::unicode::normalizeS form string
_________________________________________________________________
DESCRIPTION
This is an implementation in Tcl of the Unicode normalization forms.
COMMANDS
::unicode::fromstring string
Converts string to list of integer Unicode character codes which is used in unicode for internal string representation.
::unicode::tostring uclist
Converts list of integers uclist back to Tcl string.
::unicode::normalize form uclist
Normalizes Unicode characters list ulist according to form and returns the normalized list. Form form takes one of the following
values: D (canonical decomposition), C (canonical decomposition, followed by canonical composition), KD (compatibility decomposi-
tion), or KC (compatibility decomposition, followed by canonical composition).
::unicode::normalizeS form string
A shortcut to ::unicode::tostring [unicode::normalize $form [::unicode::fromstring $string]]. Normalizes Tcl string and returns
normalized string.
EXAMPLES
% ::unicode::fromstring "u0410u0411u0412u0413"
1040 1041 1042 1043
% ::unicode::tostring {49 50 51 52 53}
12345
%
% ::unicode::normalize D {7692 775}
68 803 775
% ::unicode::normalizeS KD "u1d2c"
A
%
REFERENCES
[1] "Unicode Standard Annex #15: Unicode Normalization Forms", (http://unicode.org/reports/tr15/)
AUTHORS
Sergei Golovan
BUGS, IDEAS, FEEDBACK
This document, and the package it describes, will undoubtedly contain bugs and other problems. Please report such in the category string-
prep of the Tcllib SF Trackers [http://sourceforge.net/tracker/?group_id=12883]. Please also report any ideas for enhancements you may
have for either package and/or documentation.
SEE ALSO
stringprep(3tcl)
KEYWORDS
normalization, unicode
COPYRIGHT
Copyright (c) 2007, Sergei Golovan <sgolovan@nes.ru>
stringprep 1.0.0 unicode(3tcl)