Unix and Linux Discussions Tagged with unicode |
|
Thread / Thread Starter |
Last Post |
Replies |
Views |
Forum |
|
|
|
11 |
9,117 |
UNIX for Advanced & Expert Users |
|
|
|
1 |
2,417 |
Shell Programming and Scripting |
|
|
|
3 |
7,608 |
Shell Programming and Scripting |
|
|
|
5 |
3,092 |
Programming |
|
|
|
3 |
5,201 |
OS X (Apple) |
|
|
|
3 |
5,739 |
Programming |
|
|
|
1 |
4,137 |
UNIX for Advanced & Expert Users |
|
|
|
0 |
6,172 |
UNIX for Advanced & Expert Users |
|
|
|
3 |
14,226 |
UNIX for Dummies Questions & Answers |
|
|
|
0 |
3,148 |
Red Hat |
|
|
|
0 |
4,237 |
Windows & DOS: Issues & Discussions |
|
|
|
10 |
18,172 |
Shell Programming and Scripting |
|
|
|
0 |
1,019 |
Software Releases - RSS News |
|
|
|
1 |
9,915 |
UNIX for Dummies Questions & Answers |
|
|
|
0 |
2,434 |
UNIX for Dummies Questions & Answers |
|
|
|
0 |
1,114 |
Software Releases - RSS News |
|
|
|
10 |
51,486 |
UNIX for Advanced & Expert Users |
|
|
|
0 |
688 |
Software Releases - RSS News |
|
|
|
0 |
987 |
Software Releases - RSS News |
|
|
|
0 |
2,608 |
Software Releases - RSS News |
|
|
|
0 |
5,533 |
AIX |
|
|
|
0 |
786 |
Software Releases - RSS News |
|
|
|
1 |
18,366 |
Shell Programming and Scripting |
|
|
|
0 |
1,289 |
Software Releases - RSS News |
|
|
|
0 |
901 |
Software Releases - RSS News |
|
|
|
0 |
1,302 |
Security Advisories (RSS) |
|
|
|
0 |
1,662 |
Software Releases - RSS News |
|
|
|
0 |
1,036 |
Software Releases - RSS News |
|
|
|
3 |
7,479 |
Shell Programming and Scripting |
|
|
|
0 |
939 |
Software Releases - RSS News |
|
|
|
0 |
913 |
Software Releases - RSS News |
unicode(n) Unicode normalization unicode(n)
__________________________________________________________________________________________________________________________________________________
NAME
unicode - Implementation of Unicode normalization
SYNOPSIS
package require Tcl 8.3
package require unicode 1.0
::unicode::fromstring string
::unicode::tostring uclist
::unicode::normalize form uclist
::unicode::normalizeS form string
_________________________________________________________________
DESCRIPTION
This is an implementation in Tcl of the Unicode normalization forms.
COMMANDS
::unicode::fromstring string
Converts string to list of integer Unicode character codes which is used in unicode for internal string representation.
::unicode::tostring uclist
Converts list of integers uclist back to Tcl string.
::unicode::normalize form uclist
Normalizes Unicode characters list ulist according to form and returns the normalized list. Form form takes one of the following
values: D (canonical decomposition), C (canonical decomposition, followed by canonical composition), KD (compatibility decomposi-
tion), or KC (compatibility decomposition, followed by canonical composition).
::unicode::normalizeS form string
A shortcut to ::unicode::tostring [unicode::normalize $form [::unicode::fromstring $string]]. Normalizes Tcl string and returns
normalized string.
EXAMPLES
% ::unicode::fromstring "u0410u0411u0412u0413"
1040 1041 1042 1043
% ::unicode::tostring {49 50 51 52 53}
12345
%
% ::unicode::normalize D {7692 775}
68 803 775
% ::unicode::normalizeS KD "u1d2c"
A
%
REFERENCES
[1] "Unicode Standard Annex #15: Unicode Normalization Forms", (http://unicode.org/reports/tr15/)
AUTHORS
Sergei Golovan
BUGS, IDEAS, FEEDBACK
This document, and the package it describes, will undoubtedly contain bugs and other problems. Please report such in the category string-
prep of the Tcllib SF Trackers [http://sourceforge.net/tracker/?group_id=12883]. Please also report any ideas for enhancements you may
have for either package and/or documentation.
SEE ALSO
stringprep(n)
KEYWORDS
normalization, unicode
COPYRIGHT
Copyright (c) 2007, Sergei Golovan <sgolovan@nes.ru>
stringprep 1.0.0 unicode(n)