FYI (mostly off topic), I think the hyphen-minus is considered both an ASCII char as well as Unicode.
######### START MOSTLY OFF TOPIC REFERENCE #########
Reference:
Quote:
The hyphen-minus (-) is a character used in digital documents and computing to represent a hyphen () or a minus sign (').
It is present in Unicode as code point U+002D - HYPHEN-MINUS; it is also in ASCII with the same value.
Where is the hyphen in duckduckgo.com ?
My understanding is that hyphens used in domain names are generally coded in ASCII, FYI.
Reference:
Quote:
The characters allowed in a domain name include letters (abc), numbers (123), and dashes/hyphens (---). No spaces are allowed and the domain name can't begin or end with dash/hyphen.
In the case above (in a permitted domain name char), the hyphen is considered ASCII.
Anyway, this is not germane to your question, which is related to GTK applications.
######### END MOSTLY OFF TOPIC REFERENCE #########
My best guess is that your GTK application has an input filter which checks for non-ASCII chars and pops up an error message when non-ASCII is detected.
The way around this, off the top of my head, is of course to look at the source code, verify the code which is detecting the non-ASCII char (and giving the error pop up) and then comment that code out and recompile and test it.
I have a stream of characters like "\u8BBE\u5907\u7BA1"
and i want to display it.
I tried following things already without any luck.
1) printf("%s",L("\u8BBE\u5907\u7BA1"));
2) printf("%lc",0x8BBE);
3) setlocale followed by fwide followed by wprintf
4) also changed the local manually... (3 Replies)
Hi all,
I am generating a file on the Unix machine , now i want to FTP the same file to the NT machine.
how can i do that and the application currently upon which i am working is a JAVA based application.
I need your help.
regards
Ruchir (2 Replies)
Noob question ..
My Java based application needs to change some user passwords based on some user actions. Since this application can run on Redhat AS2.1 / AS4.0 / Solaris 9 etc, the most safe and portable solution that I could think of was: Use expect.
Now, expect is not available on all... (1 Reply)
Here at the agency I work for, a need has arisen for a subdomain that utilizes some unicode characters. It has something to do with our foreign clients getting "page could not be displayed" errors in their internationalized browsers. I am still investigating the issue, but I've been asked to find... (2 Replies)
Hello all !
I'm trying to write a shell script (bash) to ftp a file starting with particular name like "Latest_" that is present on a Windows box to UNIX server. Basically I want to set this script in the cron so that daily the new build that is posted on the Windows box can be downloaded to the... (2 Replies)
Hello, I have a question. There is a command line mail client "mail", it is good, but obviously, does not support Unicode. Are there any (other) mail clients for command line having support for Unicode (UTF-8) and maybe other encodings? Or are there any other versions of mail/mailx programm which... (0 Replies)
hi everybody,
currently i'm playing with perl and Gtk2.
i've found a fairly old but nice looking example of a client/server application which is written in perl and Gtk2.
the server part works perfect but i can't start the client part and keep getting following error message:
$ ./client-gui.pl... (1 Reply)
I don't want HTML_CONTENT,RICH_CONTENT,TEXT_CONTENT columns data in the file and reset of data we need to extract.
Find the attached file.
Need to extract date in between DI_UX_ROW_END tag.
Can help me using unix command using AWK.
Thanks, (2 Replies)
WE have a file coming from a server that has characters for 4-5 languages. If I download the file to my windows PC and open in Notepad ++, I can clearly see the text in different languages. Notepad++ is able to reder text that is in Portugese, French, Thai etc. My objective it to do the following:... (2 Replies)
Hello my dear friends,
Two file are auto generated from mon - fri at different directories on same windows box.Every day i have to copy the file, rename it (specific name)and ftp it to linux box specified directory.
is it possible to automate this process,If yes this has to be done from windows... (1 Reply)
Discussion started by: umesh yadav
1 Replies
LEARN ABOUT DEBIAN
unicode
unicode(3tcl) Unicode normalization unicode(3tcl)__________________________________________________________________________________________________________________________________________________NAME
unicode - Implementation of Unicode normalization
SYNOPSIS
package require Tcl 8.3
package require unicode 1.0
::unicode::fromstring string
::unicode::tostring uclist
::unicode::normalize form uclist
::unicode::normalizeS form string
_________________________________________________________________DESCRIPTION
This is an implementation in Tcl of the Unicode normalization forms.
COMMANDS
::unicode::fromstring string
Converts string to list of integer Unicode character codes which is used in unicode for internal string representation.
::unicode::tostring uclist
Converts list of integers uclist back to Tcl string.
::unicode::normalize form uclist
Normalizes Unicode characters list ulist according to form and returns the normalized list. Form form takes one of the following
values: D (canonical decomposition), C (canonical decomposition, followed by canonical composition), KD (compatibility decomposi-
tion), or KC (compatibility decomposition, followed by canonical composition).
::unicode::normalizeS form string
A shortcut to ::unicode::tostring [unicode::normalize $form [::unicode::fromstring $string]]. Normalizes Tcl string and returns
normalized string.
EXAMPLES
% ::unicode::fromstring "u0410u0411u0412u0413"
1040 1041 1042 1043
% ::unicode::tostring {49 50 51 52 53}
12345
%
% ::unicode::normalize D {7692 775}
68 803 775
% ::unicode::normalizeS KD "u1d2c"
A
%
REFERENCES
[1] "Unicode Standard Annex #15: Unicode Normalization Forms", (http://unicode.org/reports/tr15/)
AUTHORS
Sergei Golovan
BUGS, IDEAS, FEEDBACK
This document, and the package it describes, will undoubtedly contain bugs and other problems. Please report such in the category string-
prep of the Tcllib SF Trackers [http://sourceforge.net/tracker/?group_id=12883]. Please also report any ideas for enhancements you may
have for either package and/or documentation.
SEE ALSO stringprep(3tcl)KEYWORDS
normalization, unicode
COPYRIGHT
Copyright (c) 2007, Sergei Golovan <sgolovan@nes.ru>
stringprep 1.0.0 unicode(3tcl)