11-17-2009
Unicode programing in C
im starting to go a little serious with c, woking in a personal project that will read a xml, which might contain Unicode characters (i know it will on my system, which is set to es_AR.UTF-8)
im using mxml, and the documentation says it uses utf8 internally (no worries here).
so i need to be sure im using utf8 in my program. to be sure that i can safely interact with mxml and to be sure my program will work in all languages.
i have been reading alot, but i dont quite fully get how i can accomplish this.
im going for something simple, something easy that wont demand much of me.
my program will read user input (from cli for now, gtk later), and will save it in xml (is a config file for other app). it will also have the option to read a xml, and use it as a base for a new one
now, i have a few concrete questions.
a) do i have to use a special type of variable?
if a) is true, then i need a hole new set of functions? (for strcmp, or strstr)
b) can i work with unicode characters using char *?
if b) is true, how do i "make" them utf8?
c) is a mix of the above? then how to choose the mix ratio?
i appreciate any help, manual, link, ect that can help me understand how this works. (that includes source code)
thanks
9 More Discussions You Might Find Interesting
1. UNIX Desktop Questions & Answers
Hi all!
I`m new in Unix (Linux) and i whant to ask something!
What language should i use for Linux developing.I meen applications an GAME DEVELOPING!
Should i use C,TCL ??? Please help me on this ...:( (1 Reply)
Discussion started by: Sebastyan
1 Replies
2. Programming
I am having a windows and i would like to know whitch program do you prefer for programing in windows
P.S. C++ (1 Reply)
Discussion started by: D.Borak
1 Replies
3. Programming
I have a stream of characters like "\u8BBE\u5907\u7BA1"
and i want to display it.
I tried following things already without any luck.
1) printf("%s",L("\u8BBE\u5907\u7BA1"));
2) printf("%lc",0x8BBE);
3) setlocale followed by fwide followed by wprintf
4) also changed the local manually... (3 Replies)
Discussion started by: jackdorso
3 Replies
4. HP-UX
How can program at crontab dayly each 30 minut (2 Replies)
Discussion started by: petroleo
2 Replies
5. Shell Programming and Scripting
plx help to solve these problems??
1. Create a HERE document which will edit multiple files in the same directory, using the ed editor. I give you 3 original files: file1.c , file2.c , file3.c, download them and change each string "stdio.h" to "STDIO.H" in these files. Note: when execute the... (1 Reply)
Discussion started by: SoCalledEngr
1 Replies
6. Shell Programming and Scripting
Hi...
i need to write a shell script wich shows the full name and
station of every logged user in the system.
pls help! (1 Reply)
Discussion started by: relu89
1 Replies
7. IP Networking
Hello experts,
please help me as i want to learn the networking concepts in details ,
as i come know Unix network programming by Richard Stevens volume 1,2
is good please any of you downloaded the Free PDF version of it please direct m e as i want to download these books or the pdf form of it,... (1 Reply)
Discussion started by: vin_pll
1 Replies
8. IP Networking
hi
I am strucked in a client server program
client need to login to server
client logins if only username and password are correct
i have written a program
username is stored as file and password is smilar to username whic is stored in that file
when server asks for username... (2 Replies)
Discussion started by: karthik1238
2 Replies
9. Shell Programming and Scripting
i wrote an awk progarm to calculate throughput from a ns2 trace file. i want this program to act on multiple trace files and it should display each output in a single output file can anyone please clear my doubt i tried with awk -f awkscript inputfile1... (7 Replies)
Discussion started by: sarathyy
7 Replies
LEARN ABOUT DEBIAN
perlcn
PERLCN(1) Perl Programmers Reference Guide PERLCN(1)
NAME
perlcn - XXXX Perl XX
DESCRIPTION
XXXX Perl XXX!
X 5.8.0 XXX, Perl XXXXXX Unicode (XXX) XX, XXXXXXXXXXXXXXXXXXX; CJK (XXX) XXXXXXXX. Unicode XXXXXXX, XXXXXXXXXXXX: XXXX, XXXX, XXXXXXXX
(XXX, XXXX, XXXX, XXXX, XXX, XXXX, XX). XXXXXXXXXXXXXX (X PC XXXX).
Perl XXX Unicode XXXX. XXX Perl XXXXXXXXXX Unicode XX; Perl XXXXXX (XXXXXXXXX) XXX Unicode XXXX. XXXXXXX, XXXXX Unicode XXXXXXXXXXXX, Perl
XXX Encode XXXX, XXXXXXXXXXXXXXXXXXX.
Encode XXXXXXXXXXXXXXXXX ('gb2312' XX 'euc-cn'):
euc-cn Unix XXXXX, XXXXXXXXX
gb2312-raw XXXXX (XXX) GB2312 XXX
gb12345 XXXXXXXXXXXXXX
iso-ir-165 GB2312 + GB6345 + GB8565 + XXXX
cp936 XXX 936, XXXX 'GBK' (XXXXX) XX
hz 7 XXXXX GB2312 XX
XXXX, X EUC-CN XXXXXXX Unicode, XXXXXXXX:
perl -Mencoding=euc-cn,STDOUT,utf8 -pe1 < file.euc-cn > file.utf8
Perl XXXX "piconv", XXXXX Perl XXXXXXXXXXX, XXXX:
piconv -f euc-cn -t utf8 < file.euc-cn > file.utf8
piconv -f utf8 -t euc-cn < file.utf8 > file.euc-cn
XX, XX encoding XX, XXXXXXXXXXXXXXXXX, XXXX:
#!/usr/bin/env perl
# XX euc-cn XXXX; XXXXXXXXXXXXX euc-cn XX
use encoding 'euc-cn', STDIN => 'euc-cn', STDOUT => 'euc-cn';
print length("XX"); # 2 (XXXXXXX)
print length('XX'); # 4 (XXXXXXX)
print index("XXXX", "XX"); # -1 (XXXXXXXX)
print index('XXXX', 'XX'); # 1 (XXXXXXXX)
XXXXXXXX, "X" XXXXXXX "X" XXXXXXXXX EUC-CN XX "X"; "X" XXXXXXXX "X" XXXXXXXXX "X". XXXXXX EUC-CN XXXXXXXXXXX.
XXXXXXX
XXXXXXXXXXX, XXX CPAN (<http://www.cpan.org/>) XX Encode::HanExtra XX. XXXXXXXXXXX:
gb18030 XXXXXXX, XXXXXX
XX, Encode::HanConvert XXXXXXXXXXXXXXXX:
big5-simp Big5 XXXXX Unicode XXXXXX
gbk-trad GBK XXXXX Unicode XXXXXX
XXX GBK X Big5 XXXX, XXXXXXXXX b2g.pl X g2b.pl XXXX, XXXXXXXXXXX:
use Encode::HanConvert;
$euc_cn = big5_to_gb($big5); # X Big5 XX GBK
$big5 = gb_to_big5($euc_cn); # X GBK XX Big5
XXXXXX
XXX Perl XXXXXXXXX (XXXXXXXXX), XXXXXXX Perl XXX, XX Unicode XXXXX. XX, XXXXXXXXX:
XX Perl XXXXX
<http://www.perl.com/>
Perl XXX (XXXXXXXX)
<http://www.cpan.org/>
Perl XXXXX (Comprehensive Perl Archive Network)
<http://lists.perl.org/>
Perl XXXXXX
XX Perl XXX
<http://www.oreilly.com.cn/indexcat.php?c=perl>
XXXXXXXXX Perl XX
Perl XXXXX
<http://www.pm.org/groups/asia.html>
XX Perl XXXXX
Unicode XXXX
<http://www.unicode.org/>
Unicode XXXX (Unicode XXXXXX)
<http://www.cl.cam.ac.uk/%7Emgk25/unicode.html>
Unix/Linux XX UTF-8 X Unicode XXX
SEE ALSO
Encode, Encode::CN, encoding, perluniintro, perlunicode
AUTHORS
Jarkko Hietaniemi <jhi@iki.fi>
Audrey Tang (XX) <audreyt@audreyt.org>
perl v5.14.2 2011-09-26 PERLCN(1)