09-07-2007
Unix charset
Hi,
How can I find out the charset on a Unix server (SUNOS 5.2)? I tried locale charmap and returned 646. What does 646 mean? If I send an xml file with encoding="utf-8", should the server be able to handle the file, even with special characters in it?
Thanks.
9 More Discussions You Might Find Interesting
1. SuSE
Hi,
I am a newbie to Linux(Suse).I am facing a problem with 'sqlldr' utility while trying to upload data to Database tables.My backend is Oracle and is using the UTF8 encoding format.I am trying to load a datafile which contains some Western European Characters.While loading am getting an... (0 Replies)
Discussion started by: DILEEP410
0 Replies
2. Shell Programming and Scripting
Hi all,
My objective is to find out the charset using which a file is encoded. (The OS is SunOs)
I have set NLS_LANG to AR8MSWIN1256 and spooled the file.
When viewed the file using vi, I saw the following
\307\341\321\355\307\326
I then inserted the line containing these codes in a... (3 Replies)
Discussion started by: sridhar_423
3 Replies
3. UNIX for Dummies Questions & Answers
How does unix system administration, unix programming, unix network programming differ?
Please help. (0 Replies)
Discussion started by: thulasidharan2k
0 Replies
4. Shell Programming and Scripting
Hi All
I'm using a tree command in a script that for me outputs:-
| - - DIRECTORYNAME
However a different user is getting the following output:-
aaa (actually with an umlat above them) DIRECTORYNAME
I'm not sure where this could be coming from, any ideas anyone? (0 Replies)
Discussion started by: Bashingaway
0 Replies
5. UNIX for Dummies Questions & Answers
what's the relationship among locale, glibc, charset, charmap and fonts?
why locale needs to be generated by glibc? how?
what are in the locale-archive file?
and what are in font files? (0 Replies)
Discussion started by: vistastar
0 Replies
6. UNIX for Advanced & Expert Users
Hello Experts, please help to provide any insight as I am facing issue migrating java application from hpux to redhat. The java program is using InputStreamReader to read a file without specifying any charset parameter.
However, in new Linux Redhat 5.6 environent, when reading a file that... (1 Reply)
Discussion started by: sonic_air
1 Replies
7. Shell Programming and Scripting
Dear All,
Can someone help to command or program to transfer the file from windows to Unix server and from one unix server to another Unix server in secure way.
I would request no samba client. (4 Replies)
Discussion started by: yadavricky
4 Replies
8. UNIX for Dummies Questions & Answers
Hi All,
I'm facing an issue when i ssh to a router and exporting the output to a txt file.
ssh johndoe@10.0.0.1 -a | tee file.txt
Closing the connection and opening the .txt file. There are strange 'domino's' appearing here and there. See the screenshot below.
... (2 Replies)
Discussion started by: Antonio Fargas
2 Replies
9. Red Hat
Hi all,
am running the following code on a RHEL 6.6 box to list which charsets are loaded and which are available:
#!/usr/bin/perl -w
use strict;
use Encode;
my @list = Encode->encodings();
my @all_encodings = Encode->encodings(":all");
print "@list\n\n";
print "@all_encodings\n";
... (3 Replies)
Discussion started by: Fundix
3 Replies
LEARN ABOUT OSF1
iso-2022
iso2022(5) File Formats Manual iso2022(5)
NAME
iso2022, iso-2022, ISO-2022 - A character encoding mechanism standardized by the International Standards Organization (ISO)
DESCRIPTION
The ISO-2022 standard defines a mechanism for handling single-byte and multibyte characters. The standard specifies four classes of charac-
ter sets: The 94-charset class, which contains character sets with 94 positions (single-byte characters). Examples are the ASCII and JIS
X0201 character sets. The 96-charset class, which contains character sets with 96 positions (single-byte characters). Examples are the ISO
Latin series of character sets. The 94x94-charset class, which contains character sets with 94x94 positions (2-byte characters). Examples
are the GB 2312 and the CNS 11643 character sets. The 96x96-charset class, which contains character sets with 96x96 positions (2-byte
characters).
In the ISO-2022 standard, four registers, called G0, G1, G2 and G3, are used to reference a character set. Before a character set can be
used, the character set must be assigned, or designated, to one of these registers. The designation of a character set is done by using an
escape sequence in the following format:
ESC [I] F
In this format: Is an intermediate character that is used to designate a character set to one of the registers (G0, G1, G2, oR G3). Is a
unique final character of a particular character set.
The designation of a character set, whose final character is F, to different registers is as follows: Designates a multibyte character set
(94x94 or 96x96) to G0. Designates a character set in the 94-charset class to G0. Designates a character set in the 94-charset class to
G1. Designates a character set in the 94-charset class to G2. Designates a character set in the 94-charset class to G3. Designates a
character set in the 96-charset class to G1. Designates a character set in the 96-charset class to G2. Designates a character set in the
96-charset class to G3.
SEE ALSO
Commands: locale(1)
Others: ascii(5), i18n_intro(5), iso2022jp(5), l10n_intro(5)
iso2022(5)