Hi,
How can I find out the charset on a Unix server (SUNOS 5.2)? I tried locale charmap and returned 646. What does 646 mean? If I send an xml file with encoding="utf-8", should the server be able to handle the file, even with special characters in it?
Thanks. (0 Replies)
I have a huge file with 20 fileds in each record and each field is seperated by "|". If i want to get all the reocrds that have 18th or for that matter any filed as null how can i do it? Please let me know (3 Replies)
Hello,
I am interested in finding and identifying suffixes for Indian names through an awk script or a perl program. Suffixes normally are found at the end of a word as is shown in the sample given below.
What I need is a perl script which will identify suffixes of a defined lenght to be given in... (4 Replies)
what's the relationship among locale, glibc, charset, charmap and fonts?
why locale needs to be generated by glibc? how?
what are in the locale-archive file?
and what are in font files? (0 Replies)
Hi,
A script is running for multiple databases so data is also being populated for multiple DBs in a.txt file.
I need to rename this file once all the data is populated. Kindly suggest me How can I check once file is populated completely before renaming?
Thanks in advance. (3 Replies)
Hello Experts, please help to provide any insight as I am facing issue migrating java application from hpux to redhat. The java program is using InputStreamReader to read a file without specifying any charset parameter.
However, in new Linux Redhat 5.6 environent, when reading a file that... (1 Reply)
I have an HP-UX server that runs a script each night. The script connects to an SFTP server and downloads all xml files (if any are present) from a certain folder, and then deletes the files from the SFTP server. So sometimes it will download a new file, sometimes it will download 2 or 3 new... (4 Replies)
Hi,
I have a file which contains few columns and the first column has the file names, and I would like to identify the missing file sequence number form the file and would copy to another file. My files has data in below format.
APKRISPSIN320131231201319_0983,1,54,125,... (5 Replies)
Hi all,
am running the following code on a RHEL 6.6 box to list which charsets are loaded and which are available:
#!/usr/bin/perl -w
use strict;
use Encode;
my @list = Encode->encodings();
my @all_encodings = Encode->encodings(":all");
print "@list\n\n";
print "@all_encodings\n";
... (3 Replies)
Hi Experts,
I have written the below script to check the missing files based on the date in the file name from current date to in a given interval of days.
In the file names we have dates along with some name. ex:jera_sit_2017-04-25-150325.txt.
The below script is working fine if we have only... (10 Replies)
Discussion started by: nalu
10 Replies
LEARN ABOUT OSF1
iso-2022
iso2022(5) File Formats Manual iso2022(5)NAME
iso2022, iso-2022, ISO-2022 - A character encoding mechanism standardized by the International Standards Organization (ISO)
DESCRIPTION
The ISO-2022 standard defines a mechanism for handling single-byte and multibyte characters. The standard specifies four classes of charac-
ter sets: The 94-charset class, which contains character sets with 94 positions (single-byte characters). Examples are the ASCII and JIS
X0201 character sets. The 96-charset class, which contains character sets with 96 positions (single-byte characters). Examples are the ISO
Latin series of character sets. The 94x94-charset class, which contains character sets with 94x94 positions (2-byte characters). Examples
are the GB 2312 and the CNS 11643 character sets. The 96x96-charset class, which contains character sets with 96x96 positions (2-byte
characters).
In the ISO-2022 standard, four registers, called G0, G1, G2 and G3, are used to reference a character set. Before a character set can be
used, the character set must be assigned, or designated, to one of these registers. The designation of a character set is done by using an
escape sequence in the following format:
ESC [I] F
In this format: Is an intermediate character that is used to designate a character set to one of the registers (G0, G1, G2, oR G3). Is a
unique final character of a particular character set.
The designation of a character set, whose final character is F, to different registers is as follows: Designates a multibyte character set
(94x94 or 96x96) to G0. Designates a character set in the 94-charset class to G0. Designates a character set in the 94-charset class to
G1. Designates a character set in the 94-charset class to G2. Designates a character set in the 94-charset class to G3. Designates a
character set in the 96-charset class to G1. Designates a character set in the 96-charset class to G2. Designates a character set in the
96-charset class to G3.
SEE ALSO
Commands: locale(1)
Others: ascii(5), i18n_intro(5), iso2022jp(5), l10n_intro(5)iso2022(5)