BIG5(5) BSD File Formats Manual BIG5(5)NAME
big5 -- ``Big Five'' encoding for Traditional Chinese text
SYNOPSIS
ENCODING "BIG5"
DESCRIPTION
``Big Five'' is the de facto standard for encoding Traditional Chinese text. Each character is represented by either one or two bytes.
Characters from the ASCII character set are represented as single bytes in the range 0x00 - 0x7F. Traditional Chinese characters are repre-
sented by two bytes: the first in the range 0xA1 - 0xFE, the second in the range 0x40 - 0xFE.
SEE ALSO euc(5), gb18030(5), utf8(5)BSD August 7, 2003 BSD
Check Out this Related Man Page
sbig5(5) File Formats Manual sbig5(5)NAME
sbig5 - A character encoding system (codeset) for Traditional Chinese
DESCRIPTION
The Shift Big-5 (sbig5) codeset is a variant of the Big-5 codeset (see big5(5)). The only difference between these codesets is that the
second byte of some Big-5 characters are mapped to different values in the Shift Big-5 codeset. The remapping is done to avoid having some
metacharacters like *, which has special meaning to UNIX commands, in the second byte of a 2-byte Big-5 character.
The mappings of Big-5 characters to Shift Big-5 characters are as follows:
------------------------------------------------------------------------
Big-5 (2nd Character Symbol Shift Big-5 (2nd Character Symbol
byte) byte)
------------------------------------------------------------------------
40 @ 30 0
5B [ 31 1
5C 32 2
5D ] 33 3
5E ^ 34 4
5F - 35 5
60 ` 36 6
7B { 37 7
7C | 38 8
7D } 39 9
7E ~ 9F nil
------------------------------------------------------------------------
The sbig5 codeset is not supported by a locale but only through codeset conversion.
Codeset Conversion
The following codeset converter pairs are available for converting Traditional Chinese characters between sbig5 and other encoding formats.
Refer to iconv_intro(5) for an introduction to codeset conversion. For more information about the other codeset for which sbig5 is the
input or output, see the reference page specified in the list item. big5_sbig5, sbig5_big5
Converting from and to the Big-5 codeset: big5(5).
Note that Big-5 encoding is equivalent to the Microsoft code-page format used on PCs for Traditional Chinese. Therefore, you can use
these converters to convert Traditional Chinese between PC code-page format and Shift Big-5 encoding. eucTW_sbig5, sbig5_eucTW
Converting from and to Taiwanese Extended UNIX Code: eucTW(5).
SEE ALSO
Commands: locale(1)
Others: ascii(5), big5(5), Chinese(5), code_page(5), dechanyu(5), dechanzi(5), eucTW(5), GBK(5), i18n_intro(5), i18n_printing(5),
iconv_intro(5), l10n_intro(5), telecode(5)sbig5(5)
Hello,
I'm trying to figure out how to change the Character code of a text file in a script. Such as the ability to render an out put to Chinese Simple (GB2312) or other language character encodes. (1 Reply)
Is there a way I can execute an installp command in debug mode, so that I can see whats happening when a fileset is being installed or updated? (What files are being replace etc etc).
I have an installp command failing for unknown reason. (7 Replies)
Hi,
I have a file in my desktop which is a unicode format. After this file is transferred to Unix using FTP, we are seeing some special character (like rectangle box type) at the first line. The same file is saved as UTF8 (using textpad tool, selecting encode to UTF-8 option) on my desktopand... (7 Replies)
Any help appreciated. I am connecting (FTP) to a NAS PRO box that is currently installed on one of our subnets in Hong Kong from a Solaris
system running Solaris 10 (our backup server). The NAS box has a backup job that backs up all user documents locally. I am trying to ftp the docs from the NAS... (4 Replies)
Hi guys.
We can can connect to mysql server from command line with some user. but when using mysql_connect() it says:
access is denied for user 'someuser'@'localhost' (using password: YES)
what should i do? (6 Replies)
While uploading an exl file to my application in Solaris 10 the upload failed with error Error! Parsing Error: /SPLM/TC83/tcdata83/model/model_dbextract.xml Line:65576 Column:73 An invalid XML character (Unicode: 0x1a) was found in the value of attribute "unitOfMeasureSymbol" and element is ... (12 Replies)
we have a file that shows encoding as small-endian (in NotePad+). I can not read this file in my program. If I change it to ANSI (again in Notepad+) and save it, I am able to read it in my program.
Can I use the iconv function to change this file to ANSI ?
Thanks in advance for your help. (4 Replies)
I had just added Japanese language as my secondary language with yast and I am still not able to view Japanese characters (they appear mangled)
I've copied two text files from windows to the SUSE Linux Enterprise Server 10 (i586), one with UTF and the other with ANSI. When viewed with cat and... (4 Replies)
Hello,
I am stuck... i dunno why does my test fail... any idea ?
#!/bin/bash
dos2unix info.txt
Distor=Distributeur
LINE=$(cat info.txt | sed -n 1p)
echo $LINE
echo $Distor
echo ""
echo "123-$LINE-123"
echo "123-$Distor-123"
if ; then
LINE2=$(cat info.txt | sed -n 2p)
echo $Distor... (14 Replies)
Trying to load a specific Traditional Chinese Character on an HP UX platform for a Taiwan database. Data is converting to ?. Database has existed since 2008 and this is the first issue I've seen where a user is unable to save the multi-byte character in the database. I'm thinking this is may be... (0 Replies)
Hello,
Im downloading the file from windows server through FTP, the downloaded file is containing some junk character at very start of the file as below and causing my whole script is to fail,
how to download without junk or how to remove these before processing it?
▒▒"nmdbfnmdsfsdf"
... (19 Replies)
I have a file in my Unix ( SOLARIS ) with EBCDIC format...I want this file to read in ASCII OR unicode...Is it possible with UNIX to convert this file on ASCII OR UNICODE format from EBCDIC format?
I was searching through web and found only conversion table :(
Request Rejected
Below is... (16 Replies)
Hi all,
am running the following code on a RHEL 6.6 box to list which charsets are loaded and which are available:
#!/usr/bin/perl -w
use strict;
use Encode;
my @list = Encode->encodings();
my @all_encodings = Encode->encodings(":all");
print "@list\n\n";
print "@all_encodings\n";
... (3 Replies)
Hi Forum.
I have a unique problem that I'm hoping someone can assist me.
I'm generating a fixed width file and one of the output column (person_name at col. pos.#483 defined as string(36) sometimes contains french characters in the name and it causes the next column of data to shift to the... (10 Replies)