02-05-2009
I think the problem is that the kanji characters are stored in different integer
notation on one box to the next.
This phenomenon occurs a lot in japanese email, and i think, is called "gojimake".
The solution is nasty:
You must translate the binary values on the computer where the kanji look correct,
into their ASCII numeric characters. ( ie. 00010010101 into "27182" )
This could be done using a C program.
Then, within html, you can get the Japanese characters by specifying:
<meta charset='x-euc-jp' >
in your html at the top....
and then accessing the spelled out numbers like:
&27182;
This is the preferred method -- as this is not confused between machines
with different binary integer encoding.
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
I need to know how to enter a unix path in a cgi script for a guest book:
example:
My URL is http://www.kitachi.info
I have an html file in the main folder on my site, the file is called :
gbook.html
what would the correct unix path for this file be ???
the part of the script... (1 Reply)
Discussion started by: akitachi
1 Replies
2. UNIX for Dummies Questions & Answers
Hi All,
Can anyone please help me in unix command
Query:
====
File contains data along with date and time stamp like,
..
Date: 08:23:2005 01:00:00
method: xyz
init variables
Date 08:23:2005 01:00:01
method: xyz
finished init variable
.... (2 Replies)
Discussion started by: thaduka
2 Replies
3. UNIX for Dummies Questions & Answers
Hi All,
Can anyone please help me in sort out the command to get the following command
say File abc.log contains
....
......
This is the first line
This is the second line
This is the third line
This is the fourth line
This is the fifth line
This is the first line
This is the... (7 Replies)
Discussion started by: thaduka
7 Replies
4. UNIX for Advanced & Expert Users
It is required to trap the signal send to a daemon process before rebooting a unix server. Suppose a script abc.ksh is running in the server as daemon. Before rebooting the server, the unix admin kills all the daemon processes. It is not known to me how admin kills the processes; I mean by which... (9 Replies)
Discussion started by: k_bijitesh
9 Replies
5. Shell Programming and Scripting
My data is something like shown below.
date1 date2 aaa bbbb ccccc
date3 date4 dddd eeeeeee ffffffffff ggggg hh
I want the output like this
date1date2 aaa eeeeee
I serached in the forum but didn't find the exact matching solution. Please help. (7 Replies)
Discussion started by: rdhanek
7 Replies
6. UNIX for Dummies Questions & Answers
Hi,
I am unable to copy Kanji characters into a unix file. They look like special characters when pasted into the Unix file. My objective is to copy these characters into a unix file and be able to print it and see the Kanji characters. Any help would be greatly appreciated.
I am trying this... (1 Reply)
Discussion started by: andrussw
1 Replies
7. Shell Programming and Scripting
I have a file 123.txt which is
aasaasas=1
bsasasasasa=2
sawqas=3
I want my output to be
1
2
3
I am new to scripting can some1 help me out. (14 Replies)
Discussion started by: karthikkasarla
14 Replies
8. UNIX for Dummies Questions & Answers
Hi,
My shell script calls a perl script to create an excel and the shell
script emails the excel. This excel file needs to be renamed
to some Kanji name.
I have a flat file that has the required file name in kanji and i extract it
within the shell script and try to rename the file, but... (3 Replies)
Discussion started by: tariq_m
3 Replies
9. Shell Programming and Scripting
HI guys here's hoping some on pout the can help
I have a large library of epub and mobi file creates some what by calibre.
Output of tree listing below
I would like to recursively rename the directories removing the brackets and numbers
I have been scratching my head over... (4 Replies)
Discussion started by: dunryc
4 Replies
10. Shell Programming and Scripting
I have 40000 records in a file where i need to change the 7th field date format from 05142016 to 20160514
I have given field below. any help would be highly appreciated.
364512|9999999|9999999|210553|195495477|195257095|05142016|10009|36313
---------- Post updated at 05:02 AM... (2 Replies)
Discussion started by: arun888
2 Replies
LEARN ABOUT SUNOS
jistosj
jistoeuc(1) User Commands jistoeuc(1)
NAME
jistoeuc, jistosj, euctojis, euctosj, sjtojis, sjtoeuc - Code conversion between JIS, PC kanji, and Japanese EUC
SYNOPSIS
jistoeuc [-8] [-U] [filename...]
jistosj [-8] [-U] [filename...]
euctojis [-8] [-U] [filename...]
euctosj [-U] [filename...]
sjtojis [-8] [-U] [filename...]
sjtoeuc [-U] [filename...]
AVAILABILITY
SUNWjfpu
DESCRIPTION
For Japanese language handling, the jistoeuc family provides conversion between different code standards. command [ filename ...] does the
specified conversion on the contents of the input filenames and writes it to stdout.
If filename is not given, it reads and converts characters from the standard input.
jistoeuc converts JIS to Japanese EUC
jistosj converts JIS to PC kanji
euctojis converts Japanese EUC to JIS
euctosj converts Japanese EUC to PC kanji
sjtojis converts PC kanji to JIS
sjtoeuc converts PC kanji to Japanese EUC
OPTIONS
-8 With this option specified, the commands jistoeuc, jistosj, sjtojis, and sjtoeuc, can support JIS X 0201 (Half-Size Katakana).
This 8-bit JIS code does not use ISO Shift-In and Shift-Out escape sequences.
-U The output is not buffered (The default is buffered output).
SEE ALSO
iconv(1), iconv_ja(5)
NOTES
This command can handle shift-in escape sequences for the following character sets:
JIS X 0208 shift-in escape - E$B, E$(B, E$@
JIS X 0212 shift-in escape - E$(D
JIS X 0201 Roman shift-in escape - E(J, E(H
ASCII shift-in escape - E(B
euctojis and sjtojis can handle shift-in escape sequences for the following character sets:
JIS X 0208 shift-in - E$B
JIS X 0212 shift-in - E$(D (except when sjtojis command is specified)
JIS X 0201 Roman shift-in - E(J
jistoeuc does not check whether or not each code in the input file is correct. Conversion with PC kanji is not based on TOG Japanese Ven-
dors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. The iconv(1) utility provides
these functions. See iconv(1) and iconv_ja(5) for more information.
BUGS
If JIS X 0212 character set is specified as input, jistosj and euctosj can not support the conversion correctly. euctosj, sjtoeuc, jis-
tosj, and sjtojis can support conversion correctly only if JIS X 0208 1 ku - 84 ku is specified as input.
SunOS 5.10 10 Jan 2003 jistoeuc(1)