Help required for Kanji characters in UNIX
Post 302284430 by quirkasaurus on Thursday 5th of February 2009 01:06:41 PM
I think the problem is that the kanji characters are stored in a different
character encoding (for example EUC-JP, Shift_JIS, or JIS) from one box to the next.

This phenomenon occurs a lot in Japanese email and, I think, is called "mojibake".
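
To make the effect concrete, here is a rough sketch in Python (not something from the original thread). The sample text and the EUC-JP / Shift_JIS pairing are assumptions picked for illustration; the two boxes in question may use other encodings.

```python
# Sketch only: the sample text and the codec pairing are assumptions.
SAMPLE = "\u6f22\u5b57"  # the two characters of the word "kanji"


def view_as(raw: bytes, encoding: str) -> str:
    """Decode raw bytes under the given encoding, marking undecodable bytes."""
    return raw.decode(encoding, errors="replace")


stored = SAMPLE.encode("euc_jp")        # bytes as written on the first box
correct = view_as(stored, "euc_jp")     # read back with the same encoding
garbled = view_as(stored, "shift_jis")  # read back with the wrong encoding

if __name__ == "__main__":
    # Print only the hex bytes and comparison results to stay terminal-safe.
    print(stored.hex(), correct == SAMPLE, garbled == SAMPLE)
```

The bytes on disk are identical in both cases; only the encoding the reader assumes is different, and that is what garbles the text.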

The solution is nasty:

You must translate each character, on the computer where the kanji look correct,
into its Unicode code point written out in ASCII decimal digits
( i.e. the binary value 0110101000101110 into "27182" )

This could be done using a C program.
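
For anyone who would rather not write the C, the same translation can be sketched in a few lines of Python. This is only an illustration: the function name `to_numeric_refs` is made up, and it assumes the source bytes are EUC-JP. Pass whatever encoding the "good" machine actually uses.

```python
import html


def to_numeric_refs(raw: bytes, source_encoding: str = "euc_jp") -> str:
    """Turn encoded Japanese text into ASCII-only HTML.

    source_encoding is an assumption; it must match the machine
    where the kanji display correctly.
    """
    text = raw.decode(source_encoding)
    # Escape &, < and > first so plain ASCII text stays valid HTML.
    escaped = html.escape(text, quote=False)
    # Every non-ASCII character becomes a decimal reference such as &#28450;
    return escaped.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


if __name__ == "__main__":
    print(to_numeric_refs("\u6f22\u5b57".encode("euc_jp")))
```

The result contains only ASCII, so it can be copied between machines without being re-interpreted along the way.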

Then, within HTML, you can get the Japanese characters by specifying:

<meta charset='x-euc-jp' >

at the top of your HTML,
and then writing the spelled-out numbers as numeric character references, like:

&#27182;

This is the preferred method, as the references are plain ASCII and always denote
Unicode code points, so they are not confused between machines that use
different character encodings.
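
A quick way to check a reference before putting it on a page is to resolve it with Python's standard `html` module. This is a sketch only; the helper names `resolve_refs` and `code_points` are hypothetical.

```python
import html


def resolve_refs(markup: str) -> str:
    """Resolve HTML character references (e.g. &#27182;) to characters."""
    return html.unescape(markup)


def code_points(text: str) -> list:
    """Return the Unicode code point of each character in text."""
    return [ord(ch) for ch in text]


if __name__ == "__main__":
    print(code_points(resolve_refs("&#27182;")))
```

The `#` is what marks a numeric reference. Without it, the same digits are not resolved and come back unchanged.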
 

jistoeuc(1)                     User Commands                     jistoeuc(1)

NAME
     jistoeuc, jistosj, euctojis, euctosj, sjtojis, sjtoeuc - Code conversion
     between JIS, PC kanji, and Japanese EUC

SYNOPSIS
     jistoeuc [-8] [-U] [filename...]
     jistosj [-8] [-U] [filename...]
     euctojis [-8] [-U] [filename...]
     euctosj [-U] [filename...]
     sjtojis [-8] [-U] [filename...]
     sjtoeuc [-U] [filename...]

AVAILABILITY
     SUNWjfpu

DESCRIPTION
     For Japanese language handling, the jistoeuc family provides conversion
     between different code standards. command [ filename ...] does the
     specified conversion on the contents of the input filenames and writes
     it to stdout. If filename is not given, it reads and converts characters
     from the standard input.

     jistoeuc   converts JIS to Japanese EUC
     jistosj    converts JIS to PC kanji
     euctojis   converts Japanese EUC to JIS
     euctosj    converts Japanese EUC to PC kanji
     sjtojis    converts PC kanji to JIS
     sjtoeuc    converts PC kanji to Japanese EUC

OPTIONS
     -8   With this option specified, the commands jistoeuc, jistosj, sjtojis,
          and sjtoeuc, can support JIS X 0201 (Half-Size Katakana). This 8-bit
          JIS code does not use ISO Shift-In and Shift-Out escape sequences.

     -U   The output is not buffered (The default is buffered output).

SEE ALSO
     iconv(1), iconv_ja(5)

NOTES
     This command can handle shift-in escape sequences for the following
     character sets:

     JIS X 0208 shift-in escape         - E$B, E$(B, E$@
     JIS X 0212 shift-in escape         - E$(D
     JIS X 0201 Roman shift-in escape   - E(J, E(H
     ASCII shift-in escape              - E(B

     euctojis and sjtojis can handle shift-in escape sequences for the
     following character sets:

     JIS X 0208 shift-in         - E$B
     JIS X 0212 shift-in         - E$(D (except when sjtojis command is specified)
     JIS X 0201 Roman shift-in   - E(J

     jistoeuc does not check whether or not each code in the input file is
     correct.

     Conversion with PC kanji is not based on TOG Japanese Vendors Council
     (TOG/JVC) Recommended Code Set Conversion Specification between Japanese
     EUC and Shift-JIS. The iconv(1) utility provides these functions. See
     iconv(1) and iconv_ja(5) for more information.

BUGS
     If JIS X 0212 character set is specified as input, jistosj and euctosj
     can not support the conversion correctly. euctosj, sjtoeuc, jistosj, and
     sjtojis can support conversion correctly only if JIS X 0208 1 ku - 84 ku
     is specified as input.

SunOS 5.10                       10 Jan 2003                      jistoeuc(1)
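
Where these Solaris commands are not installed, the same kind of conversion can be approximated with Python's built-in codecs. This is a sketch under assumptions: the function names simply borrow the command names above, and it assumes Python's `shift_jis`, `euc_jp` and `iso2022_jp` codecs stand in for PC kanji, Japanese EUC and JIS. The output is not guaranteed to match the Solaris tools byte for byte.

```python
def convert(data: bytes, source: str, target: str) -> bytes:
    """Re-encode data from one Japanese encoding to another via Unicode."""
    return data.decode(source).encode(target)


def sjtoeuc(data: bytes) -> bytes:
    """PC kanji (Shift_JIS) to Japanese EUC; name borrowed from the man page."""
    return convert(data, "shift_jis", "euc_jp")


def euctojis(data: bytes) -> bytes:
    """Japanese EUC to JIS (ISO-2022-JP); name borrowed from the man page."""
    return convert(data, "euc_jp", "iso2022_jp")


if __name__ == "__main__":
    sample = "\u6f22\u5b57".encode("shift_jis")
    print(sjtoeuc(sample).hex())
    print(euctojis(sjtoeuc(sample)))
```

Unlike jistoeuc, which the NOTES say does not check its input, `decode` raises `UnicodeDecodeError` on bytes that are invalid in the source encoding.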