The UNIX and Linux Forums  
Closed Thread
  #1 (permalink)  
04-18-2009
sandeeppvk
Registered User
Join Date: Apr 2009
Posts: 10
Unix character set problem

Hi All,

We are receiving a file containing multibyte characters on our Unix box. When we try to view the file, the record looks like this:

Frédéric

The data actually sent to us is:

Frédéric

--> My locale charmap on Unix is set to UTF-8, but I am still getting this problem.

I created the same record on a Windows desktop and FTPed the file to the Unix server. The file looks fine when transferred that way.

We thought the error might occur when the other source writes the file to Unix, so the sender sent the data along with the ASCII values of the characters in that file.

So the file looks like this:

Frédéric

70 114 233 100 233 114 105 99 <-- ASCII values for the above record

The ASCII values are coming through correctly, but the data looks different.

Please help me out with this.
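
The byte values quoted in this post can be checked directly. The following is a minimal sketch in Python (it is not from the thread), assuming the eight decimal values are the raw bytes of the record. Byte 233 (0xE9) is how ISO-8859-1 (Latin-1) encodes é, whereas UTF-8 encodes é as the two bytes 195 169, so a lone 233 is not valid UTF-8 and a terminal expecting UTF-8 cannot render it as é.

```python
# Raw bytes of the record, as quoted in the post (decimal values).
record = bytes([70, 114, 233, 100, 233, 114, 105, 99])

# Decoded as ISO-8859-1 (Latin-1), the bytes give the intended name.
as_latin1 = record.decode("iso-8859-1")

# Decoded strictly as UTF-8, the lone byte 233 (0xE9) is rejected.
try:
    record.decode("utf-8")
    valid_utf8 = True
except UnicodeDecodeError:
    valid_utf8 = False

# Re-encoding the text as UTF-8 turns each é into two bytes (195 169).
as_utf8 = as_latin1.encode("utf-8")

print(as_latin1)
print(valid_utf8)
print(list(as_utf8))
```

If the file really is Latin-1, converting it once (for example with `iconv -f ISO-8859-1 -t UTF-8`) may be an alternative to changing the terminal's encoding.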
  #2 (permalink)  
04-18-2009
cbkihong
Forum Advisor
Join Date: Sep 2002
Location: Hong Kong, China
Posts: 1,624
It is probably a terminal font rendering issue, or your terminal may have been started in another locale. Even if you switch the locale in the shell, text may still not be rendered properly at the terminal emulator level. This is common with X-based terminals.

So which kind of terminal are you using, and are you sure a Unicode font with the needed characters is used for rendering the terminal text?
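
To see which locale the shell side is actually using, the standard precedence of the locale environment variables can be sketched as below. This is only an illustration in Python, not something from the thread: the helper names `effective_ctype` and `codeset` are hypothetical, and the `"C"` fallback is an assumption (POSIX leaves the default implementation-defined). Note that it only reports what the shell believes; the terminal emulator's own encoding is a separate setting that these variables do not reveal.

```python
import os


def effective_ctype(env):
    """Return the locale name that governs character handling.

    POSIX precedence: LC_ALL overrides LC_CTYPE, which overrides LANG.
    Unset or empty variables are skipped. "C" is a stand-in default.
    """
    for name in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(name, "")
        if value:
            return value
    return "C"


def codeset(locale_name):
    """Extract the codeset part of a name such as 'en_US.UTF-8@euro'."""
    if "." not in locale_name:
        return None  # e.g. "C" or "fr_FR": the codeset is implied, not named
    return locale_name.split(".", 1)[1].split("@", 1)[0]


# Report the setting for the current process.
current = effective_ctype(os.environ)
print(current, codeset(current))
```

If this reports a UTF-8 codeset while the terminal is configured for something else (or the reverse), the two sides disagree and non-ASCII text will display incorrectly.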
  #3 (permalink)  
04-18-2009
sandeeppvk
Registered User
Join Date: Apr 2009
Posts: 10
Thanks for the reply.

We are using PuTTY. We tried changing the character set in this interface, but we still didn't get proper data.

Is there any other interface that would let us view the data properly? Please suggest.
  #4 (permalink)  
04-18-2009
cbkihong
Forum Advisor
Join Date: Sep 2002
Location: Hong Kong, China
Posts: 1,624
With PuTTY, you need to make sure you select the proper encoding. Also check the font used. Both can be configured as preferences for specific sites.
  #5 (permalink)  
04-20-2009
sandeeppvk
Registered User
Join Date: Apr 2009
Posts: 10
Thanks once again.

It works fine when I change the settings in the PuTTY configuration.

But we have to change them manually. Is there any command in Unix that automatically switches PuTTY to UTF-8 and changes the font?

Please suggest.
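
One thing that may be worth trying from the Unix side: ISO 2022 defines the escape sequence ESC % G to ask a terminal to switch to UTF-8, and ESC % @ to switch back to its default character set. The sketch below (Python, not from the thread; the function name `request_utf8` is hypothetical) simply writes that sequence to the terminal. Whether a given PuTTY version and configuration honors it is an assumption to verify, and it does not change the font, which remains a client-side setting.

```python
import sys

# ESC % G: ask the terminal to interpret incoming text as UTF-8.
SELECT_UTF8 = b"\x1b%G"
# ESC % @: ask the terminal to return to its default character set.
SELECT_DEFAULT = b"\x1b%@"


def request_utf8(stream=None):
    """Write the 'select UTF-8' sequence to a binary stream (stdout by default)."""
    if stream is None:
        stream = sys.stdout.buffer
    stream.write(SELECT_UTF8)
    stream.flush()
```

The shell equivalent is `printf '\033%%G'`, which could go in a login script if it turns out to work with your terminal.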
  #6 (permalink)  
04-24-2009
sandeeppvk
Registered User
Join Date: Apr 2009
Posts: 10
Hi,

We are receiving a file on Unix with Korean and Chinese characters along with French characters. When we use UTF-8 mode, only the French characters are loaded properly into the Oracle database.

Which character set should I use to capture the Korean characters? I had heard that UTF-8 can hold all of these types, but here I am not able to get it to work. Please help me with this.
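
UTF-8 can indeed represent French, Korean, and Chinese text in the same file. The sketch below (Python, not from the thread; the Korean and Chinese sample strings are made up for illustration) shows a lossless round trip for all three, with é taking two bytes and each Hangul or Chinese character taking three. If only the French characters survive the load, the cause is therefore more likely to be elsewhere, for instance the file not actually being UTF-8 or the character set configured on the database or its client. That is a guess, not something the thread confirms.

```python
# Sample strings: the French name is from the thread, the other two
# are hypothetical examples chosen for illustration.
samples = {
    "French": "Frédéric",
    "Korean": "한국어",
    "Chinese": "中文",
}

for language, text in samples.items():
    encoded = text.encode("utf-8")
    # A lossless round trip means UTF-8 can hold the text.
    assert encoded.decode("utf-8") == text
    print(language, len(text), "characters ->", len(encoded), "bytes")

# Byte counts per sample, useful when sizing byte-limited columns.
byte_lengths = {name: len(text.encode("utf-8")) for name, text in samples.items()}
```

Because the multibyte characters take up to three bytes each here, a column sized in bytes rather than characters may also truncate or reject such data.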
All times are GMT -4.