UTF-8 in xterm

08-24-2011

Registered User

32, 1

Join Date: Jan 2011

Last Activity: 22 July 2012, 2:59 AM EDT

Posts: 32

Thanks Given: 13

Thanked 1 Time in 1 Post

UTF-8 in xterm

I need to use sort, uniq, grep, wc,... and the like to work with lists of words in UTF-8 (the "words" being phonetic transcriptions using the IPA). I have been using Google a lot and I even found at least one previous post on this topic, but it didn't help.

I tried following the instructions on:
UTF-8 and Unicode FAQ
* I set the locale in my xterm with

Code:

export LC_ALL=fr_FR.UTF-8

(which is installed as per locale -a)
* Then I started a new xterm from within the old one with

Code:

xterm -fn '-adobe-courier-medium-r-normal--10-100-75-75-m-60-iso10646-1'

which I found using

Code:

xlsfonts | grep iso10646-1 | less

* Then I tested using some of the example files found on
UTF-8 and Unicode FAQ

Unfortunately, the unicode characters are displayed as boxes when viewing the file with less (after typing a "y" in answer to the message warning me that "UTF-8-demo.txt may be a binary file...")

I also tried setting LESSCHARSET=utf-8, but it didn't help either.

Can anyone help?

I am using the latest version of X11.app on Mac OS X (XQuartz 2.6.3). less is version 394, xterm version 269.

mregine

View Public Profile for mregine

Find all posts by mregine

08-24-2011

Registered User

23,310, 4,623

Join Date: Aug 2005

Last Activity: 7 July 2020, 11:47 AM EDT

Location: Saskatchewan

Posts: 23,310

Thanks Given: 1,331

Thanked 4,623 Times in 4,217 Posts

Quote:

Originally Posted by mregine

Unfortunately, the unicode characters are displayed as boxes when viewing the file with less (after typing a "y" in answer to the message warning me that "UTF-8-demo.txt may be a binary file...")

I'm suspicious of any tutorial that asks you to use a specific font to get unicode... Those instructions probably only work for one revision of one distro.

xterm has an options menu when running, little known but definitely there, in which you may be able to change fonts and charsets etc.
Unfortunately I don't have access to an xterm right now to tell you where it is but it may be something like right-clicking the title bar.

Corona688

View Public Profile for Corona688

Visit Corona688's homepage!

Find all posts by Corona688

08-24-2011

Registered User

32, 1

Join Date: Jan 2011

Last Activity: 22 July 2012, 2:59 AM EDT

Posts: 32

Thanks Given: 13

Thanked 1 Time in 1 Post

I also tried using the "underspecified" version:

Code:

LC_ALL=fr_FR.UTF-8 xterm -fn '-*-*-*-*-*--14-*-*-*-*-*-iso10646-*'

The result is the same :-/

I know of two menus I can call up with the mouse. One of them is titled "main options", and the other "VT Fonts". I use the 2nd one every now and then to change font size, e.g. when using a beamer, but it doesn't offer options for changing the font.

I have however, achieved a partial solution using

Code:

LESSCHARSET=utf-8 less UTF-8-demo.txt

in the new xterm, but there are still a lot of boxes...

mregine

View Public Profile for mregine

Find all posts by mregine

UNIX for Dummies Questions & Answers

UTF-8 in xterm

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Conversion from ansii to UTF 16

Discussion started by: harry00514

2. Linux

Help to Convert file from UNIX UTF-8 to Windows UTF-16

Discussion started by: phanidhar6039

3. Shell Programming and Scripting

ASCII to UTF-8 conversion

Discussion started by: Sriranga

4. AIX

How to print UTF-8 from AIX (lp)

Discussion started by: burnAF

5. Programming

strlen for UTF-8

Discussion started by: cyler

6. UNIX for Advanced & Expert Users

vi and UTF-8 errors

Discussion started by: jlacasci

7. UNIX Desktop Questions & Answers

How to configure Xterm for UTF-8?

Discussion started by: siegfried

8. AIX

en_us.utf-8

Discussion started by: shubhendu.pyne

9. Shell Programming and Scripting

replace UTF-8 characters with tr

Discussion started by: ripat

10. Shell Programming and Scripting

UTF 8 and SED

Discussion started by: jaganadh