Conversion from ansii to UTF 16 Post: 302939434

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

UTF 8 and SED

Collegues I tried to manipulate a UTF 8 data using the following script. cat $1 | sed 's/ലായി$/ലായി LAYI/g' | sed 's/ുടെ/ുടെ UTE/g' | sed 's/യില്*/യില്* YIL/g' But it says that cnot exicute binary file. Any solution. Jaganadh. Linguist

2. Shell Programming and Scripting

replace UTF-8 characters with tr

Hi, I try to get tr to replace multibytes characters by ascii equivalent. For example "Je vais � l'�cole" ---> 'Je vais a l'ecole" But my version of tr (5.97) doesn't seem to support multibyte sets. $ locale charmap; echo "Je vais � l'�cole" | tr �� ea UTF-8 Je vais aa l'aacole I try to...

3. AIX

en_us.utf-8

please someone provide me the link for downloading en_us.utf-8 .....i have an issue with locale for which i need this :(

4. UNIX Desktop Questions & Answers

How to configure Xterm for UTF-8?

hmmm... I was not sure where to post this! I want emit non-ascii chinese and ciryllic text. I'm running windows server 2003 with cygwin xfree86. I know I have one font that can render chinese and russian: "Arial Unicode MS". How can I configure my cygwin xterm so I can emit russian and...

5. UNIX for Advanced & Expert Users

UTF-8 to EBCDIC conversion in UNIX

Hi all, At present a file from AS400 system is being FTPed to an AIX system. Now, a similar file needs to be sent from our Unix box (Solaris) Is there any tool available which does the conversion in Unix from UTF-8 to EBCDIC? Any suggestions/ pointers are really appreciated. Thanks,...

6. UNIX for Advanced & Expert Users

vi and UTF-8 errors

We just installed icu for UTF-8 compliance on our AIX 5.3 system. While usuing vi on some files we get the following error: ex: 0602-169 Incomplete or invalid multibyte character encountere yte character encountered, conversion failed.ex: 0602-169 Incomplete or invalidb ractersultibyte...

7. Programming

strlen for UTF-8

My OS (Debian) and gcc use the UTF-8 locale. This code says that the char size is 1 byte but the size of 'a' is really 4 bytes. int main(void) { setlocale(LC_ALL, "en_US.UTF-8"); printf("Char size: %i\nSize of char 'a': %i\nSize of Euro sign '�': %i\nLength of Euro sign: %i\n",...

8. UNIX for Dummies Questions & Answers

UTF-8 in xterm

I need to use sort, uniq, grep, wc,... and the like to work with lists of words in UTF-8 (the "words" being phonetic transcriptions using the IPA). I have been using Google a lot and I even found at least one previous post on this topic, but it didn't help. I tried following the instructions...

9. Shell Programming and Scripting

ASCII to UTF-8 conversion

I Am trying to change the file encoding from ASCII to UTF-8 using below command iconv -f ASCII -t UTF-8 <input_file> > <output_file> But the output_file is not actually in UTF-8 format. If I use the file command to check the file encoding it still says ASCII. While converting am not...

10. Linux

Help to Convert file from UNIX UTF-8 to Windows UTF-16

Hi, I have tried to convert a UTF-8 file to windows UTF-16 format file as below from unix machine unix2dos < testing.txt | iconv -f UTF-8 -t UTF-16 > out.txt and i am getting some chinese characters as below which l opened the converted file on windows machine. LANG=en_US.UTF-8...

LEARN ABOUT BSD

re

RE(1)							      General Commands Manual							     RE(1)

NAME

       re - programm to conver russian texts between different encodings

SYNOPSIS

       re <infile><outfile><infrmt><outfrmt><f|b|s><u|l|s>

DESCRIPTION

       This manual page was written for the Debian GNU/Linux distribution because the original program does not have a manual page.

       re is a program that converts russian texts between encodings.

OPTIONS

       The easiest way to convert some file into readable form (KOI-8), just type this:

       re <SourceFile> <DestFile> ? K

       where :

       <SourceFile> - unreadable file

       <DestFile>   - resulting file

       ?	    - tells RE that source codepage is unknown and RE should analyze the file and determine the source codepage

       K	    - tells RE that destination codepage is KOI-8

       Note:  If  you  know, what is source codepage, you may use it instead of "?" option. For example, you want to convert letter.txt file which
       was written in Win you know, that this file is in 1251 codepage:

	   re letter.txt letter2.txt W K

       Now examine the letter2.txt - you should see russian symbols in KOI-8.

       Here is the know codepages list and their abbreviatures:
	   W   - Windows 1251	 "_"  - _xxe
	   D   - Dos		 "%"  - %hex
	   K   - KOI-8		 "\" - 'hex
	   L   - Latin		  G   - Graph_win
	   I   - Iso		 "<"  - <binhex>
	   H   - HEX		  +   - +UTF7-
	   S   - ShiftKbrd	  C   - C_MIC
	   M   - Mac		  Y   - Y_c16
	   A   - AFF		  Z   - Z_c32
	   O   - Odd(UTF8_1)	  F   - F(UTF8_2)
	   B   - Base64 	  P   - Pict
	   E   - Express	  N   - N_Estl
	   T   - T-Html 	  V   - V_Vpp855
	   U   - User		  X   - X_sp
	   "-" - uue		  J   - J_diff

       Other options are: [-v][-E|-R|-N][-e|-s]
	    -v - tells what is processing
	    -n - don't tells what is processing (default)
	    -E - converts all p,H from Russian to English
	    -R - converts all p,H from English to Russian
	    -N - lets all p,H Russian or English as in text (default)
	    -e - converts all symbols 0x80 - 0xFF
	    -s - converts only 64 symbols of Russian Alphabet (default)

AUTHOR

       This manual page was written by Aigars Mahinovs <aigarius@debian.org>, for the Debian GNU/Linux system (but may be used by others).

								  March 21, 2000							     RE(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

UTF 8 and SED

Discussion started by: jaganadh

2. Shell Programming and Scripting

replace UTF-8 characters with tr

Discussion started by: ripat

3. AIX

en_us.utf-8

Discussion started by: shubhendu.pyne

4. UNIX Desktop Questions & Answers

How to configure Xterm for UTF-8?

Discussion started by: siegfried

5. UNIX for Advanced & Expert Users

UTF-8 to EBCDIC conversion in UNIX

Discussion started by: sridhar_423

6. UNIX for Advanced & Expert Users

vi and UTF-8 errors

Discussion started by: jlacasci

7. Programming

strlen for UTF-8

Discussion started by: cyler