03-25-2015
I agree, but I seem to remember there is also a Windows ANSI codepage set..
Last edited by Scrutinizer; 03-25-2015 at 02:35 PM..
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Collegues
I tried to manipulate a UTF 8 data using the following script.
cat $1 | sed 's/ലായി$/ലായി LAYI/g' | sed 's/ുടെ/ുടെ UTE/g' | sed 's/യില്*/യില്* YIL/g'
But it says that cnot exicute binary file. Any solution.
Jaganadh.
Linguist (1 Reply)
Discussion started by: jaganadh
1 Replies
2. Shell Programming and Scripting
Hi,
I try to get tr to replace multibytes characters by ascii equivalent. For example
"Je vais ŕ l'école" ---> 'Je vais a l'ecole"
But my version of tr (5.97) doesn't seem to support multibyte sets.
$ locale charmap; echo "Je vais ŕ l'école" | tr éŕ ea
UTF-8
Je vais aa l'aacole
I try to... (2 Replies)
Discussion started by: ripat
2 Replies
3. AIX
please someone provide me the link for downloading en_us.utf-8 .....i have an issue with locale for which i need this :( (1 Reply)
Discussion started by: shubhendu.pyne
1 Replies
4. UNIX Desktop Questions & Answers
hmmm... I was not sure where to post this! I want emit non-ascii chinese and ciryllic text. I'm running windows server 2003 with cygwin xfree86.
I know I have one font that can render chinese and russian: "Arial Unicode MS".
How can I configure my cygwin xterm so I can emit russian and... (1 Reply)
Discussion started by: siegfried
1 Replies
5. UNIX for Advanced & Expert Users
Hi all,
At present a file from AS400 system is being FTPed to an AIX system.
Now, a similar file needs to be sent from our Unix box (Solaris)
Is there any tool available which does the conversion in Unix from UTF-8 to EBCDIC?
Any suggestions/ pointers are really appreciated.
Thanks,... (4 Replies)
Discussion started by: sridhar_423
4 Replies
6. UNIX for Advanced & Expert Users
We just installed icu for UTF-8 compliance on our AIX 5.3 system. While usuing vi on some files we get the following error:
ex: 0602-169 Incomplete or invalid multibyte character encountere
yte character encountered, conversion failed.ex: 0602-169 Incomplete or invalidb
ractersultibyte... (0 Replies)
Discussion started by: jlacasci
0 Replies
7. Programming
My OS (Debian) and gcc use the UTF-8 locale. This code says that the char size is 1 byte but the size of 'a' is really 4 bytes.
int main(void)
{
setlocale(LC_ALL, "en_US.UTF-8");
printf("Char size: %i\nSize of char 'a': %i\nSize of Euro sign '€': %i\nLength of Euro sign: %i\n",... (8 Replies)
Discussion started by: cyler
8 Replies
8. UNIX for Dummies Questions & Answers
I need to use sort, uniq, grep, wc,... and the like to work with lists of words in UTF-8 (the "words" being phonetic transcriptions using the IPA). I have been using Google a lot and I even found at least one previous post on this topic, but it didn't help.
I tried following the instructions... (2 Replies)
Discussion started by: mregine
2 Replies
9. Shell Programming and Scripting
I Am trying to change the file encoding from ASCII to UTF-8 using below command
iconv -f ASCII -t UTF-8 <input_file> > <output_file>
But the output_file is not actually in UTF-8 format. If I use the file command to check the file encoding it still says ASCII.
While converting am not... (5 Replies)
Discussion started by: Sriranga
5 Replies
10. Linux
Hi,
I have tried to convert a UTF-8 file to windows UTF-16 format file as below from unix machine
unix2dos < testing.txt | iconv -f UTF-8 -t UTF-16 > out.txt
and i am getting some chinese characters as below which l opened the converted file on windows machine.
LANG=en_US.UTF-8... (3 Replies)
Discussion started by: phanidhar6039
3 Replies
RE(1) General Commands Manual RE(1)
NAME
re - programm to conver russian texts between different encodings
SYNOPSIS
re <infile><outfile><infrmt><outfrmt><f|b|s><u|l|s>
DESCRIPTION
This manual page was written for the Debian GNU/Linux distribution because the original program does not have a manual page.
re is a program that converts russian texts between encodings.
OPTIONS
The easiest way to convert some file into readable form (KOI-8), just type this:
re <SourceFile> <DestFile> ? K
where :
<SourceFile> - unreadable file
<DestFile> - resulting file
? - tells RE that source codepage is unknown and RE should analyze the file and determine the source codepage
K - tells RE that destination codepage is KOI-8
Note: If you know, what is source codepage, you may use it instead of "?" option. For example, you want to convert letter.txt file which
was written in Win you know, that this file is in 1251 codepage:
re letter.txt letter2.txt W K
Now examine the letter2.txt - you should see russian symbols in KOI-8.
Here is the know codepages list and their abbreviatures:
W - Windows 1251 "_" - _xxe
D - Dos "%" - %hex
K - KOI-8 "\" - 'hex
L - Latin G - Graph_win
I - Iso "<" - <binhex>
H - HEX + - +UTF7-
S - ShiftKbrd C - C_MIC
M - Mac Y - Y_c16
A - AFF Z - Z_c32
O - Odd(UTF8_1) F - F(UTF8_2)
B - Base64 P - Pict
E - Express N - N_Estl
T - T-Html V - V_Vpp855
U - User X - X_sp
"-" - uue J - J_diff
Other options are: [-v][-E|-R|-N][-e|-s]
-v - tells what is processing
-n - don't tells what is processing (default)
-E - converts all p,H from Russian to English
-R - converts all p,H from English to Russian
-N - lets all p,H Russian or English as in text (default)
-e - converts all symbols 0x80 - 0xFF
-s - converts only 64 symbols of Russian Alphabet (default)
AUTHOR
This manual page was written by Aigars Mahinovs <aigarius@debian.org>, for the Debian GNU/Linux system (but may be used by others).
March 21, 2000 RE(1)