Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Determing the encoding of a file Post 302751837 by DGPickett on Friday 4th of January 2013 02:53:32 PM
Old 01-04-2013
Try using 'od' on it to see if there is a pattern you can recognize. Is it unicode, euc, jis, ebcdic, bcdic, or just an odd code page? Hard to say! 'I use 'od -bc' because I was octal-raised, but there are options for hex and decimal offsets. But yes, really, you should know!

Often, 'C' is linked to iso-8859-1 or Latin-1, but your file is not that.
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

File encoding in Unix

1. I have a shell script which creates a file using cat command. How can i find what encoding the file follows (e.g. UTF8, ANSI)? 2. I want to convert that file to PC-ANSI format. How can i achieve that? I am using HP-Unix. (6 Replies)
Discussion started by: ssmallya
6 Replies

2. Shell Programming and Scripting

get the file encoding

Hello! The system is AIX 5.3 Give please command or script to get the file encoding Thanks (2 Replies)
Discussion started by: vinment
2 Replies

3. AIX

get the file encoding

Hello! The system is AIX 5.3 Give please command or script to get the file encoding (1 Reply)
Discussion started by: vinment
1 Replies

4. Shell Programming and Scripting

Dymically determing the number of check list in Zenity, How?

hi, In my project i cannot determine the number of check list initially... I will know dynamically during execution... so How to specify the number of check list dynamically in zenity Waiting for your precious Answer..... (1 Reply)
Discussion started by: shivarajM
1 Replies

5. Shell Programming and Scripting

Cygwin vi XML file encoding problem

Hi, I have got a zip (binary) file transferred from MacOS (thus it has additional __MACOSX directory packed inside). On extracting this zip, there are few *.xml files available. When I opened this *.xml file in vim editor using Cygwin (on windows) the editor displayed in the bottom. I tried... (4 Replies)
Discussion started by: royalibrahim
4 Replies

6. HP-UX

how to find the character encoding of a file in hp_ux

how to find the character encoding of a file in hp_ux (1 Reply)
Discussion started by: alokjyotibal
1 Replies

7. Shell Programming and Scripting

How to find the file encoding and updating the file encoding?

Hi, I am beginner to Unix. My requirement is to validate the encoding used in the incoming file(csv,txt).If it is encoded with UTF-8 format,then the file should remain as such otherwise i need to chnage the encoding to UTF-8. Please advice me how to proceed on this. (7 Replies)
Discussion started by: cnraja
7 Replies

8. HP-UX

Determing size of swap space

Hi Experts, Need your advise in determining the size of swap space in of the new HP-Ux server. Server is having 32G of physical memory. Ideally what amout of physical memory should be allocated as a swap space? Following document from HP suggests to have minimum swap space... (2 Replies)
Discussion started by: sai_2507
2 Replies

9. Solaris

View file encoding then change encoding.

Hi all!! I´m using command file -i myfile.xml to validate XML file encoding, but it is just saying regular file . I´m expecting / looking an output as UTF8 or ANSI / ASCII Is there command to display the files encoding? Thank you! (2 Replies)
Discussion started by: mrreds
2 Replies

10. Shell Programming and Scripting

How to know file encoding?

how can i know what format a file is * example: UTF-8 ANSI UCS2 i am in a... (8 Replies)
Discussion started by: tricampeon81
8 Replies
PERLJP(1)						 Perl Programmers Reference Guide						 PERLJP(1)

NAME
perljp - XXX Perl XXX XX
Perl XXXXXXXX! Perl 5.8.0 XXXUnicodeXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX CJK (XXXXXXXXXXXX)XXXXXXXXXXXUnicodeXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXOSXXXXXXXXXXXXXXX(PCXXXMacintosh)XXXXXXXXXXXX Perl XXX Unicode XXXXXXXPerl XXXXXXXXXXXXXXXXXXXX Unicode XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Encode XXXXXXXXXXXXUnicode XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXX Encode XXXXXXXXXXXXXXXXXXXXXX 7bit-jis AdobeStandardEncoding AdobeSymbol AdobeZdingbat ascii big5 big5-hkscs cp1006 cp1026 cp1047 cp1250 cp1251 cp1252 cp1253 cp1254 cp1255 cp1256 cp1257 cp1258 cp37 cp424 cp437 cp500 cp737 cp775 cp850 cp852 cp855 cp856 cp857 cp860 cp861 cp862 cp863 cp864 cp865 cp866 cp869 cp874 cp875 cp932 cp936 cp949 cp950 dingbats euc-cn euc-jp euc-kr gb12345-raw gb2312-raw gsm0338 hp-roman8 hz iso-2022-jp iso-2022-jp-1 iso-8859-1 iso-8859-10 iso-8859-11 iso-8859-13 iso-8859-14 iso-8859-15 iso-8859-16 iso-8859-2 iso-8859-3 iso-8859-4 iso-8859-5 iso-8859-6 iso-8859-7 iso-8859-8 iso-8859-9 iso-ir-165 jis0201-raw jis0208-raw jis0212-raw johab koi8-f koi8-r koi8-u ksc5601-raw MacArabic MacCentralEurRoman MacChineseSimp MacChineseTrad MacCroatian MacCyrillic MacDingbats MacFarsi MacGreek MacHebrew MacIcelandic MacJapanese MacKorean MacRoman MacRomanian MacRumanian MacSami MacSymbol MacThai MacTurkish MacUkrainian nextstep posix-bc shiftjis symbol UCS-2BE UCS-2LE UTF-16 UTF-16BE UTF-16LE UTF-32 UTF-32BE UTF-32LE utf8 viscii (X114XX) XXXXXXXXXFOOXXXXXXUTF-8XXXXXXXXXXXXXXXXXX perl -Mencoding=FOO,STDOUT,utf8 -pe1 < file.FOO > file.utf8 XXXPerlXXXXXXPerlXXXXXXXXXXXXXXXXXXXXpiconvXXXXXXXXXXXXXXXXXXXXXXXXXX piconv -f FOO -t utf8 < file.FOO > file.utf8 piconv -f utf8 -t FOO < file.utf8 > file.FOO About (jcode.pl|Jcode.pm|JPerl) 5.8XXXXXXXXXXEUC-JPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXJcode.pmX( http://openlab.ring.gr.jp/Jcode/ )Xperl4XXXXXXXXXXXXXjcode.plXXXXXXXXXXXXXXXXCGIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 5.005XXXPerlXXXXXXXXXXXXXXXXXXXJperlXXXXXXX( http://homepage2.nifty.com/kipp/perl/jperl/index.html )XXXXMac OS 9.x/ClassicXXPerlXMacPerlXXXXXXMacJPerlXXXXXXXXXXX( http://habilis.net/macjperl/ ).XXXXXXXXXXXXXEUC-JPXXXShift_JISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Perl5.8XXXXXXXXXXXXXPerlXXXXXXXXXXXXXXXXXXXXXXX114XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCPANXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX o XXX XXXXXXXXXShift_JISXXXXEUC-JPXXXXXXXXXXX # jcode.pl require "jcode.pl"; while(<>){ jcode::convert(*_, 'euc', 'sjis'); print; } # Jcode.pm use Jcode; while(<>){ print Jcode->new($_, 'sjis')->euc; } # Perl 5.8 use Encode; while(<>){ from_to($_, 'shiftjis', 'euc-jp'); print; } # Perl 5.8 - encoding XXXXX use encoding 'euc-jp', STDIN => 'shiftjis'; while(<>){ print; } o Jperl XXXXXXX XXXX"shebang"XXXXXXXXXJperlXXscriptXXXXXXXXXXXXXXXXXXXXXXX #!/path/to/jperl X #!/path/to/perl -Mencoding=euc-jp XXXX perldoc encoding XXXXXXXXXX XXXXXX PerlXXXXXXXXXXXXXXXPerlXXXXXUnicodeXXXXXXXXEncodeXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX perldoc perlunicode # PerlXUnicodeXXXXXX perldoc Encode # EncodeXXXXXXXXX perldoc Encode::JP # XXXXXXXXXXXXXX PerlXXXXXX URL <http://www.perl.com/> Perl XXXXXX (O'Reilly and Associates) <http://www.cpan.org/> CPAN (Comprehensive Perl Archive Network) <http://lists.perl.org/> Perl XXXXXXXXX PerlXXXXXXX URL <http://www.oreilly.com.tw/> O'Reilly XXPerlXXXX(XXXXXX) <http://www.oreilly.com.cn/> O'Reilly XXPerlXXXX(XXXXXX) <http://www.oreilly.co.jp/catalog/> XXXXXXXPerlXXXX(XXX) Perl XXXXXXXX <http://www.pm.org/groups/asia.html> UnicodeXXXURL <http://www.unicode.org/> Unicode XXXXXXX (UnicodeXXXXXXX) <http://www.cl.cam.ac.uk/%7Emgk25/unicode.html> UTF-8 and Unicode FAQ for Unix/Linux <http://wiki.kldp.org/Translations/html/UTF8-Unicode-KLDP/UTF8-Unicode-KLDP.html> UTF-8 and Unicode FAQ for Unix/Linux (XXXXX) AUTHORS
Jarkko Hietaniemi <jhi@iki.fi> Dan Kogai (XXXX) <dankogai@dan.co.jp> perl v5.18.2 2013-11-04 PERLJP(1)
All times are GMT -4. The time now is 02:04 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy