01-04-2013
Try running 'od' on it to see whether there is a pattern you can recognize. Is it Unicode, EUC, JIS, EBCDIC, BCDIC, or just an odd code page? Hard to say! I use 'od -bc' because I was octal-raised, but there are options for hex and decimal offsets. But yes, really, you should know!
Often, 'C' is linked to ISO-8859-1 (Latin-1), but your file is not that.
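To make the 'od' suggestion concrete, here is a small sketch. The sample file and its contents are made up for illustration (they are not the file from this thread); the point is only to show what a recognizable byte pattern looks like in octal and in hex.

```shell
# Build a tiny sample file: "café" in UTF-8, where é is the two bytes 0xC3 0xA9.
# (Hypothetical sample data, not the poster's file.)
sample=$(mktemp)
printf 'caf\303\251\n' > "$sample"

# Octal bytes with character interpretation, as suggested in the post.
od -bc "$sample"

# The same bytes in hex, with hex offsets (-A x) and one byte per column (-t x1).
od -A x -t x1 "$sample"

# Bare hex string with no offsets, handy for comparing against known patterns.
hex=$(od -A n -t x1 "$sample" | tr -d ' \n')
echo "$hex"    # prints 636166c3a90a

rm -f "$sample"
```

If the same text had been saved as ISO-8859-1, the é would show up as the single byte 351 (octal) or e9 (hex) instead of the pair 303 251 / c3 a9. A pair like that after plain ASCII is a strong hint of UTF-8.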
Encode::MIME::Header(3pm) Perl Programmers Reference Guide Encode::MIME::Header(3pm)
NAME
Encode::MIME::Header -- MIME 'B' and 'Q' header encoding
SYNOPSIS
use Encode qw/encode decode/;
$utf8 = decode('MIME-Header', $header);
$header = encode('MIME-Header', $utf8);
ABSTRACT
This module implements RFC 2047 Mime Header Encoding. There are 3 variant encoding names; "MIME-Header", "MIME-B" and "MIME-Q". The difference is described below

              decode()            encode()
----------------------------------------------------
MIME-Header   Both B and Q        =?UTF-8?B?....?=
MIME-B        B only; Q croaks    =?UTF-8?B?....?=
MIME-Q        Q only; B croaks    =?UTF-8?Q?....?=
DESCRIPTION
When you decode(=?encoding?X?ENCODED WORD?=), ENCODED WORD is extracted and decoded for X encoding (B for Base64, Q for Quoted-Printable).
Then the decoded chunk is fed to decode(encoding). So long as encoding is supported by Encode, any source encoding is fine.
When you encode, it just encodes the UTF-8 string with X encoding and then wraps the result as =?UTF-8?X?....?= . The parts that RFC 2047 forbids to encode are left as is, and long lines are folded within 76 bytes per line.
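
The encoded-word layout described above can be reproduced by hand for the B (Base64) case. This is only a sketch of the wire format, not of the module itself: it uses the 'base64' utility instead of Encode, assumes a 'base64' that accepts '-d' for decoding (GNU coreutils and BusyBox do), and skips line folding and the parts RFC 2047 leaves unencoded. The sample text is made up.

```shell
# UTF-8 bytes for "café"; \303\251 is é. (Hypothetical sample text.)
text=$(printf 'caf\303\251')

# Encode: Base64 the UTF-8 bytes, then wrap them as =?charset?B?payload?=.
payload=$(printf '%s' "$text" | base64)
word="=?UTF-8?B?${payload}?="
echo "$word"    # prints =?UTF-8?B?Y2Fmw6k=?=

# Decode: strip the wrapper, then undo the Base64 step.
inner=${word#'=?UTF-8?B?'}
inner=${inner%'?='}
decoded=$(printf '%s' "$inner" | base64 -d)
echo "$decoded"
```

The module does the same two steps in the other order when decoding: it first undoes the B or Q layer, then feeds the bytes to decode() for the charset named in the encoded word.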
BUGS
It would be nice to support encoding to non-UTF8, such as =?ISO-2022-JP? and =?ISO-8859-1?= but that makes the implementation too complicated. These days major mail agents all support =?UTF-8? so I think it is just good enough.
SEE ALSO
Encode
RFC 2047, <http://www.faqs.org/rfcs/rfc2047.html> and many other locations.
perl v5.8.0 2002-06-01 Encode::MIME::Header(3pm)