ASCII to UTF-8 conversion Post: 302578215

10 More Discussions You Might Find Interesting

1. Programming

Howto convert Ascii -> UTF-8 & back C++

While working with russian text under FreeBSD&MySQL I need to convert a string from MySQL to the Unicode format. I've just started my way in C++ under FreeBSD , so please explain me how can I get ascii code of Char variable and also how can i get a character into variable with the specified ascii...

2. Shell Programming and Scripting

ascii conversion

after converting my ebcidic file to ascii i get the following output 2097152+0 records in 1797345+1 records out Why is there a difference in number of records. Is the converson chopping off any records. All i am doing is just a conversion using the following script dd if=xaa cbs=152 ...

3. Shell Programming and Scripting

ascii to ebcdic conversion

Hello, I need a program for ascii to ebsdic conversion. If anybody can help, it'll be greatly appreciated. Thanks.

4. UNIX for Advanced & Expert Users

UTF-8 to EBCDIC conversion in UNIX

Hi all, At present a file from AS400 system is being FTPed to an AIX system. Now, a similar file needs to be sent from our Unix box (Solaris) Is there any tool available which does the conversion in Unix from UTF-8 to EBCDIC? Any suggestions/ pointers are really appreciated. Thanks,...

5. Shell Programming and Scripting

need help with ascii to decimal conversion

Hi, Can anyone please help me ascci to decimal conversion in bash I have a file which contains stream of numbers like this,these are ascci values 729711810132973278105991013268971213233 I want to covert it to its actual value like upper code's decimal is "Have a Nice Day!" ...

6. Shell Programming and Scripting

binary to ascii conversion

Hi, I have got a library file, created by compiling C code. The file information with "file" command, gives it a "application/x-archive" type file. I want to extract the release string of my software from this file, so that i can know which version of C files were used to create the lib. Can...

7. Red Hat

Can't convert 7bit ASCII to UTF-8

Hello, I am trying to convert a 7bit ASCII file to UTF-8. I have used iconv before though it can't recognize it for some reason and says unknown file encoding. When I used ascii2uni package with different package, ./ascii2uni -a K -a I -a J -a X test_file > new_test_file It still...

8. UNIX for Dummies Questions & Answers

Conversion from ansii to UTF 16

Hi I have a big file which is in ansii . I want to convert it to UTF-16 .Please help me on this as I am stuck at this point in unix .

9. Shell Programming and Scripting

Convert UTF-8 file to ASCII/ISO8859-1 OR replace characters

I am trying to develop a script which will work on a source UTF-8 file and perform one or more of the following It will accept the target encoding as an argument e.g. US-ASCII or ISO-8859-1, etc 1. It should replace all occurrences of characters outside target character set by " " (space) or...

10. UNIX for Advanced & Expert Users

EBCDIC to ASCII conversion

Hi, I have a input file which is EBCIDIC and it has packed decimals. Can anyone help me to convert EBCIDIC file to ASCII(Need to convert even Packed decimal values also to normal format). Thanks swapna

LEARN ABOUT DEBIAN

encode::imaputf7

Encode::IMAPUTF7(3pm)					User Contributed Perl Documentation				     Encode::IMAPUTF7(3pm)

NAME

       Encode::IMAPUTF7 - modification of UTF-7 encoding for IMAP

SYNOPSIS

	 use Encode qw/encode decode/;
	 use Encode::IMAPUTF7;

	 print encode('IMAP-UTF-7', 'RA~Xpertoire');
	 print decode('IMAP-UTF-7', R&AOk-pertoire');

ABSTRACT

       IMAP mailbox names are encoded in a modified UTF7 when names contains international characters outside of the printable ASCII range. The
       modified UTF-7 encoding is defined in RFC2060 (section 5.1.3).

       There is another CPAN module with same purpose, Unicode::IMAPUtf7. However, it works correctly only with strings, which encoded form does
       not contain plus sign. For example, the Cyrillic string x{043f}x{0440}x{0435}x{0434}x{043b}x{043e}x{0433} is represented in UTF-7 as
       +BD8EQAQ1BDQEOwQ+BDM- Note the second plus sign 4 characters before the end.  Unicode::IMAPUtf7 encodes the above string as
       +BD8EQAQ1BDQEOwQ&BDM- which is not valid modified UTF-7 (the ampersand and the plus are swapped). The problem is solved by the current
       module, which is slightly modified Encode::Unicode::UTF7 and has nothing common with Unicode::IMAPUtf7.

RFC2060 - section 5.1.3 - Mailbox International Naming Convention
       By convention, international mailbox names are specified using a modified version of the UTF-7 encoding described in [UTF-7].  The purpose
       of these modifications is to correct the following problems with UTF-7:

       1) UTF-7 uses the "+" character for shifting; this conflicts with
	  the common use of "+" in mailbox names, in particular USENET
	  newsgroup names.

       2) UTF-7's encoding is BASE64 which uses the "/" character; this
	  conflicts with the use of "/" as a popular hierarchy delimiter.

       3) UTF-7 prohibits the unencoded usage of ""; this conflicts with
	  the use of "" as a popular hierarchy delimiter.

       4) UTF-7 prohibits the unencoded usage of "~"; this conflicts with
	  the use of "~" in some servers as a home directory indicator.

       5) UTF-7 permits multiple alternate forms to represent the same
	  string; in particular, printable US-ASCII chararacters can be
	  represented in encoded form.

       In modified UTF-7, printable US-ASCII characters except for "&" represent themselves; that is, characters with octet values 0x20-0x25 and
       0x27-0x7e.  The character "&" (0x26) is represented by the two- octet sequence "&-".

       All other characters (octet values 0x00-0x1f, 0x7f-0xff, and all Unicode 16-bit octets) are represented in modified BASE64, with a further
       modification from [UTF-7] that "," is used instead of "/".  Modified BASE64 MUST NOT be used to represent any printing US-ASCII character
       which can represent itself.

       "&" is used to shift to modified BASE64 and "-" to shift back to US- ASCII.  All names start in US-ASCII, and MUST end in US-ASCII (that
       is, a name that ends with a Unicode 16-bit octet MUST end with a "- ").

       For example, here is a mailbox name which mixes English, Japanese, and Chinese text: ~peter/mail/&ZeVnLIqe-/&U,BTFw-

REQUESTS &; BUGS
       Please report any requests, suggestions or bugs via the RT bug-tracking system at http://rt.cpan.org/ or email to
       bug-Encode-IMAPUTF7@rt.cpan.org.

       http://rt.cpan.org/NoAuth/Bugs.html?Dist=Encode-IMAPUTF7 is the RT queue for Encode::IMAPUTF7.  Please check to see if your bug has already
       been reported.

COPYRIGHT

       Copyright 2005 Sava Chankov

       Sava Chankov, sava@cpan.org

       This software may be freely copied and distributed under the same terms and conditions as Perl.

AUTHORS

       Peter Makholm <peter@makholm.net>, current maintainer

       Sava Chankov <sava@cpan.org>, original author

SEE ALSO

       perl(1), Encode.

perl v5.12.4							    2011-09-25						     Encode::IMAPUTF7(3pm)

10 More Discussions You Might Find Interesting

1. Programming

Howto convert Ascii -> UTF-8 & back C++

Discussion started by: macron

2. Shell Programming and Scripting

ascii conversion

Discussion started by: rintingtong

3. Shell Programming and Scripting

ascii to ebcdic conversion

Discussion started by: er_ashu

4. UNIX for Advanced & Expert Users

UTF-8 to EBCDIC conversion in UNIX

Discussion started by: sridhar_423