How to find the file encoding and updating the file encoding?
Post 302525593 by alokjyotibal on Friday 27th of May 2011 06:50:45 AM
I am talking about the auto_ef utility.
Is there anything similar to this in HP-UX?
 

auto_ef(1)							   User Commands							auto_ef(1)

NAME
     auto_ef - auto encoding finder

SYNOPSIS
     /usr/bin/auto_ef [-e encoding_list] [-a] [-l level] [file ...]

     /usr/bin/auto_ef -h

DESCRIPTION
     The auto_ef utility identifies the encoding of a given file. The utility judges the encoding by using the iconv code conversion, determining whether a certain code conversion was successful with the file, and also by performing frequency analyses on the character sequences that appear in the file.

     The auto_ef utility might produce unexpected output if the string is binary, a character table, a localized digit list, or a chronogram, or if the string or file is very small in size (for example, less than 100 bytes).

     ASCII
     ISO-2022-JP         JIS
     eucJP               Japanese EUC
     PCK                 Japanese PC Kanji, CP932, Shift JIS
     UTF-8
     ko_KR.euc           Korean EUC
     ko_KR.cp949         Unified Hangul
     ISO-2022-KR         ISO-2022 Korean
     zh_CN.iso2022-CN    ISO-2022 CN/CN-EXT
     zh_CN.euc           Simplified Chinese EUC, GB2312
     GB18030             Simplified Chinese GB18030/GBK
     zh_TW-big5          BIG5
     zh_TW-euc           Traditional Chinese EUC
     zh_TW.hkscs         Hong Kong BIG5
     iso-8859-1          West European, and similar
     iso-8859-2          East European, and similar
     iso-8859-5          Cyrillic, and similar
     iso-8859-6          Arabic
     iso-8859-7          Greek
     iso-8859-8          Hebrew
     CP1250              windows-1250, corresponding to ISO-8859-2
     CP1251              windows-1251, corresponding to ISO-8859-5
     CP1252              windows-1252, corresponding to ISO-8859-1
     CP1253              windows-1253, corresponding to ISO-8859-7
     CP1255              windows-1255, corresponding to ISO-8859-8
     koi8-r              corresponding to iso-8859-5

     By default, auto_ef returns a single, most likely encoding for text in a specified file. To get all possible encodings for the file, use the -a option.

     Also by default, auto_ef uses the fastest process to examine the file. For more accurate results, use the -l option.

     To examine data with a limited set of encodings, use the -e option.

OPTIONS
     The following options are supported:

     -a
         Shows all possible encodings in order of possibility, with scores in the range between 0.0 and 1.0. A higher score means a higher possibility. For example,

           example% auto_ef -a test_file
           eucJP        0.89
           zh_CN.euc    0.04
           ko_KR.euc    0.01

         Without this option, only one encoding with the highest score is shown.

     -e encoding_list
         Examines data only with specified encodings. For example, when encoding_list is specified as "ko_KR.euc:ko_KR.cp949", auto_ef examines text only with CP949 and ko_KR.euc. Without this option, auto_ef examines text with all encodings. Multiple encodings can be specified by separating the encodings using a colon (:).

     -h
         Shows the usage message.

     -l level
         Specifies the level of judgment. The value of level can be 0, 1, 2, or 3. Level 3 produces the best result but can be slow. Level 0 is fastest but results can be less accurate than in higher levels. The default is level 0.

OPERANDS
     The following operands are supported:

     file    File name to examine.

EXAMPLES
     Example 1 Examining encoding of a file

       example% auto_ef file_name

     Example 2 Examining encoding of a file at level 2

       example% auto_ef -l 2 file_name

     Example 3 Examining encoding of a file with only eucJP or ko_KR.euc

       example% auto_ef -e "eucJP:ko_KR.euc" file_name

EXIT STATUS
     The following exit values are returned:

     0    Successful completion.

     1    An error occurred.

ATTRIBUTES
     See attributes(5) for descriptions of the following attributes:

     +-----------------------------+-----------------------------+
     |       ATTRIBUTE TYPE        |       ATTRIBUTE VALUE       |
     +-----------------------------+-----------------------------+
     |Availability                 |SUNWautoef                   |
     +-----------------------------+-----------------------------+
     |Interface Stability          |See below.                   |
     +-----------------------------+-----------------------------+

     Interface Stability of output format, when option -a is specified, is Evolving. Other interfaces are Stable.

SEE ALSO
     auto_ef(3EXT), libauto_ef(3LIB), attributes(5)

     International Language Environments Guide

SunOS 5.11                        26 Sep 2004                        auto_ef(1)
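
On systems without auto_ef (the HP-UX case asked about in this thread), the first half of the technique described under DESCRIPTION, trying an iconv conversion and checking whether it succeeds, can be roughly approximated in plain shell. The sketch below is not how auto_ef is implemented and does no frequency analysis. The function name guess_encoding and its default candidate list are hypothetical, chosen only for illustration, and the encoding names that iconv accepts differ between platforms, so the candidates will likely need adjusting.

```shell
#!/bin/sh
# guess_encoding FILE [CANDIDATE ...]
# Hypothetical helper: prints the first candidate encoding under which
# FILE converts to UTF-8 without an iconv error. Returns 1 if no
# candidate converts cleanly, 2 if no file was given.
guess_encoding() {
    [ $# -ge 1 ] || return 2
    file=$1
    shift

    # Assumed default candidates; replace with names your iconv knows.
    [ $# -gt 0 ] || set -- ASCII UTF-8 ISO-8859-1

    for enc in "$@"; do
        # Discard the converted text; only the exit status matters here.
        if iconv -f "$enc" -t UTF-8 "$file" >/dev/null 2>&1; then
            printf '%s\n' "$enc"
            return 0
        fi
    done
    return 1
}
```

Candidates are tried in order and the first clean conversion wins, so strict encodings such as ASCII and UTF-8 should be listed before single-byte encodings such as ISO-8859-1, which accept almost any byte sequence. A call might look like: guess_encoding file_name ASCII UTF-8 ISO-8859-1. A successful conversion only shows that the bytes are valid in that encoding, not that the text was actually written in it.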