08-04-2009
Hi,
I was able to successfully convert the file to UTF-8 format using the following command
iconv -f ISO8859-9 -t UTF-8 <input_file> > <output_file>
I still have one issue. We will receive file with encode type format ANSI and in some cases UTF-8.
If the file comes with encode type to ANSI, then using above command, we change the file to UTF-8. This is not an issue.
But if the file is comes with UTF-8 and if we run above command then the file special characters are not coming properly.
We need to run iconv command only if the file encode type is ANSI. If it is UTF-8 then we should not run iconv. How do we identify the encode of file in UNIX. Please help me in finding this.
Thanks.
Venkat
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
I need to capture a file's creation/modification date and time and convert this to a different format, whilst I can easily get the existing format from a ls -l | awk ' { print $......}' or a cut command I do not know how to convert it to a desired format?
I should add that at present the ls -l... (1 Reply)
Discussion started by: barney_clough
1 Replies
2. Shell Programming and Scripting
How can I can convert a string in a shell script that looks something like: ]] to unicode equivalent?
thanks a lot,
webtekie (1 Reply)
Discussion started by: webtekie
1 Replies
3. UNIX for Advanced & Expert Users
:) Hi
i am trying to convert a file which is in UTF8 format to ANSI format i tried to use the function ICONV but it is throwing error
Function i used it as
$ iconv -f UTF8 -t ANSI filename
Error iam getting is NOT Supported UTF8 to ANSI
please some help me out on this.........Let me... (1 Reply)
Discussion started by: rajreddy
1 Replies
4. UNIX for Dummies Questions & Answers
:confused: Hi
i am trying to convert a file which is in UTF8 format to ANSI format i tried to use the function ICONV but it is throwing error
Function i used it as
$ iconv -f UTF8 -t ANSI filename
Error iam getting is NOT Supported UTF8 to ANSI
please some help me out on... (9 Replies)
Discussion started by: rajreddy
9 Replies
5. Shell Programming and Scripting
Hello,
For 2 days now i've been searching for a solution to this. I am now beginning to doubt this is even possible. It's even harder when you don't know how to search for it. (which keywords generate enough relevancy etc..)
I need to parse a config file to generate a CSV file in return.
It... (7 Replies)
Discussion started by: zer0dvide
7 Replies
6. Shell Programming and Scripting
My input file is Pipe delimited with 10 fields, I am trying to create a tab delimited output file with 6 fields from the provided input file.
Below is sample data
Input file
abc||2|PIN|num||||www.123.com|abc@123.com|
bcd||2|PIN|num|||||abc@123.com|... (3 Replies)
Discussion started by: pasupuleti81
3 Replies
7. Shell Programming and Scripting
Hi,
I am having couple of files which i used to copy from windows to Linux, so now in case of text files (CTRL^M) appears at end of line. I know i can convert this windows format file to unix format file by running dos2unix.
My requirement here is that i want to do it automatically using a... (5 Replies)
Discussion started by: sarbjit
5 Replies
8. Shell Programming and Scripting
How can I get an error when converting 3rd line, since it has invalid characters
abcde
a®cdée
a�cd�
Unicode for
® = ®
é = é
I used "iconv -f UTF-8 -t ISO-8859-15 in.txt > out.txt" (2 Replies)
Discussion started by: arunbs
2 Replies
9. Shell Programming and Scripting
Hi All,
I need help in converting the mentioned file format into desired output format using awk. Could anyone help me in this?
Below is the input..
Date Account Campaign AdGroup Keyword Conversion Revenue Var1 Var2 Var3 Var4 Var5 10 20 30 ... (8 Replies)
Discussion started by: Ravi S M
8 Replies
10. UNIX for Dummies Questions & Answers
My file format:
--------------------------------------------------
Complete Consistency Check
Valid Area : VALID:VALID
Started by : esanwad
Started at : Thu Dec 11 16:04:46 2014
CNA version : R21H04_EC08
Check range : AREA VALID/VALID
... (4 Replies)
Discussion started by: Gautam Banerjee
4 Replies
LEARN ABOUT DEBIAN
plan9-ascii
ASCII(1) General Commands Manual ASCII(1)
NAME
ascii, unicode - interpret ASCII, Unicode characters
SYNOPSIS
ascii [ -8 ] [ -oxdbn ] [ -nct ] [ text ]
unicode [ -nt ] hexmin-hexmax
unicode [ -t ] hex [ ... ]
unicode [ -n ] characters
look hex /lib/unicode
DESCRIPTION
Ascii prints the ASCII values corresponding to characters and vice versa; under the -8 option, the ISO Latin-1 extensions (codes 0200-0377)
are included. The values are interpreted in a settable numeric base; -o specifies octal, -d decimal, -x hexadecimal (the default), and -bn
base n.
With no arguments, ascii prints a table of the character set in the specified base. Characters of text are converted to their ASCII val-
ues, one per line. If, however, the first text argument is a valid number in the specified base, conversion goes the opposite way. Control
characters are printed as two- or three-character mnemonics. Other options are:
-n Force numeric output.
-c Force character output.
-t Convert from numbers to running text; do not interpret control characters or insert newlines.
Unicode is similar; it converts between UTF and character values from the Unicode Standard (see utf(7)). If given a range of hexadecimal
numbers, unicode prints a table of the specified Unicode characters -- their values and UTF representations. Otherwise it translates from
UTF to numeric value or vice versa, depending on the appearance of the supplied text; the -n option forces numeric output to avoid ambigu-
ity with numeric characters. If converting to UTF , the characters are printed one per line unless the -t flag is set, in which case the
output is a single string containing only the specified characters. Unlike ascii, unicode treats no characters specially.
The output of ascii and unicode may be unhelpful if the characters printed are not available in the current font.
The file /lib/unicode contains a table of characters and descriptions, sorted in hexadecimal order, suitable for look(1) on the lower case
hex values of characters.
EXAMPLES
ascii -d
Print the ASCII table base 10.
unicode p
Print the hex value of `p'.
unicode 2200-22f1
Print a table of miscellaneous mathematical symbols.
look 039 /lib/unicode
See the start of the Greek alphabet's encoding in the Unicode Standard.
FILES
/lib/unicode
table of characters and descriptions.
SOURCE
/src/cmd/ascii.c
/src/cmd/unicode.c
SEE ALSO
look(1), tcs(1), utf(7), font(7)
ASCII(1)