My guess is that your destination character set has no direct or equivalent character to translate these characters to, and that's why they're being dropped.
Some testing of my own. Firstly, all I did here was copy and paste the string you provided:
and it was picked up as UTF-8, as you can see. Full disclosure: this was on a Slackware Linux 14.2 system.
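For anyone who wants to repeat that kind of check, here is a minimal sketch. The sample text and the `sample.txt` file name are made up for illustration (it is not the string from the question), and the wording that `file` prints varies between versions, so the script also uses iconv itself as a validity test.

```shell
#!/bin/sh
# Hypothetical sample: "naïve café" written with octal escapes
# (not the string from the thread).
printf 'na\303\257ve caf\303\251\n' > sample.txt

# Ask file(1) for its guess, if it is installed; the wording of the answer varies.
if command -v file >/dev/null 2>&1; then
    file sample.txt
fi

# Independent check: a UTF-8 to UTF-8 conversion only succeeds
# when the input is valid UTF-8.
if iconv -f UTF-8 -t UTF-8 sample.txt >/dev/null 2>&1; then
    is_utf8=yes
else
    is_utf8=no
fi
echo "valid UTF-8: $is_utf8"
```

Note that plain ASCII text also passes the iconv check, since ASCII is a subset of UTF-8.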
So here's what happens when I try converting this to ASCII, and as mentioned I think it fails since these characters simply don't exist in any way in normal ASCII:
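A sketch of that kind of strict conversion, assuming a stand-in sample string ("café") rather than the one from the question. Without `-c` or a `//TRANSLIT` suffix, common iconv implementations (glibc, GNU libiconv) treat a character with no ASCII equivalent as an error and exit non-zero:

```shell
#!/bin/sh
# Hypothetical sample text "café" (octal escapes keep the script itself ASCII).
sample=$(printf 'caf\303\251')

# Strict conversion: no -c, no //TRANSLIT, so an unmappable character is an error.
# The "if" tests the exit status of iconv, the last command in the pipeline.
if printf '%s\n' "$sample" | iconv -f UTF-8 -t ASCII >/dev/null 2>&1; then
    strict_result=converted
else
    strict_result=failed
fi
echo "strict UTF-8 -> ASCII: $strict_result"

# Control: text that is already plain ASCII goes through unchanged.
plain=$(printf 'cafe\n' | iconv -f UTF-8 -t ASCII)
echo "plain ASCII control:   $plain"
```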
However, if I tell iconv to transliterate only what it can, and drop what it can't, things seem to work, although I end up with question marks in the output (since there's nothing to transliterate to):
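The exact command isn't reproduced here, so the following is only a sketch of the two usual ways to make iconv more forgiving, again with a made-up sample string. It assumes the `-c` option (omit what can't be converted) and the `//TRANSLIT` suffix, which is a GNU extension; the replacement it picks depends on the iconv implementation and the locale.

```shell
#!/bin/sh
# Hypothetical sample text "café" (octal escapes keep the script itself ASCII).
sample=$(printf 'caf\303\251')

# -c: silently omit characters that cannot be converted.
# Some iconv versions still exit non-zero when they drop something, hence "|| true".
dropped=$(printf '%s\n' "$sample" | iconv -c -f UTF-8 -t ASCII 2>/dev/null) || true
echo "with -c:         $dropped"

# //TRANSLIT: substitute a similar-looking character where one exists.
# The substitute is implementation- and locale-dependent (for example "e" or "?").
translit=$(printf '%s\n' "$sample" | iconv -f UTF-8 -t ASCII//TRANSLIT 2>/dev/null) || true
echo "with //TRANSLIT: $translit"
```

A question mark in the transliterated output means iconv found nothing it considered a reasonable substitute for that character.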
So I think that's the issue: they're being dropped or giving errors because there isn't anything in your destination character set that iconv regards as an acceptable replacement.
yaz-iconv
YAZ-ICONV(1) Commands YAZ-ICONV(1)
NAME
yaz-iconv - YAZ Character set conversion utility
SYNOPSIS
yaz-iconv [-f from] [-t to] [-v] [file...]
DESCRIPTION
yaz-iconv converts the data in file from the character set specified by from to the character set specified by to.
The yaz-iconv utility is similar to the iconv found on many POSIX systems (Glibc, Solaris, etc).
If no file is specified, yaz-iconv reads from standard input.
OPTIONS
-f from
Specify the character set (from) of the input file. Should be used in conjunction with option -t.
-t to
Specify the character set (to) of the output. Should be used in conjunction with option -f.
-v
Print more information about the conversion process.
ENCODINGS
The yaz-iconv command and the API defined in yaz/yaz-iconv.h are wrappers for the system library function iconv. But YAZ's iconv utility also
implements conversions on its own. The table below lists the character sets (or encodings) that are supported by YAZ. Each character set is
marked with either encode or decode. If an encoding is encode-enabled, YAZ may convert to the designated encoding. If an encoding is
decode-enabled, YAZ may convert from the designated encoding.
marc8 (encode, decode)
The MARC8[1] encoding as defined by the Library of Congress. Most MARC21/USMARC records use this encoding.
marc8s (encode, decode)
Like MARC8, but the conversion prefers non-combined characters in the Latin-1 plane over combined characters.
marc8lossy (encode)
Lossy encoding of MARC-8.
marc8lossless (encode)
Lossless encoding of MARC8.
utf8 (encode, decode)
The most commonly used UNICODE encoding on the Internet.
iso8859-1 (encode, decode)
ISO-8859-1, AKA Latin-1.
iso5426 (decode)
ISO 5426. Some MARC records (UNIMARC) use this encoding.
iso5428:1984 (encode, decode)
ISO 5428:1984.
advancegreek (encode, decode)
An encoding for Greek used by some vendors (Advance).
danmarc (decode)
Danmarc (in Danish)[2] is an encoding based on Unicode which is used for DanMARC2 records.
EXAMPLES
The following command converts from ISO-8859-1 (Latin-1) to UTF-8.
yaz-iconv -f ISO-8859-1 -t UTF-8 -X <input.lst >output.lst
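A small end-to-end sketch of such a conversion, with made-up input. It uses only the `-f` and `-t` options from the synopsis. As an assumption for illustration, it falls back to the system iconv when yaz-iconv is not installed, since both accept the same two options; the file names simply reuse those from the command above.

```shell
#!/bin/sh
# Use yaz-iconv when it is installed; otherwise fall back to the system iconv,
# which takes the same -f/-t options (fallback is an assumption for illustration).
if command -v yaz-iconv >/dev/null 2>&1; then
    conv=yaz-iconv
else
    conv=iconv
fi

# Hypothetical input: "café" in ISO-8859-1, where é is the single byte 0xE9 (octal 351).
printf 'caf\351\n' > input.lst

# No file argument is given, so the converter reads standard input.
"$conv" -f ISO-8859-1 -t UTF-8 <input.lst >output.lst

# Dump the resulting bytes; in UTF-8, é is the two-byte sequence c3 a9.
od -An -tx1 output.lst
```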
FILES
prefix/bin/yaz-iconv
prefix/include/yaz/yaz-iconv.h
SEE ALSO
yaz(7) iconv(1)
NOTES
1. MARC8
http://www.loc.gov/marc/specifications/speccharmarc8.html
2. Danmarc (in Danish)
http://www.kat-format.dk/danMARC2/Danmarc2.4.htm#felt+Indl.+4
YAZ 4.2.30 04/16/2012 YAZ-ICONV(1)