The
echo "invalid characters like Å, å, Ä, ä or"
is providing the input data with illegal characters that need removal. I need some test data and this is one way to demo a command. And the command I am showing is
tr -dc " a-zA-Z0-9,\n"
and that is what removes the garbage. The tr command, in this form, lists the valid characters, not the invalid ones. You may need to add stuff to the list. To replace invalid characters with a space use
Code:
$ echo "invalid characters like Å, å, Ä, ä or"| tr -c ' a-zA-Z0-9,\n' ' '
invalid characters like , , , or
$
I have switched to single quotes which may be better if you need certain special characters to be accepted. In your case you may want to just do
I am working on AIX. We ftp files to a database. The flat files are having thousands of records and each record is having some 50 to 60 characters(there are fields having certain character length). In addition to some valid ascii characters some invalid characters like Å, å, Ä, ä or pipes creep in which... (5 Replies)
This is a pretty straight-forward question. Within a program of mine, I have a string that's going to be used as a filename, but it might have some invalid characters in it that wouldn't be valid in a filename. If there are any invalid characters, I want to get rid of them and essentially squeeze... (4 Replies)
Hi,
I have to write s script to check an input file for invalid characters. In this script I have to find the exact line of the invalid character. If the input file contain 2 invalid character sat line 10 and 17, the script will show the value 10 and 17. Any help is appreciated. (3 Replies)
there is a file is generated from my program due to undefined filename.
-rw-r--r-- 1 angie angie 8644055 Jun 22 09:17 Ô$ÿÿÿÿÿÆ
may i know how to delete this file..??? thanks in advance... :) (5 Replies)
HI,
I have a source file which has the below data.
Tableid,table.txt
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6
Tableid,table
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6
Tableid,table.txt
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6
Tableid,table
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6... (6 Replies)
Hi All -
I'm building a script wherein it is design to remove characters that are not accepted on a non-unicode database. Examples are the following: ï,¿,½,Â,é, etc.
I can easily sed those characters one-by-one but I there's a problem when other unicode characters are found. Is there any way to... (1 Reply)
Hi All,
How to validate the 4th column,it is date column in the file, if it valid move to valid file else moved invalid file.
9f680174-cb87|20077337254|0|20120511|N
9f680174-cb88|20077337254|0|20120534|N
i want two file valid.txt and invalid.txt
Thanks, (7 Replies)
Hello,
Can any one help me in below query to search all the invalid characters that UNIX cannot recognize from a file. can we do anything with the help of grep command or any other commands.
Also, i am not sure what are the invalid characters present in the file.
Many thanks in advance.
... (6 Replies)
My Input file is fixed length record ends with . as end of the line and the character length is 4156
Example:
12234XYZ TY^4253$+00000-00000...........
I need to check is there any control characters(like ^M,^Z)
The line will be splitted
awk
'{id=substr($0,1,5)
nm=substr($0,6,3)... (2 Replies)
Hello guys,
Here i am writing a script to check for a valid url from a file,i am getting the valid url & i print it in a file and i want to print the invalid url also.how to do that?
#here is my script
if
then
URL=$(grep -E -o... (2 Replies)
Discussion started by: Meeran Rizvi
2 Replies
LEARN ABOUT LINUX
iconv
ICONV(1) Debian GNU/Linux ICONV(1)NAME
iconv - Convert encoding of given files from one encoding to another
SYNOPSIS
iconv -f encoding [-t encoding] [inputfile]...
DESCRIPTION
The iconv program converts the encoding of characters in inputfile, or from the standard input if no filename is specified, from one coded
character set to another. The result is written to standard output unless otherwise specified by the --output option.
--from-code, -f encoding
Convert characters from encoding.
--to-code, -t encoding
Convert characters to encoding. If not specified the encoding corresponding to the current locale is used.
--list, -l
List known coded character sets.
-c Omit invalid characters from output.
--output, -o file
Specify output file (instead of stdout).
--silent, -s
Suppress warnings, but not errors.
--verbose
Print progress information.
--help, -?
Give help list.
--usage
Give a short usage message.
--version, -V
Print program version.
ENCODINGS
The values permitted for --from-code and --to-code can be listed by the iconv --list command, and all combinations of the listed values are
supported. Furthermore the following two suffixes are supported:
//TRANSLIT
When the string "//TRANSLIT" is appended to --to-code, transliteration is activated. This means that when a character cannot be
represented in the target character set, it can be approximated through one or several similarly looking characters.
//IGNORE
When the string "//IGNORE" is appended to --to-code, characters that cannot be represented in the target character set will be
silently discarded.
AUTHOR
iconv was written by Ulrich Drepper as part of the GNU C Library.
This man page was written by Joel Klecker <espy@debian.org>, for the Debian GNU/Linux system.
3rd Berkeley Distribution lenny ICONV(1)