02-01-2008
Hi Karthik,
Thanks for the reply.
I do not have a particular list of invalid characters; they come up randomly, for example: ~, æ, £, Ç, Ä, etc. Also, I am not familiar with Perl, so I am clueless there.
Can you please help me in some other way?
-Kanu
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
I am working on AIX. We FTP files to a database. The flat files have thousands of records, and each record has some 50 to 60 characters (the fields have fixed character lengths). In addition to valid ASCII characters, some invalid characters like Å, å, Ä, ä or pipes creep in which... (15 Replies)
Discussion started by: kanu_pathak
2. Programming
This is a pretty straightforward question. Within a program of mine, I have a string that's going to be used as a filename, but it might contain characters that aren't valid in a filename. If there are any invalid characters, I want to get rid of them and essentially squeeze... (4 Replies)
Discussion started by: cleopard
3. Shell Programming and Scripting
Hi,
I have to write a script to check an input file for invalid characters. In this script I have to find the exact line of each invalid character. If the input file contains 2 invalid characters at lines 10 and 17, the script should show the values 10 and 17. Any help is appreciated. (3 Replies)
Discussion started by: beginner82
4. UNIX for Dummies Questions & Answers
A file was generated by my program due to an undefined filename.
-rw-r--r-- 1 angie angie 8644055 Jun 22 09:17 Ô$ÿÿÿÿÿÆ
May I know how to delete this file? Thanks in advance. :) (5 Replies)
Discussion started by: chxxangie
5. Shell Programming and Scripting
Hi,
I have a source file which has the below data.
Tableid,table.txt
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6
Tableid,table
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6
Tableid,table.txt
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6
Tableid,table
sourceid,1,2,3,4,5,6
targetid,1,2,3,4,5,6... (6 Replies)
Discussion started by: shruthidwh
6. Shell Programming and Scripting
Hi All -
I'm building a script designed to remove characters that are not accepted by a non-Unicode database. Examples are the following: ï,¿,½,Â,é, etc.
I can easily sed those characters one by one, but there's a problem when other Unicode characters are found. Is there any way to... (1 Reply)
Discussion started by: Jin_
7. Shell Programming and Scripting
Hi All,
How do I validate the 4th column of the file, which is a date column? If the date is valid, the record should go to a valid file; otherwise it should go to an invalid file.
9f680174-cb87|20077337254|0|20120511|N
9f680174-cb88|20077337254|0|20120534|N
I want two files, valid.txt and invalid.txt.
Thanks, (7 Replies)
Discussion started by: bmk
8. UNIX for Dummies Questions & Answers
Hello,
Can anyone help me with the query below: how do I search a file for all the invalid characters that UNIX cannot recognize? Can we do anything with the grep command or any other command?
Also, I am not sure which invalid characters are present in the file.
Many thanks in advance.
... (6 Replies)
Discussion started by: schandru
9. Shell Programming and Scripting
My input file has fixed-length records; each line ends with . and is 4156 characters long.
Example:
12234XYZ TY^4253$+00000-00000...........
I need to check whether there are any control characters (like ^M, ^Z).
The line will be split
awk
'{id=substr($0,1,5)
nm=substr($0,6,3)... (2 Replies)
Discussion started by: dineshaila
10. UNIX for Beginners Questions & Answers
Hello guys,
Here I am writing a script to check for valid URLs in a file. I am getting the valid URLs and printing them to a file, and I want to print the invalid URLs as well. How do I do that?
#here is my script
if
then
URL=$(grep -E -o... (2 Replies)
Discussion started by: Meeran Rizvi
ValidateFile(3pm) User Contributed Perl Documentation ValidateFile(3pm)
NAME
XMLTV::ValidateFile - Validates an XMLTV file
DESCRIPTION
Utility library that validates that a file is correct according to http://wiki.xmltv.org/index.php/XMLTVFormat.
EXPORTED FUNCTIONS
All these functions are exported on demand.
LoadDtd
Load the xmltv dtd. Takes a single parameter which is the name of the xmltv dtd file.
LoadDtd must be called before ValidateFile can be called.
ValidateFile
Validate that a file is valid according to the XMLTV dtd and try to check that it contains valid information. ValidateFile takes a
filename as parameter and optionally also a day and an offset and prints error messages to STDERR.
ValidateFile returns a list of errors that it found with the file. Each error takes the form of a keyword.
ValidateFile checks for the following errors:
notwell
The file is not well-formed XML.
notdtd
The file does not follow the XMLTV DTD.
unknownid
No channel-entry found for a channelid that is used in a programme-entry.
duplicatechannel
More than one channel-entry found for a channelid.
noprogrammes
No programme entries were found in the file.
channelnoprogramme
There are no programme entries for one of the channels listed with a channel-entry.
invalidid
An xmltvid does not look like a proper id, i.e. it does not match /^[-a-zA-Z0-9]+(.[-a-zA-Z0-9]+)+$/.
noid
A programme-entry without an id was found.
emptytitle
A programme entry with an empty or missing title was found.
emptydescription
A programme entry with an empty desc-element was found. The desc-element shall be omitted if there is no description.
badstart
A programme entry with an invalid start-time was found.
badstop
A programme entry with an invalid stop-time was found.
badepisode
A programme entry with an invalid episode number was found.
badiso8859
The file is encoded in iso-8859 but contains characters that have no meaning in iso-8859 (or are control characters). If the file
is iso-8859-1 (Latin 1), the offending characters may actually be windows-1252 encoded.
badutf8
The file is encoded in utf-8 but contains characters that look strange:
1) Mis-encoded single characters represented with [EF][BF][BD] bytes
2) Mis-encoded single characters represented with [C3][AF][C2][BF][C2][BD] bytes
3) Mis-encoded single characters in range [C2][80-9F]
If no errors are found, an empty list is returned.
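
As a rough illustration of two of the checks listed above (invalidid and badutf8), here is a minimal Python sketch. It is not the module's own code. The helper names looks_like_xmltvid and find_bad_utf8 are hypothetical, and the sketch assumes that the dot in the documented id pattern is meant as a literal dot (\.), on the guess that a backslash was lost when the page was rendered. The three byte patterns are taken directly from the badutf8 entry.

```python
import re
from typing import List

# Assumption: the "." in the documented pattern stands for a literal dot,
# so it is written as \. here. This is a guess, not confirmed by the page.
XMLTVID_RE = re.compile(r"^[-a-zA-Z0-9]+(\.[-a-zA-Z0-9]+)+$")

# The three byte patterns listed under badutf8.
BAD_UTF8_RE = re.compile(
    rb"\xEF\xBF\xBD"               # 1) UTF-8 bytes of U+FFFD (replacement character)
    rb"|\xC3\xAF\xC2\xBF\xC2\xBD"  # 2) EF BF BD read as Latin 1, then encoded as UTF-8 again
    rb"|\xC2[\x80-\x9F]"           # 3) UTF-8 bytes of U+0080..U+009F (C1 control range)
)


def looks_like_xmltvid(channel_id: str) -> bool:
    """Return True if channel_id matches the (assumed) documented id pattern."""
    return XMLTVID_RE.match(channel_id) is not None


def find_bad_utf8(data: bytes) -> List[int]:
    """Return the byte offsets where one of the three suspicious patterns starts."""
    return [m.start() for m in BAD_UTF8_RE.finditer(data)]
```

Calling find_bad_utf8 on the raw bytes of a file (opened in binary mode) gives the offsets of suspicious sequences; an empty list means none of the three patterns occurs. Correctly encoded characters such as æ, £, Ç or Ä are not flagged, because their UTF-8 encodings fall outside these patterns.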
BUGS
It is currently necessary to specify the path to the xmltv dtd-file. This should not be necessary.
COPYRIGHT
Copyright (C) 2006 Mattias Holmlund.
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation,
Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.
perl v5.14.2 2011-06-22 ValidateFile(3pm)