02-10-2012
Would it be possible to re-download these XML files in their original, unconverted state? I think someone tried to strip the UTF-8 characters with cat -v and ruined the files.
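For anyone unsure what that kind of damage looks like, here is a minimal sketch, assuming GNU coreutils cat (other implementations may behave differently depending on the locale). The sample string is made up for illustration and is not taken from the XML files in question.

```shell
# Hypothetical sample: "café" as UTF-8 (the é is the two bytes 0xC3 0xA9).
sample() { printf 'caf\303\251\n'; }

# cat -v rewrites every byte >= 0x80 as "M-" plus a 7-bit character,
# so the two-byte é turns into the six ASCII characters "M-CM-)".
mangled=$(sample | LC_ALL=C cat -v)
printf '%s\n' "$mangled"

# The line also grows, which is one quick way to spot the conversion.
sample | wc -c                      # 6 bytes before
printf '%s\n' "$mangled" | wc -c    # 10 bytes after
```

A literal "M-C" that was already in the text looks exactly like a converted byte, so reversing the conversion involves guesswork. That is why a fresh download is the safer route.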
10 More Discussions You Might Find Interesting
1. UNIX for Advanced & Expert Users
Hi,
One of our applications produces log files. But if we open a log file in vi, less, or view mode, it shows all the special characters in it. 'cat' displays the file correctly, but only the last page stays on screen. If I run 'cat <file_name> | more', the special characters show up again.
... (1 Reply)
Discussion started by: divakarp
1 Replies
2. Shell Programming and Scripting
Hi,
I need some advice on handling non-printable characters above ASCII value 126.
Case 1:
For some fields in the text, I need to retain them 'as-is' and load them into a database. I understand it also depends on the database code page,
but I just want to know how to ensure they do not change while loading... (1 Reply)
Discussion started by: braindrain
1 Replies
3. Shell Programming and Scripting
Here is my simple script to show xterm processes and their owners, excluding me:
ps `-ef |grep xterm |grep -v aucar` | while read a1 a2 a3 a4 a5 a6 a7 a8
do
echo KILL..\($a1\).. $a2 |more
done
How can I pass the values from the command "ps -ef |grep xterm|grep -v aucar" to ?
because the above command... (2 Replies)
Discussion started by: xramm
2 Replies
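One way the pipeline in discussion 3 might be written, as a sketch only: the backticks are dropped so that ps -ef itself feeds the pipe, and the loop reads the owner and PID columns. The function name list_other_xterms is made up here, and the column layout assumes the usual ps -ef output (UID first, PID second).

```shell
# list_other_xterms USER: read "ps -ef" style lines on stdin and print
# one line per xterm process whose line does not mention USER.
list_other_xterms() {
    me=$1
    # '[x]term' matches "xterm" but not this grep's own command line.
    grep '[x]term' | grep -v "$me" | while read -r owner pid rest
    do
        echo "KILL..($owner).. $pid"
    done
}

# Typical call (aucar is the user name from the post):
#   ps -ef | list_other_xterms aucar | more
```

Piping the whole loop into more, rather than each echo, keeps the pager from restarting on every line.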
4. UNIX for Dummies Questions & Answers
Hi,
How do I remove the lines where special characters or Unicode characters appear?
The following query does work but I wonder if there is a better way.
cat test.txt | egrep -v '\)|#|,|&|-|\(|\\|\/|\.'
The following lines show that my query is incomplete.
Warning: The word "*Khan" is... (1 Reply)
Discussion started by: shantanuo
1 Replies
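As a possible alternative to listing every unwanted character, the filter in discussion 4 can be turned around so that it keeps only lines made of an allowed set. This is a sketch under assumptions: the allowed set (ASCII letters, digits and spaces) is a guess at what counts as "not special", and keep_plain_lines is a made-up name.

```shell
# Keep only lines made up entirely of ASCII letters, digits and spaces.
# LC_ALL=C makes awk work byte by byte, so multi-byte UTF-8 characters
# count as "not allowed" as well.
keep_plain_lines() {
    LC_ALL=C awk '!/[^A-Za-z0-9 ]/'
}

# Typical call:
#   keep_plain_lines < test.txt
```

Widening the allowed set only means adding characters inside the brackets, which tends to be easier to maintain than a growing list of exclusions.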
5. Shell Programming and Scripting
I'm trying to check-in a repository to svn -- but the import is failing because some files waaaay down deep in some graphics-library folder are using unicode characters in the file name - which are masked using the ls command but picked up when piping output to more:
# ls -l 1914*
-rwxrwxr-x 1... (2 Replies)
Discussion started by: mshallop
2 Replies
6. Shell Programming and Scripting
Hi,
I have a master file (file.txt) with good and bad records (records with Unicode characters). I have a file with only the bad records (bad.txt).
I want the records in file.txt which are not present in bad.txt, i.e. only the good records.
I tried comm -23 file.txt bad.txt
It is giving... (14 Replies)
Discussion started by: ashwin3086
14 Replies
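comm expects both inputs to be sorted, which may be part of the trouble in discussion 6. Here is a sketch of an alternative that does not need sorting, assuming each bad record appears in file.txt as an identical whole line (good_records is a made-up name):

```shell
# good_records MASTER BAD: print the lines of MASTER that are not in BAD.
#   -F  treat each line of BAD as a fixed string, not a regex
#   -x  match whole lines only
#   -v  invert: keep the lines that did NOT match
#   -f  read the strings from the file BAD
good_records() {
    grep -F -x -v -f "$2" "$1"
}

# Typical call:
#   good_records file.txt bad.txt > good.txt
```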
7. Shell Programming and Scripting
Hi, I'm having trouble getting awk to print all characters between two patterns. I tried more than one solution found on this forum, but with no success.
Probably my mistakes are due to the special characters "" and "]" in the search patterns.
Well, I have a log file like this:
logfile.txt
... (3 Replies)
Discussion started by: ginolatino
3 Replies
8. Shell Programming and Scripting
I have a file with multiple lines. From each line I want to get all strings that start with '+' and end with '/'. Then I want the strings to be separated by ' + '
Example input:
+$A$/NOUN+At/NSUFF_FEM_PL+K/CASE_INDEF_ACC
Sample output:
$A$ + At + K (20 Replies)
Discussion started by: Viernes
20 Replies
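A sketch of one way to get the output asked for in discussion 8 with awk, splitting each line on '+' and keeping the part of each piece before the first '/'. The function name join_stems is made up, and skipping pieces without a '/' is an assumption about the wanted behaviour.

```shell
join_stems() {
    awk -F'[+]' '{
        out = ""; sep = ""
        for (i = 2; i <= NF; i++) {        # field 1 is the text before the first "+"
            slash = index($i, "/")
            if (slash == 0) continue       # no closing "/": not a match
            out = out sep substr($i, 1, slash - 1)
            sep = " + "
        }
        print out
    }'
}

# Typical call:
#   join_stems < input.txt
```

The separator is written as the bracket expression [+] so that awk treats the plus sign as a literal character.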
9. Shell Programming and Scripting
Hey Guys,
I'm swamped writing code for the forums:
Could someone write a script or command line to safely delete files with special chars in filenames from a directory:
Example:
-rw-r--r-- 1 root root 148 Apr 30 23:00 ?xA??
-rw-r--r-- 1 root root 148... (8 Replies)
Discussion started by: Neo
8 Replies
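A cautious sketch for the request in discussion 9: list the suspicious names first, then delete interactively. It assumes "special" means any byte outside printable ASCII, and that find matches -name patterns byte by byte under LC_ALL=C (worth checking on your system). list_odd_names is a made-up name.

```shell
# list_odd_names DIR: print files under DIR whose names contain at least
# one byte outside the printable ASCII range (space through tilde).
list_odd_names() {
    LC_ALL=C find "$1" -type f -name '*[! -~]*' -print
}

# Review the list first, then delete with a prompt for each file:
#   list_odd_names /some/dir
#   LC_ALL=C find /some/dir -type f -name '*[! -~]*' -exec rm -i -- {} \;
```

Letting find hand the names to rm avoids having to type or quote the unprintable characters at all.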
10. UNIX for Beginners Questions & Answers
Hi Team,
I have a file a1.txt with data as follows.
dfjakjf...asdfkasj</EnableQuotedIDs><SQL><SelectStatement modified='1' type='string'><!
The delimiter string: <SelectStatement modified='1' type='string'><!
dlm="<SelectStatement modified='1' type='string'><!
The above command is... (7 Replies)
Discussion started by: kmanivan82
7 Replies
LEARN ABOUT DEBIAN
mkdoc::xml
MKDoc::XML(3pm) User Contributed Perl Documentation MKDoc::XML(3pm)
NAME
MKDoc::XML - The MKDoc XML Toolkit
SYNOPSIS
This is an article, not a module.
SUMMARY
MKDoc is a web content management system written in Perl which focuses on standards compliance, accessibility and usability issues, and
multi-lingual websites.
At MKDoc Ltd we have decided to gradually break up our existing commercial software into a collection of completely independent, well-
documented, well-tested open-source CPAN modules.
Ultimately we want MKDoc code to be a coherent collection of module distributions, yet each distribution should be usable and useful in
itself.
MKDoc::XML is part of this effort.
You could help us and turn some of MKDoc's code into a CPAN module. You can take a look at the existing code at
http://download.mkdoc.org/.
If you are interested in some functionality which you would like to see as a standalone CPAN module, send an email to
<mkdoc-modules@lists.webarch.co.uk>.
DISCLAIMER
MKDoc::XML is a low level XML library.
MKDoc::XML::* modules do not make sure your XML is well-formed.
MKDoc::XML::* modules can be used to work with somewhat broken XML.
MKDoc::XML::* modules should not be used as high-level parsers with general purpose XML unless you know what you're doing.
WHAT'S IN THE BOX
XML tokenizer
MKDoc::XML::Tokenizer splits your XML / XHTML files into a list of MKDoc::XML::Token objects using a single regex.
XML tree builder
MKDoc::XML::TreeBuilder sits on top of MKDoc::XML::Tokenizer and builds parsed trees out of your XML / XHTML data.
XML stripper
MKDoc::XML::Stripper objects remove unwanted markup from your XML / HTML data. Useful to remove all those nasty presentational tags or
'style' attributes from your XHTML data for example.
XML tagger
The MKDoc::XML::Tagger module matches expressions in XML / XHTML documents and tags them appropriately. For example, you could automatically
hyperlink certain glossary words or add <abbr> tags based on a dictionary of abbreviations and acronyms.
XML entity decoder
MKDoc::XML::Decode is a pluggable, configurable entity expander module which currently supports html entities, numerical entities and basic
xml entities.
XML entity encoder
MKDoc::XML::Encode performs the exact reverse operation of MKDoc::XML::Decode.
XML Dumper
MKDoc::XML::Dumper serializes arbitrarily complex perl structures into XML strings. It is also capable of doing the reverse operation, i.e.
deserializing an XML string into a perl structure.
AUTHOR
Copyright 2003 - MKDoc Holdings Ltd.
Author: Jean-Michel Hiver
This module is free software and is distributed under the same license as Perl itself. Use it at your own risk.
SEE ALSO
Petal: http://search.cpan.org/dist/Petal/
MKDoc: http://www.mkdoc.com/
Help us open-source MKDoc. Join the mkdoc-modules mailing list:
mkdoc-modules@lists.webarch.co.uk
perl v5.10.1 2005-03-10 MKDoc::XML(3pm)