XML2ASC(1) HTML-XML-utils XML2ASC(1)NAME
xml2asc - convert UTF-8 to &#nnn; entities
SYNOPSIS
xml2asc
DESCRIPTION
Reads an UTF-8 encoded text from standard input and writes to standard output, converting all non-ASCII characters to &#nnn; entities, so
that the result is ASCII-encoded.
One example use is to convert ISO-8859-1 to ASCII with &#nnn; entities, by first running asc2xml to convert ISO-8859-1 to UTF-8 and then
pipe the result into xml2asc to convert to ASCII with &#nnn; entities for all accented characters.
DIAGNOSTICS
xml2asc returns with a non-zero exit code if the input was not UTF-8.
SEE ALSO asc2xml(1), UTF-8 (RFC 2279)
BUGS
Doesn't distinguish mark-up from content, so if the input uses non-ASCII characters in XML element names, they will be output with numeri-
cal entities in them, which is not legal in XML.
6.x 10 Jul 2011 XML2ASC(1)
Check Out this Related Man Page
HXUNENT(1) HTML-XML-utils HXUNENT(1)NAME
hxunent - replace HTML predefined character entities by UTF-8
SYNOPSIS
hxunent [ -b ] [ -f ] [ file ]
DESCRIPTION
The hxunent command reads the file (or standard input) and copies it to standard output with &-entities by their equivalent character
(encoded as UTF-8). E.g., " is replaced by " and < is replaced by <.
OPTIONS
The following options are supported:
-b The five builtin entities of XML (< > " ' &) are not replaced but copied unchanged. This is necessary if the
output has to be valid XML or SGML.
-f This option changes how unknown entities or lone ampersands are handled. Normally they are copied unchanged, but this option
tries to "fix" them by replacing ampersands by &. Often such stray ampersands are the result of copy and paste of URLs into a
document and then this option indeed fixes them and makes the document valid.
DIAGNOSTICS
The program's exit value is 0 if all went well, otherwise:
1 The input couldn't be read (file not found, file not readable...)
2 Wrong command line arguments.
SEE ALSO asc2xml(1), xml2asc(1), UTF-8 (RFC 2279)
BUGS
The program assumes entities are as defined by HTML. It doesn't read a document's DTD to find the actual definitions in use in a document.
With -f, it will even remove all entities that are not HTML entities.
6.x 10 Jul 2011 HXUNENT(1)
I have a huge matrix file containing some 1.5 million rows and 6000 columns. The matrix looks something like this:
1 2 3
4 5 6
7 8 9
3 4 5
I want to add all the numbers in the columns of this matrix and display the result to my stdout. This means that the numbers in the first column are:
... (2 Replies)
Hi,
Just trying to get to grips with sed and awk for some reporting for work and I need some assistance:
I have a file that lists policy names on the first line and then on the second line whether the policy is active or not.
Policy Name: Policy1
Active: yes
Policy... (8 Replies)
version info :
vi availabe with RHEL 5.4
I have a text file with 10,000 lines. I want to copy lines from 5000th line to 7000th and redirect to a file. Any idea how I can do this?
Note:
The above scenario is just an example. In my actual requirement, the file has 14 million lines and I want... (9 Replies)
Hi everyone,
I know the following questions are noobish questions but I am asking them because I am confused about the basics of history behind UNIX and LINUX.
Ok onto business, my questions are-:
Was/Is UNIX ever an open source operating system ?
If UNIX was... (21 Replies)
Hello,
I couldn't find an actual introduction thread, so I decided to just put this here.
I go by d0wngrade online. I have been programming in multiple languages for about 15+ years. I started with standard web design languages like HTML and CSS, but I then advanced from design to development... (2 Replies)
Hi guys...
The first active code line in AudioScope.sh is set -u .
This causes a complete exit if a variable is used/found but has not been allocated at the start of the program.
However, apart from writing code to do the task, is there a switch to to check which variables have been... (17 Replies)
Hi.
In thread https://www.unix.com/shell-programming-and-scripting/267833-grouping-counting.html rovf and I had a mini-discussion on grep and awk.
Here is a demo script that compares the awk and grep approaches for this single problem:
#!/usr/bin/env bash
# @(#) s2 Demonstrate group... (1 Reply)
Hello,
I have to fish out some specific columns from a file based on the header value. I have the list of columns I need in a different file. I thought I could read in the list of headers I need,
# file with header names of required columns in required order
headers_file=$2
# read contents... (11 Replies)
For those interested in installing dash shell on OSX Lion to help test POSIX compliancy of shell scripts, it is quite easy. I did it like this:
If you don't have gcc on your system:
0. Download and install the Command Line Tools for Xcode package from Sign In - Apple *
1. Download the dash... (2 Replies)
Hello and thanks in advance for any help anyone can offer me
I'm trying to learn the find command and thought I was understanding it... Apparently I was wrong. I was doing compound searches and I started getting weird results with the -size test. I was trying to do a search on a 1G file owned by... (14 Replies)
I have data of an excel files as given below,
file1
org1_1 1 1 2.5 100
org1_2 1 2 5.5 98
org1_3 1 3 7.2 88
file2
org2_1 1 1 2.5 100
org2_2 1 2 5.5 56
org2_3 1 3 7.2 70
I have multiple excel files as above shown.
I have to copy column 1, column 4 and paste into a new excel file as... (26 Replies)
Dear All,
Taking a break from Vue.js coding for the site, SEO and YT videos; and hopefully addressing some well deserved criticism from some here that I have been too focused on the visual aspects of the forums versus the substance and the community....
While the "current generation... (9 Replies)
Hi all...
Well guys and gals, I jumped in at the deep end and found things that PERL cannot do by default.
Many tricky terminal escape codes are not catered for so I had to create workarounds.
One thing I searched for was this:
Passing perl variable to shell command
AND, @Neo this was... (15 Replies)