debian man page for unhtml

UNHTML(1)						      General Commands Manual							 UNHTML(1)

NAME
unhtml - strip the HTML formatting from a document or the standard input stream and display it to the standard output
SYNOPSIS
unhtml -version | [ filename ]
DESCRIPTION
Parses text read from the standard input, or a file if a file name is supplied, and removes any HTML formatting it finds. Prints the resulting cleansed text to the standard output for easy redirection. The version included with this man page has been improved to handle comments and scripts.
OPTIONS
-version Display the version of unhtml and exit.
EXAMPLES
This example scans a file called "index.html" and prints its contents to the standard output with the HTML formatting removed. The standard output is redirected to a file called "index.txt", which will then contain the plain text of the .html file.
    example% unhtml index.html > index.txt
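Both input forms from the SYNOPSIS can be tried side by side. The following is only a sketch: the sample HTML, the temporary directory, and the output file names are made up for illustration, and the sed fallback is a crude stand-in (it deletes single-line tags only) used when unhtml is not installed, not a description of how unhtml itself works.

```shell
#!/bin/sh
# Assumption: if unhtml is not installed, define a rough stand-in so the
# sketch still runs. It only deletes <...> tags that fit on one line and
# is NOT the real program.
if ! command -v unhtml >/dev/null 2>&1; then
    unhtml() { sed -e 's/<[^>]*>//g' "$@"; }
fi

# Hypothetical sample input for the demonstration.
workdir=$(mktemp -d)
printf '<html><body><p>Hello <b>world</b></p></body></html>\n' > "$workdir/index.html"

# File name form: unhtml opens the named file itself.
unhtml "$workdir/index.html" > "$workdir/index.txt"

# Standard input form: no file name is given, so the text comes from the pipe.
cat "$workdir/index.html" | unhtml > "$workdir/stdin.txt"
```

In both cases the stripped text goes to the standard output, so it can be redirected to a file or piped into another command.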
BUGS
If the output is redirected to a file of the same name as the input file, the result is an empty file of that name. This is a consequence of how the shell handles the redirect operator: the shell truncates the output file before unhtml starts reading it. It cannot be corrected in the program.
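A common way around this is to write to a temporary file and replace the original only after unhtml has finished. The sketch below assumes mktemp(1) is available; the file names are hypothetical, and the sed fallback is a crude stand-in used only when unhtml is not installed.

```shell
#!/bin/sh
# Assumption: rough stand-in when unhtml is not installed (single-line tags
# only); it is NOT the real program.
if ! command -v unhtml >/dev/null 2>&1; then
    unhtml() { sed -e 's/<[^>]*>//g' "$@"; }
fi

# Hypothetical input file for the demonstration.
workdir=$(mktemp -d)
page="$workdir/page.html"
printf '<p>Some <i>text</i></p>\n' > "$page"

# Wrong: unhtml "$page" > "$page" leaves an empty file, because the shell
# truncates the target before unhtml gets to read it.

# Write to a temporary file in the same directory, then replace the
# original only if unhtml exited successfully.
tmp=$(mktemp "$workdir/unhtml.XXXXXX")
unhtml "$page" > "$tmp" && mv "$tmp" "$page"
```

Creating the temporary file in the same directory as the target keeps the final mv on one file system, so the replacement is a rename rather than a copy.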
DEVELOPMENT
This document is Copyright (C) 1998 by Kevin Swan.

3 February 1998 UNHTML(1)