debian unhtml man page on unix.com

Debian Man Pages All Man Pages Latest Topics Forum Categories

Query: unhtml

OS: debian

Section: 1

Format: Original Unix Latex Style Formatted with HTML and a Horizontal Scroll Bar

UNHTML(1)						      General Commands Manual							 UNHTML(1)

NAME
       unhtml - strip the HTML formatting from a document or the standard input stream and display it to the standard output

SYNOPSIS
       unhtml -version | [ filename ]

DESCRIPTION
       Parses  text  read  from  the  standard	input, or a file if a file name is supplied, and removes any HTML formatting it finds.	Prints the
       resulting cleansed text to the standard output for easy redirection.  The version included with this man page has been improved	to  handle
       comments and scripts.

OPTIONS
       -version
	      Version.	unhtml will display its version and exit.

EXAMPLES
       This  example  simply  scans  a	file called "index.html" and prints the file to the standard output with the HTML formatting removed.  The
       standard output is redirected to a file called "index.txt" which, after running, will contain the plain text of the .html file.

	      example% unhtml index.html > index.txt

BUGS
       Currently, if the output is redirected to a file of the same name as the input file, the result will be an empty file of the same name, but
       this is really an idiosyncracy of the redirect operator, and cannot be corrected in the program.

DEVELOPMENT
       This document is Copyright (C) 1998 by Kevin Swan.

								  3 February 1998							 UNHTML(1)

debian man page for unhtml