debian man page for djvu2hocr

Query: djvu2hocr

OS: debian

Section: 1

Format: Original Unix Latex Style Formatted with HTML and a Horizontal Scroll Bar

DJVU2HOCR(1)							 djvu2hocr manual						      DJVU2HOCR(1)

NAME
djvu2hocr - DjVu to hOCR converter
SYNOPSIS
djvu2hocr [option...] djvu-file djvu2hocr {--version | --help | -h}
DESCRIPTION
djvu2hocr converts hidden text from a DjVu file to the hOCR[1] format.
OPTIONS
Text segmentation options --word-segmentation=simple Use the same word segmentation as found in the DjVu file. This is the default. --word-segmentation=uax29 Use the Unicode Text Segmentation[2] algorithm to break lines into words, possibly fixing word segmentation found in the DjVu file. Other options --version Output version information and exit. -h, --help Display help and exit.
PORTABILITY
djvu2hocr uses a custom extension to hOCR to retain characters which cannot be directly represented in an HTML/XML document. For example, control character BEL (^G, U+0007), is converted into the following HTML chunk: <span class="djvu_char" title="#x07"> </span>
SEE ALSO
djvu(1)
AUTHOR
Jakub Wilk <jwilk@jwilk.net> Author.
NOTES
1. hOCR http://docs.google.com/View?docid=dfxcv4vc_67g844kf 2. Unicode Text Segmentation http://unicode.org/reports/tr29/ djvu2hocr 0.7.9 03/10/2012 DJVU2HOCR(1)
Related Man Pages
didjvu(1) - debian
hocr2djvused(1) - debian
ocrodjvu(1) - debian
djvuserve(1) - suse
djvuxml(1) - suse
Similar Topics in the Unix Linux Community
Why not a segmentation fault??
Segmentation fault on basic linux commands
Threading Segmentation fault
Segmentation fault in Unix shell (linux OS)
Why does this example C code run and yet SHOULD either not compile or give a segmentation fault?