Query: dawg2wordlist
OS: debian
Section: 1
Format: Original Unix Latex Style Formatted with HTML and a Horizontal Scroll Bar
DAWG2WORDLIST(1) DAWG2WORDLIST(1)NAMEdawg2wordlist - convert a Tesseract DAWG to a wordlistSYNOPSISdawg2wordlist UNICHARSET DAWG WORDLISTDESCRIPTIONdawg2wordlist(1) converts a Tesseract Directed Acyclic Word Graph (DAWG) to a list of words using a unicharset as key.OPTIONSUNICHARSET The unicharset of the language. This is the unicharset generated by mftraining(1). DAWG The input DAWG, created by wordlist2dawg(1) WORDLIST Plain text (output) file in UTF-8, one word per lineSEE ALSOtesseract(1), mftraining(1), wordlist2dawg(1), unicharset(5), combine_tessdata(1) http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3COPYINGCopyright (C) 2012 Google, Inc. Licensed under the Apache License, Version 2.0AUTHORThe Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-present). 02/09/2012 DAWG2WORDLIST(1)
Related Man Pages |
---|
hocr2djvused(1) - debian |
wordlist2dawg(1) - debian |
yagf(1) - debian |
polish(5) - debian |
html2markdown.py3(1) - debian |
Similar Topics in the Unix Linux Community |
---|
Google Voice |
Removed Altered Google GA code |
PM from Google Ireland |
OCR text that needs cleaning |
Google Trends: UNIX |