debian man page for mbtg

Query: mbtg

OS: debian

Section: 1

Format: Original Unix Latex Style Formatted with HTML and a Horizontal Scroll Bar

mbtg(1) 						      General Commands Manual							   mbtg(1)

NAME
MBTG - Memory Based Tagger generator
SYNOPSYS
mbtg -T <filename> -s <setting filename> or mbtg [options]
DESCRIPTION
This programs generates, based on a tagged corpus, all the files needed to be able to tag a text with mbt.
OPTIONS
-h or --help show help -T <tagged training corpus file> or -E <enriched tagged training corpus file> All further options have reasonable defaults, so using them is only needed for the experienced user. See the mbt manual for more details. -s settingsfile mbtg creates this file, which can be used to run mbt with minimal effort. (like mbt -s settings -T somefile) -p pattern the pattern for known words (default ddfa) -P pattern the pattern for unknown words (default dFapsss) -% <number> filter threshold for ambitag construction (default 5%) -l <lexiconfile> -L <file with list of frequent words> -r <ambitagfile> -k <known words case base> -u <unknown words case base> -K <known words instances file> -U <unknown words instances file> -V or --version show version info -e <sentence delimiter> (default '<utt>') -X keep the intermediate files -Otimbl options (Note: there is NO SPACE between O and the options) <options> classifier options for both known and unknown words instances bases K: <options> classifier options for known words instance base U: <options> classifier options for unknown words case base valid timbl options are: a d k m q v w x -
BUGS
possibly
AUTHORS
Ko van der Sloot Timbl@uvt.nl Antal van den Bosch Timbl@uvt.nl
SEE ALSO
timbl(1) mbt(1) mbtserver(1) 2011 march 21 mbtg(1)
Related Man Pages
frog(1) - debian
mbtserver(1) - debian
mbtg(1) - debian
timbl(1) - debian
voikkospell(1) - centos
Similar Topics in the Unix Linux Community
Blank out words
Extract words starting with a pattern from a file
Print one sentence 40 to 50 words end with period in a file
Grepping verbal forms from a large corpus
awk to print filename words along with delimiter