Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

mbtg(1) [debian man page]

mbtg(1) 						      General Commands Manual							   mbtg(1)

NAME
MBTG - Memory Based Tagger generator SYNOPSYS
mbtg -T <filename> -s <setting filename> or mbtg [options] DESCRIPTION
This programs generates, based on a tagged corpus, all the files needed to be able to tag a text with mbt. OPTIONS
-h or --help show help -T <tagged training corpus file> or -E <enriched tagged training corpus file> All further options have reasonable defaults, so using them is only needed for the experienced user. See the mbt manual for more details. -s settingsfile mbtg creates this file, which can be used to run mbt with minimal effort. (like mbt -s settings -T somefile) -p pattern the pattern for known words (default ddfa) -P pattern the pattern for unknown words (default dFapsss) -% <number> filter threshold for ambitag construction (default 5%) -l <lexiconfile> -L <file with list of frequent words> -r <ambitagfile> -k <known words case base> -u <unknown words case base> -K <known words instances file> -U <unknown words instances file> -V or --version show version info -e <sentence delimiter> (default '<utt>') -X keep the intermediate files -Otimbl options (Note: there is NO SPACE between O and the options) <options> classifier options for both known and unknown words instances bases K: <options> classifier options for known words instance base U: <options> classifier options for unknown words case base valid timbl options are: a d k m q v w x - BUGS
possibly AUTHORS
Ko van der Sloot Timbl@uvt.nl Antal van den Bosch Timbl@uvt.nl SEE ALSO
timbl(1) mbt(1) mbtserver(1) 2011 march 21 mbtg(1)

Check Out this Related Man Page

apertium-lextor(1)														apertium-lextor(1)

NAME
apertium-lextor - This application is part of ( apertium ) This tool is part of the apertium machine translation architecture: http://apertium.org. SYNOPSIS
apertium-lextor --trainwrd stopwords words n left right corpus model [ --weightexp w ] [ --debug ] apertium-lextor --trainlch stopwords lexchoices n left right corpus wordmodel dic bildic model [ --weightexp w ] [ --debug ] apertium-lextor --lextor model dic left right [ --debug ] [ --weightexp w ] DESCRIPTION
apertium-lextor is the application responsible for training and usage of the lexical selector module. OPTIONS
--trainwrd | -t Train word co-occurrences model. It needs the following required parameters: stopwords file containing a list of stop words. Stop words are ignored. words file containing a list of words. For each word a co-occurrence model is built. n number of words per co-occurrence model (for each model, the n most frequent words). left left-side context to take into account (number of words). right right-side context to take into account (number of words). corpus file containing the training corpus. model output file on which the co-occurrence models are saved. --trainlch | -r Train lexical choices co-occurrence models using a target language co-occurrence model and a bilingual dictionary. It needs the following required parameters: stopwords file containing a list of stop words. Stop words are ignored. lexchoices file containing a list of lexical choices. For each lexical choice a co-occurrence model is built. n number of words per co-occurrence model (for each model, the n most frequent words). left left-side context to take into account (number of words). right right-side context to take into account (number of words). corpus file containing the training corpus. wordmodel target-language word co-occurrence model (previously trained by means of the --trainwrd option). dic the lexical-selection dictionary (binary format). bildic the bilingual dictionary (binary format). model output file on which the co-occurrence models are saved. --lextor | -l Perform the lexical selection on the input stream. It needs the following required parameters: model file containing the model to be used for the lexical selection. dic lexical-selection dictionary (binary format). left left-side context to take into account (number of words). right right-side context to take into account (number of words). --weightexp w Specify a weight value to change the influence of surrounding words while training or performing the lexical selection. The parameter w must be a positive value. --debug | -d Show debug information while working. --help | -h Shows this help. --version | -v Shows license information. SEE ALSO
apertium-gen-lextorbil(1), apertium-preprocess-corpus-lextor(1), apertium-gen-stopwords-lextor(1), apertium-gen-wlist-lextor(1), aper- tium-gen-wlist-lextor-translation(1), apertium-lextor-eval(1), apertium-lextor-mono(1). BUGS
Lots of...lurking in the dark and waiting for you! AUTHOR
(c) 2005,2006 Universitat d'Alacant / Universidad de Alicante. All rights reserved. 2006-12-12 apertium-lextor(1)
Man Page