mbtg(1) [debian man page]

mbtg(1) 						      General Commands Manual							   mbtg(1)

NAME

       MBTG - Memory Based Tagger generator

SYNOPSIS

       mbtg -T <filename> -s <setting filename>

       or

       mbtg [options]

DESCRIPTION

       This program generates, from a tagged corpus, all the files needed to tag a text with mbt.
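
       A minimal sketch of the two-step workflow, using the two command forms given in this page. The file names (train.tagged, train.settings, test.data) are hypothetical placeholders. The commands are printed first and are executed only if mbtg is installed and the corpus file exists.

```shell
# Hypothetical file names; substitute your own.
corpus=train.tagged        # tagged training corpus
settings=train.settings    # settings file that mbtg creates
testfile=test.data         # file to tag afterwards with mbt

# Step 1: generate the tagger files. Step 2: tag with mbt via the settings file.
gen_cmd="mbtg -T $corpus -s $settings"
tag_cmd="mbt -s $settings -T $testfile"

# Print the two commands in the order they would be run.
printf '%s\n' "$gen_cmd" "$tag_cmd"

# Execute only when mbtg is available and the corpus exists.
# The variables are left unquoted on purpose so they split into arguments.
if command -v mbtg >/dev/null 2>&1 && [ -f "$corpus" ]; then
    $gen_cmd && $tag_cmd
fi
```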

OPTIONS

       -h or --help
	      show help

       -T <tagged training corpus file>

       or

       -E <enriched tagged training corpus file>

       All further options have reasonable defaults, so only experienced users need to set them. See the mbt manual for more details.

       -s settingsfile
	      mbtg creates this file, which can then be used to run mbt with minimal effort (for example: mbt -s settings -T somefile)

       -p pattern
	      the pattern for known words (default ddfa)

       -P pattern
	      the pattern for unknown words (default dFapsss)

       -% <number>
	      filter threshold for ambitag construction (default 5%)

       -l <lexiconfile>

       -L <file with list of frequent words>

       -r <ambitagfile>

       -k <known words case base>

       -u <unknown words case base>

       -K <known words instances file>

       -U <unknown words instances file>

       -V or --version
	      show version info

       -e <sentence delimiter> (default '<utt>')

       -X
	      keep the intermediate files

       -O<timbl options>
	       (Note: there is NO SPACE between O and the options)
		<options>   classifier options for both the known and unknown words instance bases
		K: <options>   classifier options for the known words instance base
		U: <options>   classifier options for the unknown words instance base
		valid timbl options are: a d k m q v w x -
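
       A hedged sketch of how a -O argument might be assembled. It assumes that a K: group and a U: group can be combined in a single -O string; check the mbt manual before relying on this. The option values are illustrative only: in Timbl, -a selects the algorithm (0 is IB1, 1 is IGTree) and -k sets the number of nearest neighbours. Because the option string contains spaces, it must be quoted so that the shell passes it as one argument attached directly to -O.

```shell
corpus=train.tagged            # hypothetical corpus name
known_opts='-a1'               # illustrative: IGTree for the known words base
unknown_opts='-a0 -k3'         # illustrative: IB1 with k=3 for the unknown words base

# Combine both groups; the K: and U: prefixes select the target base.
timbl_opts="K: $known_opts U: $unknown_opts"

# No space between -O and the quoted option string.
cmd="mbtg -T $corpus -O\"$timbl_opts\""
printf '%s\n' "$cmd"
```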

BUGS

       possibly

AUTHORS

       Ko van der Sloot Timbl@uvt.nl

       Antal van den Bosch Timbl@uvt.nl

SEE ALSO

       timbl(1) mbt(1) mbtserver(1)

								   2011 march 21							   mbtg(1)

Check Out this Related Man Page

apertium-lextor(1)														apertium-lextor(1)

NAME

       apertium-lextor - This application is part of (apertium)

       This tool is part of the apertium machine translation architecture: http://apertium.org.

SYNOPSIS

       apertium-lextor --trainwrd stopwords words n left right corpus model [ --weightexp w ] [ --debug ]

       apertium-lextor --trainlch stopwords lexchoices n left right corpus wordmodel dic bildic model [ --weightexp w ] [ --debug ]

       apertium-lextor --lextor model dic left right [ --debug ] [ --weightexp w ]

DESCRIPTION

       apertium-lextor is the application responsible for training and usage of the lexical selector module.

OPTIONS

       --trainwrd | -t
       Train word co-occurrences model. It needs the following required parameters:

       stopwords file containing a list of stop words. Stop words are ignored.

       words file containing a list of words. For each word a co-occurrence model is built.

       n number of words per co-occurrence model (for each model, the n most frequent words).

       left left-side context to take into account (number of words).

       right right-side context to take into account (number of words).

       corpus file containing the training corpus.

       model output file on which the co-occurrence models are saved.

       --trainlch | -r
       Train lexical choices co-occurrence models using a target language co-occurrence model and a bilingual dictionary. It needs the following
       required parameters:

       stopwords file containing a list of stop words. Stop words are ignored.

       lexchoices file containing a list of lexical choices. For each lexical choice a co-occurrence model is built.

       n number of words per co-occurrence model (for each model, the n most frequent words).

       left left-side context to take into account (number of words).

       right right-side context to take into account (number of words).

       corpus file containing the training corpus.

       wordmodel target-language word co-occurrence model (previously trained by means of the --trainwrd option).

       dic the lexical-selection dictionary (binary format).

       bildic the bilingual dictionary (binary format).

       model output file on which the co-occurrence models are saved.
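
       The two training modes can be chained, since --trainlch takes the word model written by --trainwrd. The sketch below only assembles and prints the two command lines in the positional order given in the SYNOPSIS. Every file name and numeric value is a hypothetical placeholder, and the page does not say whether both steps use the same corpus, so two separate names are used.

```shell
# All names and values below are hypothetical placeholders.
n=5        # words kept per co-occurrence model
left=3     # left-side context, in words
right=3    # right-side context, in words

# Step 1: word co-occurrence model, written to words.model.
wrd_cmd="apertium-lextor --trainwrd stopwords.txt words.txt $n $left $right wrd-corpus.txt words.model"

# Step 2: lexical-choice models; words.model from step 1 is the wordmodel argument.
lch_cmd="apertium-lextor --trainlch stopwords.txt lexchoices.txt $n $left $right lch-corpus.txt words.model lextor.dic.bin bildic.bin lextor.model"

printf '%s\n' "$wrd_cmd" "$lch_cmd"
```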

       --lextor | -l
       Perform the lexical selection on the input stream. It needs the following required parameters:

       model file containing the model to be used for the lexical selection.

       dic lexical-selection dictionary (binary format).

       left left-side context to take into account (number of words).

       right right-side context to take into account (number of words).
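
       A sketch of a --lextor invocation, assuming that "the input stream" means standard input and that results go to standard output; the page does not state this explicitly. File names and context sizes are hypothetical. The command line is printed rather than executed.

```shell
# Hypothetical placeholders for the four required parameters.
model=lextor.model     # model to use for lexical selection
dic=lextor.dic.bin     # lexical-selection dictionary (binary format)
left=3                 # left-side context, in words
right=3                # right-side context, in words

lextor_cmd="apertium-lextor --lextor $model $dic $left $right"

# Assumed usage: feed text on stdin, collect the result from stdout.
printf '%s < input.txt > output.txt\n' "$lextor_cmd"
```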

       --weightexp w
       Specify a weight value to change the influence of surrounding words while training or performing the lexical selection. The parameter w
       must be a positive value.

       --debug | -d
       Show debug information while working.

       --help | -h
       Shows this help.

       --version | -v
       Shows license information.

SEE ALSO

       apertium-gen-lextorbil(1), apertium-preprocess-corpus-lextor(1), apertium-gen-stopwords-lextor(1), apertium-gen-wlist-lextor(1),
       apertium-gen-wlist-lextor-translation(1), apertium-lextor-eval(1), apertium-lextor-mono(1).

BUGS

       Lots of...lurking in the dark and waiting for you!

AUTHOR

       (c) 2005,2006 Universitat d'Alacant / Universidad de Alicante. All rights reserved.

								    2006-12-12							apertium-lextor(1)
