Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

mbt(1) [debian man page]

mbt(1)							      General Commands Manual							    mbt(1)

NAME
MBT - Memory Based Tagger SYNOPSYS
mbt [options] DESCRIPTION
mbt is a memory-based tagger that can tag sequences, based on training files generated by mbtg. OPTIONS
-h or --help show help -s settingsfile use a settingsfile as generated by mbtg Or: -l <lexiconfile> -r <ambitagfile> -k <known words case base> -u <unknown words case base> -D <loglevel> Possible options levels are LogNormal , LogDebug , LogHeavy and LogExtreme -e <sentence delimiter> (default '<utt>') -E <enriched tagged testfile> -t <testfile> -T <tagged testfile> (default is untagged stdin) -o <outputfile> (default stdout) -Otimbl options (Note: there is NO SPACE between O and the options) <options> classifier options for both known and unknown words instance bases K: <options> classifier options for known words instance base U: <options> classifier options for unknown words instance base valid timbl options are: a d k m q v w x - -B <beamsize for search> (default = 1) -v di add distance to output -v db add distribution to output -v c add confidence to output -V or --version show version info. -L <file with list of frequent words> BUGS
possibly AUTHORS
Ko van der Sloot Timbl@uvt.nl Antal van den Bosch Timbl@uvt.nl SEE ALSO
timbl(1) mbtg(1) mbtserver(1) 2011 march 21 mbt(1)

Check Out this Related Man Page

frog(1) 						      General Commands Manual							   frog(1)

NAME
frog - Dutch morpho-syntactic analyzer, IOB chunker and dependency parser SYNOPSYS
frog [options] frog -t test-file DESCRIPTION
frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. frog's current version will tok- enize, tag, lemmatize, and morphologically segment word tokens in Dutch text files, add IOB chunks and will assign a dependency graph to each sentence. OPTIONS
-c <configfile> set the configuration using 'file' -d <level> set debug level. -e <encoding> set input encoding. (default UTF8) -h give some help --keep-parser-files=[yes|no] keep the intermediate files from the parser. Last sentence only! -n assume inputfile to hold one sentence per line -o <file> send output to 'file' instead of stdout. Defaults to the name of the inputfile with '.out' appended. --outputdir <dir> send all output to 'dir' instead of stdout. Creates filenames from the inputfilename(s) with '.out' appended. --skip=[mptc] skip parts of the proces: Tokenizer (t), Chunker (c), Multi-Word unit (m) or Parser (p) -Q Enable quotedetection in the tokenizer. May run havock! -S <port> Run a server on 'port' -t <file> process 'file' -x <xmlfile> process 'xmlfile', which is supposed to be in FoLiA format! If 'xmlfile' is empty, and --testdir=<dir> is provided, all files in 'dir' will be processed as FoLia XML. --testdir=<dir> process all files in 'dir'. see also --outputdir --tmpdir=<dir> location to store intermediate files. Default /tmp. -V or --version show version info --xmldir=<dir> generate FoLiA XML output and send it to 'dir'. Creates filenames from the inputfilename with '.xml' appended. -X <file> generate FoLiA XML output and send it to 'file'. Defaults to the name of the inputfile(s) with '.xml' appended. --id=<id> When -X for FoLia is given, use 'id' to give the doc an ID. BUGS
likely AUTHORS
Maarten van Gompel proycon@anaproy.nl Ko van der Sloot Timbl@uvt.nl Antal van den Bosch Timbl@uvt.nl SEE ALSO
ucto(1) 2012 January 31 frog(1)
Man Page