Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

apertium-lextor(1) [debian man page]

apertium-lextor(1)														apertium-lextor(1)

NAME
apertium-lextor - This application is part of ( apertium ) This tool is part of the apertium machine translation architecture: http://apertium.org. SYNOPSIS
apertium-lextor --trainwrd stopwords words n left right corpus model [ --weightexp w ] [ --debug ] apertium-lextor --trainlch stopwords lexchoices n left right corpus wordmodel dic bildic model [ --weightexp w ] [ --debug ] apertium-lextor --lextor model dic left right [ --debug ] [ --weightexp w ] DESCRIPTION
apertium-lextor is the application responsible for training and usage of the lexical selector module. OPTIONS
--trainwrd | -t Train word co-occurrences model. It needs the following required parameters: stopwords file containing a list of stop words. Stop words are ignored. words file containing a list of words. For each word a co-occurrence model is built. n number of words per co-occurrence model (for each model, the n most frequent words). left left-side context to take into account (number of words). right right-side context to take into account (number of words). corpus file containing the training corpus. model output file on which the co-occurrence models are saved. --trainlch | -r Train lexical choices co-occurrence models using a target language co-occurrence model and a bilingual dictionary. It needs the following required parameters: stopwords file containing a list of stop words. Stop words are ignored. lexchoices file containing a list of lexical choices. For each lexical choice a co-occurrence model is built. n number of words per co-occurrence model (for each model, the n most frequent words). left left-side context to take into account (number of words). right right-side context to take into account (number of words). corpus file containing the training corpus. wordmodel target-language word co-occurrence model (previously trained by means of the --trainwrd option). dic the lexical-selection dictionary (binary format). bildic the bilingual dictionary (binary format). model output file on which the co-occurrence models are saved. --lextor | -l Perform the lexical selection on the input stream. It needs the following required parameters: model file containing the model to be used for the lexical selection. dic lexical-selection dictionary (binary format). left left-side context to take into account (number of words). right right-side context to take into account (number of words). --weightexp w Specify a weight value to change the influence of surrounding words while training or performing the lexical selection. The parameter w must be a positive value. --debug | -d Show debug information while working. --help | -h Shows this help. --version | -v Shows license information. SEE ALSO
apertium-gen-lextorbil(1), apertium-preprocess-corpus-lextor(1), apertium-gen-stopwords-lextor(1), apertium-gen-wlist-lextor(1), aper- tium-gen-wlist-lextor-translation(1), apertium-lextor-eval(1), apertium-lextor-mono(1). BUGS
Lots of...lurking in the dark and waiting for you! AUTHOR
(c) 2005,2006 Universitat d'Alacant / Universidad de Alicante. All rights reserved. 2006-12-12 apertium-lextor(1)

Check Out this Related Man Page

AN(6)								   Games Manual 							     AN(6)

NAME
an - Anagram generator SYNOPSIS
an [-w] [-c string] [-d dictionary] [-l number_words] [-m word_length] [-u string] [-n number_anagrams] [-h] [-v] [--words] [--contain string] [--dict dictionary] [--length number_words] [--minimum word_length] [--used string] [--number number_anagrams] [--help] [--ver- sion] PHRASE DESCRIPTION
an finds all anagrams which can be made from the letters in PHRASE, using words in the specified dictionary. The default is to use /usr/share/dict/words. OPTIONS
-w, --words Print words that the letters in PHRASE can make, then exit. -c, --contain string Only print anagrams which contain string. -d, --dict dictionary Use dictionary file to find words which the letters in PHRASE can make. -m, --minimum word_length Only include word which are at least word_length long. -u, --used string Considers that letters in string have already used when analyzing letters in PHRASE. Like --contain but doesn't print out string. -l, --length number_words Finds anagrams which have a maximum of number_words words in them. This is very useful when number of anagrams which are output is large. -n, --number number_anagrams Stops after finding number_anagrams anagrams. -h, --help Print a usage message on standard output, then exit. -v, --version Print version information on standard output, then exit. EXAMPLE
an -c imp 'Paul Martin' Finds anagrams for Paul Martin which contain the word imp. AUTHOR
Paul Martin <pm@debian.org> THANKS
Richard Jones (richard@deep-thought.org) - Coding and algorithm design of original version. Julian Assange (proff@suburbia.net) - Algorithm design of original version. Please note that the email addresses for Richard and Julian are from over ten years ago and probably no longer work. COPYRIGHT
Copyright (c) Paul Martin 2012. All rights reserved. AN(6)
Man Page