frog(1) General Commands Manual frog(1)
NAME
frog - Dutch morpho-syntactic analyzer, IOB chunker and dependency parser
SYNOPSYS
frog [options]
frog -t test-file
DESCRIPTION
frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. frog's current version will tok-
enize, tag, lemmatize, and morphologically segment word tokens in Dutch text files, add IOB chunks and will assign a dependency graph to
each sentence.
OPTIONS
-c <configfile>
set the configuration using 'file'
-d <level>
set debug level.
-e <encoding>
set input encoding. (default UTF8)
-h
give some help
--keep-parser-files=[yes|no]
keep the intermediate files from the parser. Last sentence only!
-n
assume inputfile to hold one sentence per line
-o <file>
send output to 'file' instead of stdout. Defaults to the name of the inputfile with '.out' appended.
--outputdir <dir>
send all output to 'dir' instead of stdout. Creates filenames from the inputfilename(s) with '.out' appended.
--skip=[mptc]
skip parts of the proces: Tokenizer (t), Chunker (c), Multi-Word unit (m) or Parser (p)
-Q
Enable quotedetection in the tokenizer. May run havock!
-S <port>
Run a server on 'port'
-t <file>
process 'file'
-x <xmlfile>
process 'xmlfile', which is supposed to be in FoLiA format! If 'xmlfile' is empty, and --testdir=<dir> is provided, all files in
'dir' will be processed as FoLia XML.
--testdir=<dir>
process all files in 'dir'. see also --outputdir
--tmpdir=<dir>
location to store intermediate files. Default /tmp.
-V or --version
show version info
--xmldir=<dir>
generate FoLiA XML output and send it to 'dir'. Creates filenames from the inputfilename with '.xml' appended.
-X <file>
generate FoLiA XML output and send it to 'file'. Defaults to the name of the inputfile(s) with '.xml' appended.
--id=<id>
When -X for FoLia is given, use 'id' to give the doc an ID.
BUGS
likely
AUTHORS
Maarten van Gompel proycon@anaproy.nl
Ko van der Sloot Timbl@uvt.nl
Antal van den Bosch Timbl@uvt.nl
SEE ALSO
ucto(1)
2012 January 31 frog(1)