Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

voikkospell(1) [centos man page]

VOIKKOSPELL(1)						      General Commands Manual						    VOIKKOSPELL(1)

voikkospell - test program for Voikko spell checker SYNOPSIS
voikkospell [options] DESCRIPTION
voikkospell is a test program for spell checking functionality in libvoikko, library of Finnish language tools. It reads words from stdin (one word on a line) and print them to stdout, adding "C: " in front of correct words and "W: " in front of incorrect words. Common options of different Voikko test programs are listed in COMMON OPTIONS. OPTIONS
-m In addition to spelling result, prints morphological analysis info (A:) for recognized words. -M Prints morphological analysis info (A:) for recognized words without displaying spelling result. -t Prints only "C" or "W" instead of typical full output. -tt Prints only misspelled words. -s Prints suggestions (S:) for incorrectly spelled words. -cn Set cache size parameter to n. -1 disables the cache and 0 is the default. For checking large amounts of unsorted text you may want to set n to 5 to get better performance. -j n Use n threads for spell checking. When more than one thread is used checking is performed using large internal buffers which is why this mode should only be used for batch processing. -xc Like voikkospell -s but output is printed on one line separated by character c without "C", "W" or "S" in front of the words. If c is not defined words are separated by space and suggestions that have spaces in them are not printed. -l Prints a list of available dictionary variants and exits. The first variant is the default to be used when no specific variant has been requested. ignore_nonwords=n accept_first_uppercase=n accept_extra_hyphens=n accept_missing_hyphens=n ocr_suggestions=n Set the value of the specified boolean option. n can be either 0 (false) or 1 (true). COMMON OPTIONS
-p directory (voikkospell, voikkohyphenate, voikkogc) Look from directory before the standard locations when searching for dictionary files. -d variant (voikkospell, voikkohyphenate, voikkogc) Use dictionary variant variant instead of the default dictionary variant. The variant must be represented as a BCP 47 language tag. ignore_dot=n (voikkospell, voikkohyphenate) ignore_numbers=n (voikkospell, voikkohyphenate) Set the value of the specified boolean option. n can be either 0 (false) or 1 (true). -h, --help Print a help message and exit. --version Print version numbers for libvoikko and the test tool. AUTHOR
voikkospell and this manual page were written by Harri Pitkanen ( 2012-02-27 VOIKKOSPELL(1)

Check Out this Related Man Page

sgml-spell-checker(1)						       SGML						     sgml-spell-checker(1)

sgml-spell-checker - SGML spell checker SYNOPSIS
nsgmls -l yourdoc.sgml | sgml-spell-checker [option] ... DESCRIPTION
sgml-spell-checker is a tool that you can use to automatically spell-check your SGML documents. One of the advantages of this tool over some other SGML-aware spell checkers is that it scans your documents in the form in which the SGML parser actually sees it, which means it is not line-based, system entities are resolved, marked sections are treated appropriately, etc. Also, this tool can be made aware of particular DTDs, in the sense that it knows not to spell-check the content of elements that do not represent human-language text, such as <programlisting> in DocBook. An exclusion list for the DocBook DTD is included, others can be added trivially. The input to sgml-spell-checker is the text representation of your SGML document's Element Structure Information Set as generated by nsgmls (from SP or OpenSP; sometimes installed under the name onsgmls). In other words, you need to pipe the output of nsgmls into sgml-spell- checker as shown in the synopsis. Provide to nsgmls the options you need, such as -c to search more catalogs, -i to include a marked sec- tion, or more source files. Do not forget the -l option, or you won't get any file or line references for the misspellings. The second part of the pipe takes a couple of options; see below. Note that if the language of the document does not match your system's locale settings, you need to use the --language option. The output of sgml-spell-checker is a list of the words that are misspelled (in the opinion of aspell), together with file name and line number. Note that the line number designates where the element that contains the word started, not where the word actually is. So most likely you will have to search a few lines below the indicated location. OPTIONS
--debug Debug mode. Generates lots of output not of interest to the normal user. --language=language Sets the language of the document. (The format depends on the aspell installation, but something like en or en_US should work.) By default the language is taken from the system locale settings. --suggestions Shows correction suggestions for misspelled words. --dictionary=file Uses an additional aspell dictionary file. This option may be used multiple times. --dtd=dtd Uses the exclusion list for the specified DTD (e.g., docbook). --help Shows a brief help, then exits. EXAMPLES
nsgmls -l -D . mydoc.sgml | sgml-spell-checker --language=en --dtd=docbook --dictionary=mydict1.aspell --dictionary=mydict2.aspell (You can enter this command all on one line without the backslashes, or on several lines with the backslashes.) NOTES
Read the aspell documentation about how to set up the appropriate dictionaries. In case you're having trouble interpreting the aspell doc- umentation, here's how to make an aspell dictionary file from a flat word list: rm -f mydict1.aspell # aspell won't overwrite existing files aspell --language-tag=xx create master ./mydict1.aspell < mywordlist.txt Watch the slashes. aspell likes to see a slash in the name or it will search some default location. BUGS
This program should be able to identify the language from the document (e.g., <book lang="de">), but aspell doesn't handle changing the language on the fly. AUTHOR
Peter Eisentraut ( SGML spell checker 2004-09-19 sgml-spell-checker(1)
Man Page