Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

swish-e(1) [debian man page]

SWISH-E(1)						       SWISH-E Documentation							SWISH-E(1)

NAME
Swish-e - A Search Engine SYNOPSIS
swish [-e] [-i dir file ... ] [-S system] [-c file] [-f file] [-l] [-v (num)] swish -w word1 word2 ... [-f file1 file2 ...] [-P phrase_delimiter] [-p prop1 ...] [-s sortprop1 [asc|desc] ...] [-m num] [-t str] [-d delim] [-H (num)] [-x output_format] swish -k (char|*) [-f file1 file2 ...] swish -M index1 index2 ... outputfile swish -N /path/to/compare/file swish -V See the the SWISH-RUN(1) man page for details on run-time options. DESCRIPTION
Swish-e is Simple Web Indexing System for Humans - Enhanced. Swish-e can quickly and easily index directories of files or remote web sites and search the generated indexes. Swish-e is extremely fast in both indexing and searching, highly configurable, and can be seamlessly integrated with existing web sites to maintain a consistent design. Swish-e can index web pages, but can just as easily index text files, mailing list archives, or data stored in a relational database. Swish is designed to index small to medium sized collection of documents, Although a few users are indexing over a million documents, typi- cal usage is more often in the tens of thousands. Currently, Swish-e only indexes eight bit character encodings. DOCUMENTATION
Documentation is provided as HTML pages installed in $prefix/share/doc/swish-e where $prefix is /usr/local if building from source, or /usr if installed as part of a package from your OS vendor. Under Windows $prefix is selected at installation time. Documentation is also available on-line at http://swish-e.org. A subset of the documentation is installed as system man pages as well. The following man pages should be installed: swish-e(1) This man page. SWISH-CONFIG(1) Defines options that can be used in a configuration file. SWISH-RUN(1) Describes the run-time options and switches. SWISH-FAQ(1) Answers to commonly asked questions. SWISH-LIBRARY(1) API for the Swish-e search library. Applications can link against this library. SUPPORT
Support for Swish-e is provide via the Swish-e discussion list. See http://swish-e.org for information. 2.4.7 2009-04-04 SWISH-E(1)

Check Out this Related Man Page

httpindex(1)						      General Commands Manual						      httpindex(1)

NAME
httpindex - HTTP front-end for SWISH++ indexer SYNOPSIS
wget [ options ] URL... 2>&1 | httpindex [ options ] DESCRIPTION
httpindex is a front-end for index++(1) to index files copied from remote servers using wget(1). The files (in a copy of the remote direc- tory structure) can be kept, deleted, or replaced with their descriptions after indexing. OPTIONS
wget Options The wget(1) options that are required are: -A, -nv, -r, and -x; the ones that are highly recommended are: -l, -nh, -t, and -w. (See the EXAMPLE.) httpindex Options httpindex accepts the same short options as index++(1) except for -H, -I, -l, -r, -S, and -V. The following options are unique to httpindex: -d Replace the text of local copies of retrieved files with their descriptions after they have been indexed. This is useful to display file descriptions in search results without having to have complete copies of the remote files thus saving filesystem space. (See the extract_description() function in WWW(3) for details about how descriptions are extracted.) -D Delete the local copies of retrieved files after they have been indexed. This prevents your local filesystem from filling up with copies of remote files. EXAMPLE
To index all HTML and text files on a remote web server keeping descriptions locally: wget -A html,txt -linf -t2 -rxnv -nh -w2 http://www.foo.com 2>&1 | httpindex -d -e'html:*.html,text:*.txt' Note that you need to redirect wget(1)'s output from standard error to standard output in order to pipe it to httpindex. EXIT STATUS
Exits with a value of zero only if indexing completed sucessfully; non-zero otherwise. CAVEATS
In addition to those for index++(1), httpindex does not correctly handle the use of multiple -e, -E, -m, or -M options (because the Perl script uses the standard GetOpt::Std package for processing command-line options that doesn't). The last of any of those options ``wins.'' The work-around is to use multiple values for those options seperated by commas to a single one of those options. For example, if you want to do: httpindex -e'html:*.html' -e'text:*.txt' do this instead: httpindex -e'html:*.html,text:*.txt' SEE ALSO
index++(1), wget(1), WWW(3) AUTHOR
Paul J. Lucas <pauljlucas@mac.com> SWISH++ August 2, 2005 httpindex(1)
Man Page