BIORUBY(1) General Commands Manual BIORUBY(1)
NAME
br_bioflat.rb -- OBDA flat file indexer
SYNOPSIS
Search:
br_bioflat.rb [--search] [options...]
br_bioflat.rb [--search] [--location DIR] [--dbname DBNAME] [options...] [KEYWORDS]
Create index:
br_bioflat.rb [--create] [--location DIR] [--dbname DBNAME] [--format genbank|embl|fasta] [options...] [--files FILES...]
Update index:
br_bioflat.rb [--update] [--location DIR] [--dbname DBNAME] [options...] [--files FILES...]
Show namespaces:
br_bioflat.rb [--show-namespaces] [--location DIR] [--dbname DBNAME] [DIR/DBNAME]
br_bioflat.rb [--show-namespaces] [--format=CLASS]
br_bioflat.rb [--show-namespaces] [--files file]
DESCRIPTION
This manual page briefly documents br_bioflat.rb.
br_bioflat.rb is an OBDA flat file indexer.
OPTIONS
--search Search a database for keywords.
--namespace, --name
Only valid with the --search option. Set the search namespace. You can set this option multiple times to specify more than one
namespace.
--create Create an index.
--location
Specify the directory.
--dbname Specify the name of the database.
--primary, --secondary
Set the primary and secondary namespaces of the index. Default primary/secondary namespaces depend on the format of the flatfiles.
Only valid with the --create option.
--add-secondary
Add secondary namespaces to the default specification. You can use this option many times. Only valid with the --create option.
--update Update an index.
--sort Sort an index. You can set this to a path to an external sorting program, or BUILTIN to use the builtin sort module. This option
is only valid with --create (or --update) and --type flat options.
--renew Re-read all flatfiles and update whole index. This option is only valid with the --update option.
--show-namespaces
Display the namespaces for an index file.
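EXAMPLES
The following is a minimal sketch of a typical session, assembled only from the options listed in the SYNOPSIS above. The directory, database name, flat file name, and search keyword are hypothetical placeholders, and the run helper is an illustration that is not part of br_bioflat.rb. By default the script only prints the commands; set DRY_RUN=0 to execute them.

```shell
#!/bin/sh
# Hypothetical placeholders; replace with your own values.
LOCATION="${LOCATION:-/tmp/bioflat}"   # index directory (assumed)
DBNAME="${DBNAME:-mydb}"               # database name (assumed)
FLATFILE="${FLATFILE:-sequences.gb}"   # GenBank flat file (assumed)
KEYWORD="${KEYWORD:-AB000100}"         # search keyword (assumed)
DRY_RUN="${DRY_RUN:-1}"

# Print a command; execute it only when DRY_RUN is set to 0.
run() {
    printf '+ %s\n' "$*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

# Create an index from a GenBank flat file.
run br_bioflat.rb --create --location "$LOCATION" --dbname "$DBNAME" --format genbank --files "$FLATFILE"

# Display the namespaces recorded for the new index.
run br_bioflat.rb --show-namespaces --location "$LOCATION" --dbname "$DBNAME"

# Search the database for a keyword.
run br_bioflat.rb --search --location "$LOCATION" --dbname "$DBNAME" "$KEYWORD"

# Re-read all flat files and rebuild the whole index.
run br_bioflat.rb --update --renew --location "$LOCATION" --dbname "$DBNAME"
```

Printing the commands first makes it easy to check the option combinations before touching an existing index, since --renew is only valid together with --update.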
AUTHOR
This manual page was written by David Nusinow <dnusinow@debian.org> for the Debian system (but may be used by others). Permission is granted
to copy, distribute and/or modify this document under the terms of the GNU General Public License, Version 2 or any later version published by
the Free Software Foundation.
On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL.
BIORUBY(1)
MKNMZ(1) Namazu Project MKNMZ(1)
NAME
mknmz - an indexer of Namazu
SYNOPSIS
mknmz [options] <target>...
DESCRIPTION
mknmz 2.0.9, an indexer of Namazu.
Target files:
-a, --all
target all files.
-t, --media-type=MTYPE
set the media type for all target files to MTYPE.
-h, --mailnews
same as --media-type='message/rfc822'
--mhonarc
same as --media-type='text/html; x-type=mhonarc'
-F, --target-list=FILE
load FILE which contains a list of target files.
--allow=PATTERN
set PATTERN for file names which should be allowed.
--deny=PATTERN
set PATTERN for file names which should be denied.
--exclude=PATTERN
set PATTERN for pathnames which should be excluded.
-e, --robots
exclude HTML files containing <meta name="ROBOTS" content="NOINDEX">
-M, --meta
handle HTML meta tags for field-specified search.
-r, --replace=CODE
set CODE for replacing URI.
--html-split
split an HTML file with <a name="..."> anchors.
--mtime=NUM
limit by mtime just like find(1)'s -mtime option, e.g., -50 for files modified within the last 50 days, +50 for files older than 50 days.
Morphological Analysis:
-c, --use-chasen
use ChaSen for analyzing Japanese.
-k, --use-kakasi
use KAKASI for analyzing Japanese.
-m, --use-chasen-noun
use ChaSen for extracting only nouns.
-L, --indexing-lang=LANG
index with language specific processing.
Text Operations:
-E, --no-edge-symbol
remove symbols on edge of word.
-G, --no-okurigana
remove Okurigana in word.
-H, --no-hiragana
ignore words consisting of Hiragana only.
-K, --no-symbol
remove symbols.
Summarization:
-U, --no-encode-uri
do not encode URI.
-x, --no-heading-summary
do not make summary with HTML's headings.
Index Construction:
--update=INDEX
set INDEX for updating.
-Y, --no-delete
do not detect removed documents.
-Z, --no-update
do not detect updated and deleted documents.
Miscellaneous:
-s, --checkpoint
turn on the checkpoint mechanism.
-C, --show-config
show the current configuration.
-f, --config=FILE
use FILE as a config file.
-I, --include=FILE
include your customization FILE.
-O, --output-dir=DIR
set DIR to output the index.
-T, --template-dir=DIR
set DIR having NMZ.{head,foot,body}.*.
-q, --quiet
suppress status messages during execution.
-v, --version
show the version of namazu and exit.
-V, --verbose
be verbose.
--debug
run in debug mode.
--help show this help and exit.
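EXAMPLES
As a rough sketch of how the options above combine, the script below builds two mknmz command lines using only flags from this page. The document and index directories are hypothetical, and the run helper is an illustration that is not part of Namazu. Commands are printed rather than executed unless DRY_RUN=0 is set.

```shell
#!/bin/sh
# Hypothetical directories; replace with your own.
DOCS="${DOCS:-/tmp/docs}"            # target directory to index (assumed)
INDEX="${INDEX:-/tmp/namazu-index}"  # where the index is written (assumed)
DRY_RUN="${DRY_RUN:-1}"

# Print a command; execute it only when DRY_RUN is set to 0.
run() {
    printf '+ %s\n' "$*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

# Index every file under $DOCS, suppressing status messages.
run mknmz --all --quiet --output-dir="$INDEX" "$DOCS"

# Index only files modified within the last 50 days.
run mknmz --mtime=-50 --output-dir="$INDEX" "$DOCS"
```

The sign on --mtime follows find(1): a leading minus selects recent files, a leading plus selects older ones.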
REPORTING BUGS
Report bugs to <bug-namazu@namazu.org>.
COPYRIGHT
Copyright (C) 1997-1999 Satoru Takabayashi All rights reserved.
Copyright (C) 2000,2001 Namazu Project All rights reserved.
This is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free
Software Foundation; either version 2, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
mknmz of Namazu 2.0.9 November 2001 MKNMZ(1)