02-25-2008
keyword searching of documents
Unix based fix-it needed?
Platform and feature: search programs on Apple computers (Leopard or Tiger; 10.4 and above; Spotlight)
Problem: the document search feature of these programs produce hits when keyword(s) used appear anywhere in the document's content.
Change required: we need to limit (.pdf) document searches to the titles we've created.
Present status: we've continued to use Apple's Panther search platform, which has no such problem; new generations of Apple are not compatible with Panther.
Example: Our foundation's Schizophrenia library contains over 35,000 journal articles, housed in over 3,000 (hierarchically organized) folders, within a single (desktop) folder.
The ability to search by document TITLE within all or part of this 18.6 gb matrix (or alternatively, by folder title) is a key feature of this library.
Why Apple? Our article labels are designed to produce (simultaneously) very content-rich and easy to read information (in a complex biomedical discipline); thus, it is extremely desirable to employ: 1) longer label sizes than PCs allow; and 2) grammatical characters (e.g., apostrophe, &, ... ) that PCs do not recognize.
9 More Discussions You Might Find Interesting
1. Solaris
I have installed sudo on our development server (SPARC,Solaris 9) and trying to edit /etc/sudoers file using visudo. Referred the following sites and not able to find the way.
http://www.courtesan.com/sudo/man/sudoers.html
http://www.kempston.net/solaris/sudo.html
To start with sudo, my... (3 Replies)
Discussion started by: chrs0302
3 Replies
2. UNIX for Dummies Questions & Answers
Hi all,
I am using Solaris 8 and have several printers (HP lasers or inkjets) connected behind PCs, printing thus being controlled by LPD.
All I can print is ASCII, not very keen, no images, no boxes etc.
Is there any thread (I assure, I have been searching!) or discussion explaning how to set... (0 Replies)
Discussion started by: nulnul7
0 Replies
3. Solaris
Here is the situation. We are a company that has been using a professional publishing system, the software is called "ProType". It runs on Solaris 2.4, however it is no longer supported and we are forced to move on to Adobe Indesign. We must convert all our documents (thousands) to InDesign format.... (4 Replies)
Discussion started by: Fred Goldman
4 Replies
4. Shell Programming and Scripting
Could do with some help on where to get started really. If anyone could help me it would be greatly appreciated.
I have been working on this for a while now and I don't really know where to start.
I am looking into creating a script that will process website hit files and output statistical... (1 Reply)
Discussion started by: amatuer_lee_3
1 Replies
5. Shell Programming and Scripting
I'm trying this:$ for n in `ls` ; do xterm -e vim $n & ; done
bash: syntax error near unexpected token `;'
$I want to edit my files all at once, not one at a time. How can I do that? (2 Replies)
Discussion started by: Orange Stripes
2 Replies
6. OS X (Apple)
Hi all,
I am in the process of building a shell script as part of a auditing utility. It will search a specified directory for keywords and output results of the file path, and line number that the word was found on. I built a test script (shown below) that does just this, but egrep apparently... (0 Replies)
Discussion started by: tmcmurtr
0 Replies
7. UNIX for Dummies Questions & Answers
Ok, it may sound a bit stupid but I can't find the answer.. How do one put 2 here documents in a row? for instance, i want to do something like:
diff <<eof <<eof2 # doesn't work...
> a
> b
> eof
> a
> c
> eof2
its mostly to satisfy my curiosity!!
thanks,
Anthony (4 Replies)
Discussion started by: anthalamus
4 Replies
8. Shell Programming and Scripting
Hi
I want to implement something like this:
if( keyword1 exists)
then
check if(keyword2 exists in the same line)
then replace keyword 2 with New_Keyword
else
Add New_Keyword at the end of line
end if
eg:
Check for Keyword JUNGLE and add/replace... (7 Replies)
Discussion started by: dashing201
7 Replies
9. UNIX for Beginners Questions & Answers
Hello Folks ,
I am a new bie to the world of unix , what i am planning to do is the I have the location in server to which i am access through the putty and the location is /mt/ttlog/avccomn/logs/201901/19 and at this location the files are listed as show
startjmsnode1.sh_03.out... (7 Replies)
Discussion started by: punpun26262626
7 Replies
htdig(1) General Commands Manual htdig(1)
NAME
htstat - returns statistics on the document and word databases, much like the -s option to htdig or htmerge.
SYNOPSIS
htstat [-v][-a][-c configfile][-u]
DESCRIPTION
Htdig retrieves HTML documents using the HTTP protocol and gathers information from these documents which can later be used to search these
documents. This program can be referred to as the search robot.
OPTIONS
-a Use alternate work files. Tells htstat to append .work to database files, causing a second copy of the database to be built. This
allows the original files to be used by htsearch during the run.
-c configfile
Use the specified configfile instead of the default.
-u Give a list of URLs in the document database.
-v Verbose mode. This increases the verbosity of the program. Using more than 2 is probably only useful for debugging purposes. The
default verbose mode (using only one -v) gives a nice progress report while digging.
FILES
/etc/htdig/htdig.conf
The default configuration file.
SEE ALSO
Please refer to the HTML pages (in the htdig-doc package) /usr/share/doc/htdig-doc/html/index.html and the manual pages htdigconfig(8) ,
htdig(1) and htmerge(1) for a detailed description of ht://Dig and its commands.
AUTHOR
This manual page was written by Robert Ribnitz, based on the HTML documentation of ht://Dig.
January 2004 htdig(1)