Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

plucene::analysis::porterstemfilter(3pm) [debian man page]

Plucene::Analysis::PorterStemFilter(3pm)		User Contributed Perl Documentation		  Plucene::Analysis::PorterStemFilter(3pm)

NAME
Plucene::Analysis::PorterStemFilter - Porter stemming on the token stream SYNOPSIS
# isa Plucene::Analysis:::TokenFilter my $token = $porter_stem_filter->next; DESCRIPTION
This class transforms the token stream as per the Porter stemming algorithm. Note: the input to the stemming filter must already be in lower case, so you will need to use LowerCaseFilter or LowerCaseTokenizer farther down the Tokenizer chain in order for this to work properly! The Porter Stemmer implements Porter Algorithm for normalization of English words by stripping their extensions and is used to generalize the searches. For example, the Porter algorithm maps both 'search' and 'searching' (as well as 'searchnessing') to 'search' such that a query for 'search' will also match documents that contains the word 'searching'. Note that the Porter algorithm is specific to the English language and may give unpredictable results for other languages. Also, make sure to use the same analyzer during the indexing and the searching. You can find more information on the Porter algorithm at www.tartarus.org/~martin/PorterStemmer. A nice online demonstration of the Porter algorithm is available at www.scs.carleton.ca/~dquesnel/java/stuff/PorterApplet.html. METHODS
next my $token = $porter_stem_filter->next; Returns the next input token, after being stemmed. perl v5.12.4 2011-08-14 Plucene::Analysis::PorterStemFilter(3pm)

Check Out this Related Man Page

Text::English(3pm)					User Contributed Perl Documentation					Text::English(3pm)

NAME
Text::English - Porter's stemming algorithm SYNOPSIS
use Text::English; @stems = Text::English::stem( @words ); DESCRIPTION
This routine applies the Porter Stemming Algorithm to its parameters, returning the stemmed words. It is derived from the C program "stemmer.c" as found in freewais and elsewhere, which contains these notes: Purpose: Implementation of the Porter stemming algorithm documented in: Porter, M.F., "An Algorithm For Suffix Stripping," Program 14(3), July 1980, pp. 130-137. Provenance: Written by B. Frakes and C. Cox, 1986. I have re-interpreted areas that use Frakes and Cox's "WordSize" function. My version may misbehave on short words starting with "y", but I can't think of any examples. The step numbers correspond to Frakes and Cox, and are probably in Porter's article (which I've not seen). Porter's algorithm still has rough spots (e.g current/currency, -ings words), which I've not attempted to cure, although I have added support for the British -ise suffix. NOTES
This is version 0.1. I would welcome feedback, especially improvements to the punctuation-stripping step. AUTHOR
Ian Phillipps <ian@unipalm.pipex.com> COPYRIGHT
Copyright Public IP Exchange Ltd (PIPEX). Available for use under the same terms as perl. perl v5.14.2 2005-04-10 Text::English(3pm)
Man Page

11 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

searching a file in a unix server

Please let me know which command and its format to use to search for a file. Windows provides a find option but i would like to know the equivalent in UNIX. for example i need to search for a file called uv.config and where exactly it is located in a unix server. TIA. (4 Replies)
Discussion started by: jhmr7
4 Replies

2. UNIX for Dummies Questions & Answers

searching by string length

Hi, I'm rather new to Unix and I'm trying to write a simple script to search through a dictionary for words based on the letters a user would pass as arguments to the script. Now I have the searching part done. However, the one thig that still eludes me is that I want to only keep the... (4 Replies)
Discussion started by: GADO
4 Replies

3. Shell Programming and Scripting

fast searching algorithm

hello, i need a searching algorithm in unix. since my input file is very bulky, so need a real fast searching algorithm, to match words. i am already using grep. (3 Replies)
Discussion started by: rochitsharma
3 Replies

4. Programming

hash table implementations in C Language

Hello List, Iam searching for a solution where i can use hash based searching . In Detail , I have linked list which will be dynamically increasing . I need a best searching mechanisim such a way that it can take only one itereation . Right now iam using linear search which is taking... (11 Replies)
Discussion started by: vlrk
11 Replies

5. UNIX for Advanced & Expert Users

Fastest way for searching the file

I want to search a file in fastest manner. Presently I am using 'find' command. But it is taking around 15min for searching. Is there any other method through which I can do it fast? (3 Replies)
Discussion started by: vaibhavbhat
3 Replies

6. UNIX for Advanced & Expert Users

search and count

Hi, I would like to seek help regarding searching a pattern on a particular input. Example input: "1|trunc(sysdate-1)|substring(pcol)" I would like to search for "|" and count it. any help will be much appreciated. Thanks! :) Newbie (2 Replies)
Discussion started by: janzper
2 Replies

7. UNIX for Dummies Questions & Answers

searching

How would i search to find all the lines consisting of of only the letter "z" followed by any four characters? (1 Reply)
Discussion started by: trob
1 Replies

8. Shell Programming and Scripting

Finding my lost file by searching for words in it

Got a question for you guys...I am searching through a public directory (that has tons of files) trying to find a file that I was working on a longggggg time ago. I can't remember what it is called, but I do remember the content. It should contains words like this: Joe Pulvo botnet zeus... (5 Replies)
Discussion started by: statichazard
5 Replies

9. Shell Programming and Scripting

searching a file with a specified text without using conventional file searching commands

without using conventional file searching commands like find etc, is it possible to locate a file if i just know that the file that i'm searching for contains a particular text like "Hello world" or something? (5 Replies)
Discussion started by: arindamlive
5 Replies

10. Shell Programming and Scripting

Help with searching a word,find the identifier under which it is there and giving a report

Hi, i have a directory structure which contains few files each. each file in turn has some functions. i'm searching for a word, say 'pen' in all the files present in all the directories. :wall: consider a file file1.c in Dir1. out of the 3 funcs present in this file, func1(pennum) and... (3 Replies)
Discussion started by: i.srini89
3 Replies

11. Shell Programming and Scripting

sed - searching token in certain order

Hello. I would like to write a bash function which would return "true" if the search succeed else return anything else. something like if ] ; then exit 1 fi function my_funct () { find first occurrence $2 in $1 if not found return "false" from that position,... (6 Replies)
Discussion started by: jcdole
6 Replies