Query: text::english
OS: debian
Section: 3pm
Format: Original Unix Latex Style Formatted with HTML and a Horizontal Scroll Bar
Text::English(3pm) User Contributed Perl Documentation Text::English(3pm)NAMEText::English - Porter's stemming algorithmSYNOPSISuse Text::English; @stems = Text::English::stem( @words );DESCRIPTIONThis routine applies the Porter Stemming Algorithm to its parameters, returning the stemmed words. It is derived from the C program "stemmer.c" as found in freewais and elsewhere, which contains these notes: Purpose: Implementation of the Porter stemming algorithm documented in: Porter, M.F., "An Algorithm For Suffix Stripping," Program 14(3), July 1980, pp. 130-137. Provenance: Written by B. Frakes and C. Cox, 1986. I have re-interpreted areas that use Frakes and Cox's "WordSize" function. My version may misbehave on short words starting with "y", but I can't think of any examples. The step numbers correspond to Frakes and Cox, and are probably in Porter's article (which I've not seen). Porter's algorithm still has rough spots (e.g current/currency, -ings words), which I've not attempted to cure, although I have added support for the British -ise suffix.NOTESThis is version 0.1. I would welcome feedback, especially improvements to the punctuation-stripping step.AUTHORIan Phillipps <ian@unipalm.pipex.com>COPYRIGHTCopyright Public IP Exchange Ltd (PIPEX). Available for use under the same terms as perl. perl v5.14.2 2005-04-10 Text::English(3pm)
Related Man Pages |
---|
english(3pm) - mojave |
lingua::stem::da(3pm) - debian |
lingua::stem::enbroken(3pm) - debian |
lingua::stem::it(3pm) - debian |
text::english(3pm) - debian |
Similar Topics in the Unix Linux Community |
---|
leet to English |
How can I get some interesting books? |
Türkler bi baksın |
Infraction for cougar_rea: Failure to repost in English |
Text search |