NAME

KinoSearch::Analysis::Stemmer - Reduce related words to a shared root.

DEPRECATED

The KinoSearch code base has been assimilated by the Apache Lucy project. The "KinoSearch" namespace has been deprecated, but development continues under our new name at our new home: http://lucy.apache.org/

SYNOPSIS

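    # Build the other analyzers used in the chain, then the stemmer.
    my $case_folder = KinoSearch::Analysis::CaseFolder->new;
    my $tokenizer   = KinoSearch::Analysis::Tokenizer->new;
    my $stemmer     = KinoSearch::Analysis::Stemmer->new( language => 'es' );

    # Stemming runs last, after case folding and tokenizing.
    my $polyanalyzer = KinoSearch::Analysis::PolyAnalyzer->new(
        analyzers => [ $case_folder, $tokenizer, $stemmer ],
    );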

This class is a wrapper around Lingua::Stem::Snowball, so it supports the same languages.

DESCRIPTION

Stemmer is an Analyzer that reduces related words to a shared root form using the "Snowball" stemming library. For instance, "horse", "horses", and "horsing" all become "hors", so a search for "horse" also matches documents containing "horses" and "horsing".
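The effect of stemming on matching can be sketched without KinoSearch at all. The example below is an illustration only: toy_stem() is a hypothetical suffix stripper written for this sketch, not the Snowball algorithm, and the %docs data and search() helper are likewise invented here. It shows why indexing and querying by stem lets "horse" find documents that contain only "horses" or "horsing".

```perl
use strict;
use warnings;

# Toy suffix stripper, for illustration only. This is NOT the Snowball
# algorithm; it just tries a few English suffixes in a fixed order.
sub toy_stem {
    my ($word) = @_;
    my $stem = lc $word;
    for my $suffix (qw( ing es s e )) {
        # Strip the suffix only if at least three characters remain.
        if ( $stem =~ /^(.{3,})\Q$suffix\E$/ ) {
            return $1;
        }
    }
    return $stem;
}

# Hypothetical sample documents, keyed by document id.
my %docs = (
    1 => 'Three horses grazing',
    2 => 'No horsing around',
    3 => 'A quiet barn',
);

# Build an inverted index keyed on stems rather than on surface forms.
my %index;
for my $id ( sort keys %docs ) {
    for my $token ( $docs{$id} =~ /(\w+)/g ) {
        $index{ toy_stem($token) }{$id} = 1;
    }
}

# Stem the query the same way, then look it up in the index.
sub search {
    my ($query) = @_;
    my $hits = $index{ toy_stem($query) } || {};
    return sort keys %$hits;
}

my @hits = search('horse');
print "horse matches docs: @hits\n";
```

The point of the sketch is that the same reduction is applied at index time and at search time, which is also why a Stemmer normally sits at the end of a PolyAnalyzer chain, after case folding and tokenizing.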

CONSTRUCTORS

new( [labeled params] )

    my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );

INHERITANCE

KinoSearch::Analysis::Stemmer isa KinoSearch::Analysis::Analyzer isa KinoSearch::Object::Obj.

COPYRIGHT AND LICENSE

Copyright 2005-2011 Marvin Humphrey

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
