Nick Patch > Lingua-Stem-UniNE-0.04 > Lingua::Stem::UniNE::CS

Download:
Lingua-Stem-UniNE-0.04.tar.gz

Dependencies

Annotate this POD

Website

View/Report Bugs
Module Version: 0.04   Source   Latest Release: Lingua-Stem-UniNE-0.07

NAME ^

Lingua::Stem::UniNE::CS - Czech stemmer

VERSION ^

This document describes Lingua::Stem::UniNE::CS v0.04.

SYNOPSIS ^

    use Lingua::Stem::UniNE::CS qw( stem_cs );

    $stem = stem_cs($word);

    # alternate syntax
    $stem = Lingua::Stem::UniNE::CS::stem($word);

DESCRIPTION ^

A light stemmer for the Czech language that removes case endings from nouns and adjectives, possessive adjective endings from names, and takes care of palatalization.

This module provides the stem and stem_cs functions, which are synonymous and can optionally be exported. They accept a single word and return a single stem.

SEE ALSO ^

Lingua::Stem::UniNE provides a stemming object with access to all of the implemented University of Neuchâtel stemmers including this one. It has additional features like stemming lists of words.

Lingua::Stem::Any provides a unified interface to any stemmer on CPAN, including this one, as well as additional features like normalization, casefolding, and in-place stemming.

This stemming algorithm was defined in Indexing and stemming approaches for the Czech language (PDF) by Ljiljana Dolamic and Jacques Savoy and originally implemented by Ljiljana Dolamic in Java.

A Czech stemmer for Snowball by Jimmy O’Regan is available on the Snowball site but not included in the official distribution and therefore not included in Lingua::Stem::Snowball.

ACKNOWLEDGEMENTS ^

Ljiljana Dolamic and Jacques Savoy of the University of Neuchâtel authored the original stemming algorithm that was implemented in this module.

AUTHOR ^

Nick Patch <patch@cpan.org>

COPYRIGHT AND LICENSE ^

© 2012–2013 Nick Patch

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

syntax highlighting: