NAME

Lingua::Stem::UniNE::CS - Czech stemmer

VERSION

This document describes Lingua::Stem::UniNE::CS v0.04.

SYNOPSIS

    use Lingua::Stem::UniNE::CS qw( stem_cs );

    $stem = stem_cs($word);

    # alternate syntax
    $stem = Lingua::Stem::UniNE::CS::stem($word);

DESCRIPTION

A light stemmer for the Czech language that removes case endings from nouns and adjectives, possessive adjective endings from names, and takes care of palatalization.

This module provides the stem and stem_cs functions, which are synonymous and can optionally be exported. They accept a single word and return a single stem.
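
As a minimal sketch of the single-word interface described above, the following applies stem_cs to each word of a list. The stem_all helper and the sample words are illustrative assumptions and not part of this module. The example also assumes the input is already decoded character data (here via use utf8).

```perl
use strict;
use warnings;
use utf8;                                   # the source contains Czech literals
use Lingua::Stem::UniNE::CS qw( stem_cs );

binmode STDOUT, ':encoding(UTF-8)';         # print the stems as UTF-8

# Hypothetical helper (not part of the module): stem_cs takes one word
# and returns one stem, so a list is handled by mapping over it.
sub stem_all {
    my @words = @_;
    return map { stem_cs($_) } @words;
}

my @words = qw( město města městem );       # sample inflected forms
my @stems = stem_all(@words);

printf "%s -> %s\n", $words[$_], $stems[$_] for 0 .. $#words;
```

If list stemming is needed regularly, Lingua::Stem::UniNE (see SEE ALSO) is documented as supporting it, so a helper like this may be unnecessary.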

SEE ALSO

Lingua::Stem::UniNE provides a stemming object with access to all of the implemented University of Neuchâtel stemmers including this one. It has additional features like stemming lists of words.

Lingua::Stem::Any provides a unified interface to any stemmer on CPAN, including this one, as well as additional features like normalization, casefolding, and in-place stemming.

This stemming algorithm was defined in "Indexing and stemming approaches for the Czech language" by Ljiljana Dolamic and Jacques Savoy and originally implemented by Ljiljana Dolamic in Java.

A Czech stemmer for Snowball by Jimmy O’Regan is available on the Snowball site but not included in the official distribution and therefore not included in Lingua::Stem::Snowball.

ACKNOWLEDGEMENTS

Ljiljana Dolamic and Jacques Savoy of the University of Neuchâtel authored the original stemming algorithm that was implemented in this module.

AUTHOR

Nick Patch <patch@cpan.org>

COPYRIGHT AND LICENSE

© 2012–2013 Nick Patch

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.