NAME

Text::TEI::Collate::Lang - base class for collation language-specific extensions

DESCRIPTION

Text::TEI::Collate::Lang is the base package for any language extension (e.g. Text::TEI::Collate::Lang::Latin) used by Text::TEI::Collate. It provides three default subroutines; a language extension should re-implement one or more of them and can rely on the versions defined here for the rest. The package is not currently instantiated; if there were ever a reason to do so, it would be the base class to subclass.

SUBROUTINES

distance

This is a rudimentary, and hopefully pretty quick, word distance function. It counts the occurrences of each letter in each of the two words passed to it and returns the sum of the differences between their letter counts.
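
The letter-count idea can be sketched as follows. This is a minimal illustration of the description above, not the module's own code; the name letter_count_distance is hypothetical, and the sketch assumes the difference for each letter is taken as an absolute value.

```perl
use strict;
use warnings;
use 5.010;    # for the // (defined-or) operator

# Hypothetical helper, not part of Text::TEI::Collate::Lang:
# sum of absolute differences in per-letter counts between two words.
sub letter_count_distance {
    my ( $word1, $word2 ) = @_;

    # Tally how often each letter occurs in each word.
    my ( %count1, %count2 );
    $count1{$_}++ for split //, $word1;
    $count2{$_}++ for split //, $word2;

    # Walk the union of letters seen in either word.
    my %letters  = ( %count1, %count2 );
    my $distance = 0;
    for my $letter ( keys %letters ) {
        $distance += abs( ( $count1{$letter} // 0 ) - ( $count2{$letter} // 0 ) );
    }
    return $distance;
}

print letter_count_distance( 'collate', 'collation' ), "\n";    # prints 4
print letter_count_distance( 'listen',  'silent' ),    "\n";    # prints 0
```

Because only letter counts are compared, letter order is ignored, so anagrams come out at distance 0. That is the price of keeping the function quick.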

canonizer

This is essentially just the lc() builtin function.

comparator

This is a function that replaces all characters with their base character after an NFKD (Normalization Form Compatibility Decomposition) operation.
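
One plausible reading of this description is sketched below, using the NFKD function from the core Unicode::Normalize module. The name base_characters is hypothetical, and treating the "base character" as the first code point of each character's decomposition is an assumption; the module's own code may differ.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFKD);

# Hypothetical helper, not part of Text::TEI::Collate::Lang:
# replace each character with the first code point of its NFKD
# decomposition (assumed here to be its "base character").
sub base_characters {
    my ($word) = @_;
    return join '', map { substr( NFKD($_), 0, 1 ) } split //, $word;
}

# "caf\x{e9}" is "cafe" with an acute accent on the final letter.
print base_characters("caf\x{e9}"), "\n";    # prints cafe
```

Under this reading, two spellings that differ only in diacritics compare as equal, which is the point of a comparator in collation.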

AUTHOR

Tara L Andrews <aurum@cpan.org>
