NAME

Text::Phonetic::Koeln - Kölner Phonetik algorithm

DESCRIPTION

The "Kölner Phonetik" is a phonetic algorithm for indexing names by sound, as pronounced in German. The goal is for names with the same pronunciation to be encoded to the same representation so that they can be matched despite minor differences in spelling.

In contrast to Soundex, this algorithm is suitable for long names because the length of the encoded result is not limited. It finds almost all orthographic variations of a name, but it also produces many false positives.
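The behaviour described above can be illustrated with a minimal sketch. It is written in Python purely for illustration and is not this module's Perl code. The letter rules follow the commonly published description of the Kölner Phonetik (see the Wikipedia article under SEE ALSO); the function name koelner_phonetik and the handling of 'ß' as "SS" are assumptions of this sketch, and the module itself may differ in edge cases.

```python
# Illustrative sketch of the Kölner Phonetik; NOT the code of Text::Phonetic::Koeln.
VOWELS = set("AEIJOUY")
UMLAUTS = str.maketrans({"Ä": "A", "Ö": "O", "Ü": "U"})


def koelner_phonetik(text: str) -> str:
    """Return a digit code for text (hypothetical helper name)."""
    # upper() turns 'ß' into 'SS'; umlauts are folded to their base vowel.
    folded = text.upper().translate(UMLAUTS)
    # Special characters, digits and whitespace are ignored.
    letters = [c for c in folded if "A" <= c <= "Z"]

    digits = []
    for i, c in enumerate(letters):
        prev = letters[i - 1] if i > 0 else ""
        nxt = letters[i + 1] if i + 1 < len(letters) else ""
        if c in VOWELS:
            code = "0"
        elif c == "H":
            code = ""  # H produces no digit
        elif c == "B":
            code = "1"
        elif c == "P":
            code = "3" if nxt == "H" else "1"
        elif c in {"D", "T"}:
            code = "8" if nxt in {"C", "S", "Z"} else "2"
        elif c in {"F", "V", "W"}:
            code = "3"
        elif c in {"G", "K", "Q"}:
            code = "4"
        elif c == "C":
            if i == 0:
                code = "4" if nxt in set("AHKLOQRUX") else "8"
            elif prev in {"S", "Z"}:
                code = "8"
            else:
                code = "4" if nxt in set("AHKOQUX") else "8"
        elif c == "X":
            code = "8" if prev in {"C", "K", "Q"} else "48"
        elif c == "L":
            code = "5"
        elif c in {"M", "N"}:
            code = "6"
        elif c == "R":
            code = "7"
        else:  # only S and Z remain
            code = "8"
        digits.append(code)

    # Step 2: collapse runs of the same digit.
    collapsed = []
    for d in "".join(digits):
        if not collapsed or collapsed[-1] != d:
            collapsed.append(d)
    result = "".join(collapsed)

    # Step 3: drop every '0' except a leading one.
    return result[:1] + result[1:].replace("0", "")


print(koelner_phonetik("Meyer"), koelner_phonetik("Maier"))  # 67 67
print(koelner_phonetik("Müller-Lüdenscheidt"))               # 65752682
```

In this sketch "Meyer", "Maier" and "Mayr" all encode to 67, which shows how spelling variants collapse to one code. The long name is not truncated, whereas Soundex would cut it to a letter and three digits.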

The result is always a sequence of digits. Special characters and whitespace are ignored. If your text might contain non-Latin characters (other than the German umlauts and 'ß'), you should unaccent it before creating a phonetic code.
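One possible way to do this preprocessing is sketched below, again in Python and only as an illustration. It strips combining accents via Unicode decomposition while leaving the German umlauts and 'ß' untouched. The helper name unaccent_keep_german is hypothetical. Letters that have no Unicode decomposition (for example 'Ł' or 'ø') pass through unchanged and would need a separate mapping.

```python
import unicodedata

# Characters that, per the description above, do not need unaccenting.
KEEP = set("äöüÄÖÜß")


def unaccent_keep_german(text: str) -> str:
    """Strip diacritics except on German umlauts (hypothetical helper name)."""
    # Compose first so that 'u' + combining diaeresis becomes a single 'ü'.
    text = unicodedata.normalize("NFC", text)
    out = []
    for ch in text:
        if ch in KEEP:
            out.append(ch)
            continue
        # NFKD splits e.g. 'š' into 's' + combining caron; drop the combining marks.
        decomposed = unicodedata.normalize("NFKD", ch)
        out.append("".join(c for c in decomposed if not unicodedata.combining(c)))
    return "".join(out)


print(unaccent_keep_german("Maroš Kollár"))  # Maros Kollar
print(unaccent_keep_german("Müller"))        # Müller
```

The cleaned string can then be passed to the phonetic encoder.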

AUTHOR

    Maroš Kollár
    CPAN ID: MAROS
    maros [at] k-1.com
    http://www.k-1.com

COPYRIGHT

Text::Phonetic::Koeln is Copyright (c) 2006, 2007 Maroš Kollár. All rights reserved.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

The full text of the license can be found in the LICENSE file included with this module.

SEE ALSO

A description of the algorithm can be found at http://de.wikipedia.org/wiki/K%C3%B6lner_Phonetik

Hans Joachim Postel: Die Kölner Phonetik. Ein Verfahren zur Identifizierung von Personennamen auf der Grundlage der Gestaltanalyse. In: IBM-Nachrichten, vol. 19, 1969, pp. 925-931
