The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Lingua::YaTeA::ParsingPatternRecordSet - Perl extension for managing the set of the parsing patterns

SYNOPSIS

  use Lingua::YaTeA::ParsingPatternRecordSet;
  Lingua::YaTeA::ParsingPatternRecordSet->new($file_path,$tag_set,$message_set,$display_language);

DESCRIPTION

The module aims at managing the set of parsing pattern records used in the term extraction process. Each parsing pattern is associated to a record designated by a name defined as the concatenation of the Part-of-Speech tags of the parsing pattern. The module provides methods for managing the sets of parsing patterns read from a config file. The parsing patterns are stored in the field PARSING_RECORDS.

METHODS

new()

 new($file_path,$tag_set,$message_set,$display_language);

The method creates a new parsing pattern set. The parsing patterns are read from the config file $file_path. The parameter $tag_set provides sets of tags for the candidats, prepositions and determiners (this information has been previously loaded thanks to a another module). The parameters $message_set and display_language are used for printing related information in the right language.

loadPatterns()

    loadPatterns($file_path,$tag_set,$message_set,$display_language);

The method calls the parser of the file ($file_path) containing the parsing patterns.

creates a new parsing pattern set. The parameter $tag_set provides sets of tags for the candidats, prepositions and determiners (this information has been previously loaded thanks to a another module). The parameters $message_set and display_language are used for printing related information in the right language.

checkContentWords()

    checkContentWords($num_content_words,$num_line);

The method checks if there is at least a content word or a part-of-speech tag referring a content word in the pattern, otherwise it dies.

addPattern()

    addPattern($pattern);

The method adds the pattern $pattern in the current record of the parsing pattern set.

getRecord()

    getRecord($name);

The method returns the record of parsing pattern designated by the name $name (the concatenation of the Part-of-Speech tags).

addRecord()

    addRecord($name);

The method creates a new record designated by the name $name (the concatenation of the Part-of-Speech tags).

existRecord()

    existRecord($name);

The method checks if it exists a record designated by the name $name (the concatenation of the Part-of-Speech tags).

getRecordSet()

    getRecordSet();

The method returns the set of parsing pattern records.

print()

The method prints the set of parsing pattern records.

SEE ALSO

Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.

AUTHOR

Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Thierry Hamon and Sophie Aubin

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.