The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Lingua::YaTeA::Occurrence - Perl extension for the phrase occurrences

SYNOPSIS

  use Lingua::YaTeA::Occurrence;
  Lingua::YaTeA::Occurrence->new();

DESCRIPTION

This module implements a reprensentation of a phrase occurrence. Each occurrence is described by five fields: an identifier ID, a reference to the object referring the sentence where the phrase occurs SENTENCE, the character where the phrase begins START_CHAR, the character where the phrase ends END_CHAR and the indication whether the occurrence is a maximal noun phrase.

METHODS

new()

  new();

The method creates a new object for a phrase occurrence and returns the object.

getSentence()

  getSentence();

The method returns the obeject referring the sentence where the phrase occurs.

getStartChar()

  getStartChar();

The method returns the start character of the phrase occurrence.

getEndChar()

  getEndChar();

The method returns the end character of the phrase occurrence.

getID()

  getID();

The method returns the identifier of the phrase occurrence.

getDocument()

  getDocument();

The method returns the document reference where the phrase occurs

isMaximal()

  isMaximal();

The medthod indicates if the phrase occurrence is maximal.

setInfoForPhrase()

   setInfoForPhrase(@words, $maximal);

The method sets the information related to the phrase occurrence for the array of words @words. $maximal indicates if the phrase occurrence is maximal.

setInfoForTestifiedTerm()

   setInfoForTestifiedTerm($sentence, $start_char, $end_char);

The method sets the information related to the phrase occurrence for a testified term: the object referring the sentence $sentence, the start character $start_char and the end character $end_char.

print()

   print($fh);

The method prints the information related to the phrase occurrence in the file handler $fh.

isNotBest()

   isNotBest($other_occurrences_a,$parsing_direction);   

The method indicates if the current phrase occurrence is included in one of the occurrence of the array $other_occurrences_a or regarding the most convinient occurrence according to the parsing direction $parsing_direction. In that case, it returns 1, otherwise undef.

crossesWithoutPriority()

   crossesWithoutPriority($other,$parsing_direction);

This method indicates if the current phrase occrrence is partially embeded in a the other phrase occurrence $other, according to the prioritu given by $parsing_direction. In that case, it returns 1, otherwise undef.

isIncludedIn()

   isIncludedIn($other;

The method indicates if the current phrase occurrence is included in the phrase $other. In that case, it returns 1, otherwise undef.

SEE ALSO

Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.

AUTHOR

Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Thierry Hamon and Sophie Aubin

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.