Igor Myroshnichenko > Lingua-EN-Semtags-Engine-0.02 > Lingua::EN::Semtags::Engine

Download:
Lingua-EN-Semtags-Engine-0.02.tar.gz

Dependencies

Annotate this POD

CPAN RT

Open  0
Report a bug
Module Version: 0.02   Source  

NAME ^

Lingua::EN::Semtags::Engine - extract semantic tags (semtags) from English text

SYNOPSIS ^

  use Lingua::EN::Semtags::Engine;
  
  my $engine = Lingua::EN::Semtags::Engine->new;
  my @semtags = $engine->semtags("your blog post title");

DESCRIPTION ^

Lingua::EN::Semtags::Engine uses Lingua::EN::Tagger and WordNet::QueryData to extract semantic tags (semtags) from English text. Semtags are words which reflect the semantic essence of a piece of text (similar to topic keywords).

Lingua::EN::Semtags::Engine was designed and developed to solve a particular problem I was facing.

Problem: a user is processing blog post titles and needs to programmatically determine the posts' semantic context.

Solution: the user feeds a blog post title to Lingua::EN::Semtags::Engine and gets back a set of semtags which can be used for further processing (e.g., web searches).

Example: a blog post title like "BBtv: Graffiti Research Lab, the movie" (boingboing.net, Xeni Jardin, April 24, 2008 8:00 AM) would produce the following semtags: DECORATION WORKPLACE SHOW.

Please note that the module makes the following assumptions when attempting to extract semtags:

METHODS

semtags($string)

Calls sentence($string), gets back a populated instance of Lingua::EN::Semtags::Sentence, iterates over its Lingua::EN::Semtags::LangUnits, populates and returns an array of their semtags.

sentence($string)

Returns an instance of Lingua::EN::Semtags::Sentence, populates it with Lingua::EN::Semtags::LangUnit objects which represent meaningful language units.

tagger()

Returns the Lingua::EN::Tagger instance used by the engine.

verbose([$verbose])

Returns/sets the verbose mode.

wn()

Returns the WordNet::QueryData instance used by the engine.

SEE ALSO ^

Lingua::EN::Tagger, WordNet::QueryData, http://code.google.com/p/lingua-en-semtags-engine

AUTHOR ^

Igor Myroshnichenko <igorm@cpan.org>

Copyright (c) 2008, All Rights Reserved.

This software is free software and may be redistributed and/or modified under the same terms as Perl itself.