
Search::FreeText::LexicalAnalysis::Heuristics - lexical analysis heuristics

A pseudo-filter which does a bit before we get into the real lexical analysis system. This can do full text substitutions and corrections on the free text. It's really there to handle a few minor corrections and linguistic issues which can break the later stages. The main issue it handles is prefixes, which are sometimes fixed with a "-" character and sometimes without. We fix this.

my $stemmer = new Search::FreeText::LexicalAnalysis::Heuristics(); my $words = $lexicaliser->process($oldwords);

Called when the lexicon system is initialised. This method actually does very little, although it could compile and cache stuff if it seemed appropriate.
Called to process a reference to an array containing strings (well, one string) which can then be tokenised for further lexical processing.
Heuristics applied include:

Stuart Watt <S.N.K.Watt@rgu.ac.uk>
Copyright (c) 2003 The Robert Gordon University. All rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.