
TermTagger.pl -- A Perl script for tagging a corpus with terms

TermTagger.pl [options] corpus termlist selected_term_list


This script tags a corpus with terms.
The corpus (corpus) is a file with one sentence per line.
The term list (termlist) is a file containing one term per line.
For each term, additional information (such as a canonical form) can be given after a colon.
Each line of the output file (selected_term_list) contains the sentence number,
the term, and the additional information, separated by tab characters.
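
The file formats described above can be illustrated with a minimal Perl sketch. It is not the code of TermTagger.pl itself: the sample sentences and terms are made up, the helper names parse_term_line and tag_corpus are hypothetical, and it assumes that the additional information follows a colon, that sentence numbers start at 1, and that a naive case-insensitive substring search is good enough for illustration.

```perl
use strict;
use warnings;

# Hypothetical sample data: one sentence per line, one term per line.
my @corpus = (
    'The cell wall protects the bacterium.',
    'Gene expression is regulated by transcription factors.',
);
my @termlist = (
    'cell wall',
    'transcription factors : transcription factor',
);

# Split one term-list line into the term and its optional additional
# information (assumed here to follow a colon).
sub parse_term_line {
    my ($line) = @_;
    my ($term, $info) = split /\s*:\s*/, $line, 2;
    return ($term, defined $info ? $info : '');
}

# Return one tab-separated record per (sentence, term) match.
# Matching is a naive case-insensitive substring search, for illustration only.
sub tag_corpus {
    my ($corpus, $termlist) = @_;
    my @records;
    my $sentence_number = 0;
    for my $sentence (@$corpus) {
        $sentence_number++;    # assumption: numbering starts at 1
        for my $line (@$termlist) {
            my ($term, $info) = parse_term_line($line);
            next unless $sentence =~ /\Q$term\E/i;
            push @records, join("\t", $sentence_number, $term, $info);
        }
    }
    return @records;
}

# Print the records in the output layout described above.
print "$_\n" for tag_corpus(\@corpus, \@termlist);
```

Each printed record has three tab-separated fields: sentence number, term, and additional information (empty when the term list gives none).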

Alvis web site: http://www.alvis.info

Thierry Hamon <thierry.hamon@lipn.univ-paris13.fr>

Copyright (C) 2006 by Thierry Hamon
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.