TermTagger.pl -- A Perl script for tagging corpus with terms
TermTagger.pl [options] corpus termlist selected_term_list lemmatised_corpus
This script tags a corpus with terms. Corpus (corpus) is a file with one sentence per line. Term list (termlist) is a file containing one term per line. For each term, additionnal information (as canonical form) can be given after a column. Each line of the output file (selected_term_list) contains the sentence number, the term, additional information, all separated by a tabulation character.
corpus
termlist
selected_term_list
Alvis web site: http://www.alvis.info
Thierry Hamon <thierry.hamon@limsi.fr>
Copyright (C) 2006 by Thierry Hamon
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.
To install Alvis::TermTagger, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Alvis::TermTagger
CPAN shell
perl -MCPAN -e shell install Alvis::TermTagger
For more information on module installation, please visit the detailed CPAN module installation guide.