nat-codify - Command line tool to codify corpora
nat-codify <file1.nat> <file2.nat> nat-codify -tmx <file.tmx>
The -tokenize flag can be used to force NATools to tokenize the texts. Note that at the moment a Portuguese tokenizer is used for all languages. This might change in the future.
-tokenize
The -id=name flag can be used to force NATools Corpora name. By default the name is read interactively.
-id=name
The -q flag can be used to force quite mode. In thic case, the name is extracted from the file-names.
-q
The -lang=PT..EN flag can be used to force languages.
-lang=PT..EN
NATools documentation, perl(1), nat-create
Alberto Manuel Brandão Simões, <ambs@cpan.org>
Copyright (C) 2002-2012 by Alberto Manuel Brandão Simões
To install Lingua::NATools, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Lingua::NATools
CPAN shell
perl -MCPAN -e shell install Lingua::NATools
For more information on module installation, please visit the detailed CPAN module installation guide.