The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

nat-codify - Command line tool to codify corpora

SYNOPSIS

   nat-codify <file1.nat> <file2.nat>

   nat-codify -tmx <file.tmx>

DESCRIPTION

The -tokenize flag can be used to force NATools to tokenize the texts. Note that at the moment a Portuguese tokenizer is used for all languages. This might change in the future.

The -id=name flag can be used to force NATools Corpora name. By default the name is read interactively.

The -q flag can be used to force quite mode. In thic case, the name is extracted from the file-names.

The -lang=PT..EN flag can be used to force languages.

SEE ALSO

NATools documentation, perl(1), nat-create

AUTHOR

Alberto Manuel Brandão Simões, <ambs@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2002-2012 by Alberto Manuel Brandão Simões