Alberto Manuel Brandão Simões > Lingua-NATools-v0.7.5 > nat-codify

Download:
Lingua/Lingua-NATools-v0.7.5.tar.gz

Annotate this POD

View/Report Bugs
Source  

NAME ^

nat-codify - Command line tool to codify corpora

SYNOPSIS ^

   nat-codify <file1.nat> <file2.nat>

   nat-codify -tmx <file.tmx>

DESCRIPTION ^

The -tokenize flag can be used to force NATools to tokenize the texts. Note that at the moment a Portuguese tokenizer is used for all languages. This might change in the future.

The -id=name flag can be used to force NATools Corpora name. By default the name is read interactively.

The -q flag can be used to force quite mode. In thic case, the name is extracted from the file-names.

The -lang=PT..EN flag can be used to force languages.

SEE ALSO ^

NATools documentation, perl(1), nat-create

AUTHOR ^

Alberto Manuel Brandão Simões, <ambs@cpan.org>

COPYRIGHT AND LICENSE ^

Copyright (C) 2002-2012 by Alberto Manuel Brandão Simões

syntax highlighting: