nat-makeCWB - Dumps a NATools corpus in a format suitable to be imported in CWB
nat-makeCWB [-encode=<CWBName> -d=<CWBCrpDir> [-r=<CWBRegistry>]] <NatCrpDir>
This small scripts exports a NATools corpus directory to a pair of files that can be easily imported in Corpus WorkBench (CWB).
By default nat-makeCWB processes a NATools corpora dir an creates a pair of files, source.cqp and target.cqp that can be later imported into CWB using cwb-align-import.
Flags:
If this option is used then nat-makeCWB will try to use cwb tools to create the aligned corpus. This option should be follows by the corpora name. The corpora creates will nem named name_source and name_target respectively.
name_source
name_target
This option should be used in conjunction with option -d.
-d
The CWB registry directory will be guessed using cwb-config or CORPUS_REGISTRY environment variable. To use other path, please specify it with -r.
cwb-config
CORPUS_REGISTRY
This option is required when using -encode. It specifies CWB corpus directory (without the corpus name).
-encode
Use this option to force a registry path other than the system default.
Use this option if you need to debug the temporary files. If this option is supplied they will not be deleted.
NATools, perl(1)
Alberto Manuel Brandão Simões, <ambs@cpan.org>
Copyright (C) 2010 by Alberto Manuel Brandão Simões
To install Lingua::NATools, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Lingua::NATools
CPAN shell
perl -MCPAN -e shell install Lingua::NATools
For more information on module installation, please visit the detailed CPAN module installation guide.