NAME

orderAlSetAsBilCorpus.pl - Place sentence pairs of a secondary corpus at the head of the Alignment Set, in the same order.

SYNOPSIS

perl orderAlSetAsBilCorpus.pl [options] required_arguments

See description in the manual (-man option).

Required arguments:

        -ist FILENAME    Input source-to-target links file
        -if BLINKER|GIZA|NAACL    Input file(s) format (required if not TALP)
        -cs FILENAME    New corpus source text file
        -ct FILENAME    New corpus target text file
        -ost FILENAME    Output source-to-target links file
        -of BLINKER|GIZA|NAACL    Output file(s) format (required if not TALP)

Options:

        -is FILENAME    Input source words file
        -it FILENAME    Input target words file
        -its FILENAME    Input target-to-source links file
        -os FILENAME    Output source words file
        -ot FILENAME    Output target words file
        -ots FILENAME    Output target-to-source links file
        -range BEGIN-END    Input Alignment Set range
        -alignMode as-is|null-align|no-null-align    Alignment mode
        -help|?    Prints the help and exits
        -man    Prints the manual and exits
        -v INT    Verbose mode

ARGUMENTS

--ist,--i_st,--i_sourceToTarget FILENAME

Input source-to-target (i.e. links) file name (or directory, in case of BLINKER format)

--if,--i_format BLINKER|GIZA|NAACL

Input Alignment Set format (required if different from default, TALP).

--cs,--corpsrc FILENAME

New corpus source text file

--ct,--corptrg FILENAME

New corpus target text file

--ost,--o_st,--o_sourceToTarget FILENAME

Output (new format) source-to-target (i.e. links) file name (or directory, in case of BLINKER format)

--of,--o_format BLINKER|GIZA|NAACL

Output (new) Alignment Set format (required if different from default, TALP)

OPTIONS

--os,--o_s,--o_source FILENAME

Output (new format) source (words) file name. Not applicable in GIZA Format.

--ot,--o_t,--o_target FILENAME

Output (new format) target (words) file name. Not applicable in GIZA Format.

--ots,--o_ts,--o_targetToSource FILENAME

Output (new format) target-to-source (i.e. links) file name (or directory, in case of BLINKER format)

--alignMode as-is|null-align|no-null-align

Take alignment "as-is" or force NULL alignment or NO-NULL alignment (see AlignmentSet.pm documentation).

--help, --?

Prints a help message and exits.

--man

Prints the manual page and exits.

DESCRIPTION

Reorders the Alignment Set so that the sentence pairs of a secondary corpus (given with --cs and --ct) are placed at its head, in the same order as they appear in that corpus.
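
As an illustration of the reordering described above, here is a minimal Python sketch. It is not the script's actual implementation. The function name order_as_corpus, the (source, target, links) tuple layout, and the use of exact string matching to identify sentence pairs are all assumptions made for this example. It also assumes that pairs absent from the secondary corpus keep their original relative order after the head, which the documentation does not state.

```python
from collections import defaultdict, deque


def order_as_corpus(alignment_set, corpus_pairs):
    """Move the entries whose sentence pair occurs in corpus_pairs to the head.

    alignment_set: list of (source, target, links) tuples (hypothetical layout)
    corpus_pairs:  list of (source, target) tuples from the secondary corpus
    Matching is by exact string equality (an assumption of this sketch).
    """
    # Index positions by sentence pair; a queue per key lets repeated
    # sentence pairs be consumed one occurrence at a time.
    index = defaultdict(deque)
    for pos, (src, trg, _links) in enumerate(alignment_set):
        index[(src, trg)].append(pos)

    used = set()
    head = []
    for key in corpus_pairs:
        queue = index.get(key)
        if queue:  # corpus pairs missing from the set are skipped
            pos = queue.popleft()
            used.add(pos)
            head.append(alignment_set[pos])

    # Remaining entries follow in their original relative order (assumption).
    tail = [entry for i, entry in enumerate(alignment_set) if i not in used]
    return head + tail


alignment_set = [
    ("a b", "x y", "1-1 2-2"),
    ("c", "z", "1-1"),
    ("d e", "w", "1-1 2-1"),
]
corpus_pairs = [("d e", "w"), ("a b", "x y")]
reordered = order_as_corpus(alignment_set, corpus_pairs)
```

In this toy input the two pairs found in the secondary corpus move to the front in corpus order, and the unmatched pair ("c", "z") follows them.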

EXAMPLES

perl orderAlSetAsBilCorpus.pl -ist eng2spa.A3.final -if giza -cs align_ref/test.eng.iso -ct align_ref/test.spa.iso -ost eng2spa.reordered -of giza

AUTHOR

Patrik Lambert <lambert@gps.tsc.upc.es>

COPYRIGHT AND LICENSE

Copyright 2005 by Patrik Lambert

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License (version 2 or any later version).