NAME

symmetrise_alSet-version.pl - symmetrises an alignment set

SYNOPSIS

perl symmetrise_alSet-version.pl [options] required_arguments

Required arguments:

        -ist FILENAME    Input source-to-target links file
        -its FILENAME    Input target-to-source links file
        -if BLINKER|GIZA|NAACL    Input file(s) format (required if not TALP)
        -of BLINKER|GIZA|NAACL    Output file(s) format (required if not TALP)
        -g    Directory containing the groups file

Options:

        -is FILENAME    Input source words file
        -it FILENAME    Input target words file
        -os FILENAME    Output source words file
        -ot FILENAME    Output target words file
        -ost FILENAME    Output source-to-target links file
        -ots FILENAME    Output target-to-source links file
        -range BEGIN-END    Input Alignment Set range
        -alignMode as-is|null-align|no-null-align    Alignment mode
        -v INTEGER    Verbosity level (default: 0, i.e. silent)
        -extract INTEGER    Performs the extraction (of mw appearing >= INTEGER times) instead of the symmetrisation
        -action union|intersection     Action undertaken if no grouping is done (default: intersection)
        -help|?    Prints the help and exits
        -man    Prints the manual and exits
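
The -action option above names two ways of combining the two directional link files. As a rough illustration of what union and intersection mean for alignment links, here is a minimal Python sketch. It is not the script's implementation: the function name symmetrise, the representation of a link as a pair of word positions, and the assumption that target-to-source links are stored in (target, source) order are all hypothetical choices made for this example.

```python
# Illustrative sketch only; not the script's actual implementation.
# Assumption: a link is a pair of word positions within one sentence pair.


def symmetrise(st_links, ts_links, action="intersection"):
    """Combine two directional link sets for a single sentence pair.

    st_links: links of the source-to-target alignment, as (source, target).
    ts_links: links of the target-to-source alignment, as (target, source).
    Returns a set of (source, target) links.
    """
    # Flip the target-to-source links so both sets use (source, target) order.
    flipped = {(source, target) for (target, source) in ts_links}
    st = set(st_links)
    if action == "intersection":
        # Keep only the links found in both directions.
        return st & flipped
    if action == "union":
        # Keep every link found in at least one direction.
        return st | flipped
    raise ValueError(f"unknown action: {action!r}")


st = {(1, 1), (2, 3), (3, 2)}   # (source, target) pairs
ts = {(1, 1), (3, 2), (4, 4)}   # (target, source) pairs

print(sorted(symmetrise(st, ts, "intersection")))
print(sorted(symmetrise(st, ts, "union")))
```

The intersection keeps fewer links, each confirmed by both directions, while the union keeps more links at the cost of including those seen in only one direction.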

ARGUMENTS

--ist,--i_st,--i_sourceToTarget FILENAME

Input source-to-target (i.e. links) file name (or directory, in case of BLINKER format)

--if,--i_format BLINKER|GIZA|NAACL

Input Alignment Set format (required if different from default, TALP).

--of,--o_format BLINKER|GIZA|NAACL

Output (new) Alignment Set format (required if different from default, TALP).

OPTIONS

--is,--i_s,--i_source FILENAME

Input source (words) file name. Not applicable in GIZA Format.

--it,--i_t,--i_target FILENAME

Input target (words) file name. Not applicable in GIZA Format.

--its,--i_ts,--i_targetToSource FILENAME

Input target-to-source (i.e. links) file name (or directory, in case of BLINKER format)

--range BEGIN-END

Range of the input source-to-target file (BEGIN and END are the sentence pair numbers)
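
To make the BEGIN-END notation concrete, the following Python sketch shows one way such a range string could be interpreted. It is only an illustration: the helper names parse_range and select_pairs are hypothetical, and the sketch assumes that sentence pairs are numbered from 1 and that END is inclusive, neither of which is stated here.

```python
# Illustrative sketch only; parse_range and select_pairs are hypothetical helpers.


def parse_range(spec):
    """Split a 'BEGIN-END' string into two integers."""
    begin_text, separator, end_text = spec.partition("-")
    if not separator:
        raise ValueError(f"expected BEGIN-END, got {spec!r}")
    begin, end = int(begin_text), int(end_text)
    if begin > end:
        raise ValueError(f"BEGIN must not exceed END in {spec!r}")
    return begin, end


def select_pairs(pairs, spec):
    """Return the sentence pairs covered by the range.

    Assumption: pairs are numbered from 1 and END is inclusive.
    """
    begin, end = parse_range(spec)
    return pairs[begin - 1:end]


sentence_pairs = ["pair1", "pair2", "pair3", "pair4", "pair5"]
print(select_pairs(sentence_pairs, "2-4"))
```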

--os,--o_s,--o_source FILENAME

Output (new format) source (words) file name. Not applicable in GIZA Format.

--ot,--o_t,--o_target FILENAME

Output (new format) target (words) file name. Not applicable in GIZA Format.

--ost,--o_st,--o_sourceToTarget FILENAME

Output (new format) source-to-target (i.e. links) file name (or directory, in case of BLINKER format). Required when doing symmetrisation.

--ots,--o_ts,--o_targetToSource FILENAME

Output (new format) target-to-source (i.e. links) file name (or directory, in case of BLINKER format)

--alignMode as-is|null-align|no-null-align

Take alignment "as-is" or force NULL alignment or NO-NULL alignment (see AlignmentSet.pm documentation).

--help, --?

Prints a help message and exits.

--man

Prints the manual and exits.

AUTHOR

Patrik Lambert <lambert@gps.tsc.upc.edu>

COPYRIGHT AND LICENSE

Copyright 2004 by Patrik Lambert

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License (version 2 or any later version).
