NAME

visualise_alSet-version.pl - Displays aligned sentence pairs as an enumeration of links or as a matrix

SYNOPSIS

perl visualise_alSet-version.pl [options] required_arguments

Required arguments:

        -ist FILENAME    Input source-to-target links file
        -is FILENAME    Input source words file (not applicable in GIZA format)
        -it FILENAME    Input target words file (not applicable in GIZA format)
        -if BLINKER|GIZA|NAACL    Input file(s) format (required if not TALP)

Options:

        -rep enumLinks|matrix|drawLines    Type of visual representation (default: matrix)
        -format text|latex    Format of the output (default: latex)
        -its FILENAME    Input target-to-source links file
        -range BEGIN-END    Input Alignment Set range
        -alignMode as-is|null-align|no-null-align    Alignment mode
        -mark STRING    How a link is marked in the matrix representation
        -maxRows INTEGER    Maximum number of rows allowed in the matrix
        -maxCols INTEGER    Maximum number of columns allowed in the matrix
        -help|?    Prints the help and exits
        -man    Prints the manual and exits

ARGUMENTS

--ist,--i_st,--i_sourceToTarget FILENAME

Input source-to-target links file name (or directory name, for the BLINKER format).

--is,--i_s,--i_source FILENAME

Input source words file name. Not applicable in GIZA format.

--it,--i_t,--i_target FILENAME

Input target words file name. Not applicable in GIZA format.

--if,--i_format BLINKER|GIZA|NAACL

Input Alignment Set format (required if different from default, TALP).

OPTIONS

--rep, --representation enumLinks|matrix|drawLines

Type of visual representation (cf. the documentation of the AlignmentSet.pm module). Note that the 'drawLines' representation is not available yet.

--format text|latex

Format of the output. With the matrix representation, the format must be 'latex'. The resulting LaTeX output is best viewed with a PostScript viewer rather than a DVI viewer.

--its,--i_ts,--i_targetToSource FILENAME

Input target-to-source links file name (or directory name, for the BLINKER format).

--range BEGIN-END

Range of sentence pairs to read from the input source-to-target file (BEGIN and END are sentence pair numbers).
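
The BEGIN-END syntax can be illustrated with a small shell sketch. The helper name parse_range is hypothetical and is not part of the script or of AlignmentSet.pm; it only shows how a range string splits at the hyphen, including the form -10 used in the examples of this manual, where BEGIN is omitted. Printing an omitted bound as "open" is a choice made for this sketch, not documented behaviour.

```shell
# Hypothetical helper (not part of visualise_alSet-version.pl):
# split a BEGIN-END range string into its two bounds.
# An omitted bound is reported as "open"; this label is an assumption
# made for illustration only.
parse_range() {
    range=$1
    case $range in
        *-*) ;;                                   # must contain a hyphen
        *) echo "invalid range: $range" >&2; return 1 ;;
    esac
    begin=${range%%-*}   # text before the first hyphen
    end=${range#*-}      # text after the first hyphen
    echo "begin=${begin:-open} end=${end:-open}"
}

parse_range 5-20
parse_range -10
```

In this sketch, 5-20 yields both bounds, while -10 leaves BEGIN open, which matches the way the examples use -range -10 to select the first 10 sentence pairs.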

--alignMode as-is|no-null-align|null-align

Takes the alignment as is ("as-is"), or forces NULL alignment ("null-align") or NO-NULL alignment ("no-null-align"). See the AlignmentSet.pm documentation.

--mark STRING

Defines how a link is marked in the matrix. Common values are 'cross', 'ambiguity' and 'confidence' (cf. the AlignmentSet.pm documentation). You can also supply a LaTeX-compatible mark, such as '$\blacksquare$'.

--maxRows INTEGER

The maximum number of rows (source words) allowed in a matrix. If the sentence pair contains more, the alignment is displayed using the 'enumLinks' representation instead.

--maxCols INTEGER

The maximum number of columns (target words) allowed in a matrix. If the sentence pair contains more, the matrix is continued below.
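
As a rough illustration of what "continued below" implies, the following shell sketch lists the column blocks that a sentence with a given number of target words would be cut into. The helper name column_blocks is hypothetical, and the assumption that blocks are filled left to right with exactly maxCols columns each (except the last) is made for this sketch; the actual layout is decided by AlignmentSet.pm.

```shell
# Hypothetical helper (not part of visualise_alSet-version.pl):
# list the column blocks for N target words when at most MAX columns
# fit in one matrix block. Assumes blocks are filled left to right.
column_blocks() {
    n=$1
    max=$2
    if [ "$max" -lt 1 ]; then
        echo "maxCols must be at least 1" >&2
        return 1
    fi
    start=1
    while [ "$start" -le "$n" ]; do
        stop=$((start + max - 1))
        if [ "$stop" -gt "$n" ]; then
            stop=$n                  # last block may be narrower
        fi
        echo "columns $start-$stop"
        start=$((stop + 1))
    done
}

column_blocks 25 10
```

Under these assumptions, a sentence pair with 25 target words and a limit of 10 columns is split into three blocks, the last one holding the remaining 5 columns.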

--help, --?

Prints a help message and exits.

--man

Prints the manual page and exits.

DESCRIPTION

Displays aligned sentence pairs as an enumeration of links or as a matrix. This command-line utility is provided as a convenient front end; for full details, see the documentation of the AlignmentSet.pm module.

EXAMPLES

Visualising as an enumeration of links, in text format, the first 10 sentence pairs of a GIZA file:

perl visualise_alSet-version.pl -ist test-giza.spa2eng.giza -if=GIZA -range=-10 -rep enumLinks -format text

Visualising as a matrix the first 10 sentence pairs of a NAACL file, with a custom mark (black squares), and redirecting the output to a .tex file:

perl visualise_alSet-version.pl -ist test-giza.spa2eng.naacl -is test.spa.naacl -it test.eng.naacl -rep matrix -format latex -range -10 -mark '$\blacksquare$' > matrix.tex

AUTHOR

Patrik Lambert <lambert@gps.tsc.upc.edu>

COPYRIGHT AND LICENSE

Copyright 2004 by Patrik Lambert

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License (version 2 or any later version).
