The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

visualise_alSet-version.pl - Displays the aligned sentence pairs as a links enumeration or matrix

SYNOPSIS

perl visualise_alSet-version.pl [options] required_arguments

Required arguments:

        -ist FILENAME    Input source-to-target links file
        -is FILENAME    Input source words file (not applicable in GIZA format)
        -it FILENAME    Input target words file (not applicable in GIZA format)
        -if BLINKER|GIZA|NAACL    Input file(s) format (required if not TALP)

Options:

        -rep enumLinks|matrix|drawLines    Type of visual representation (default: matrix)
        -format text|latex    Format of the output (default: latex)
        -its FILENAME Input target-to-source links file
        -range BEGIN-END    Input Alignment Set range
        -alignMode as-is|null-align|no-null-align    Alignment mode
        -mark STRING    How a link is marked in the matrix representation
        -maxRows INTEGER Maximum number of rows allowed in the matrix
        -maxCols INTEGER Maximum number of columns allowed in the matrix
        -help|?    Prints the help and exits
        -man    Prints the manual and exits

ARGUMENTS

--ist,--i_st,--i_sourceToTarget FILENAME

Input source-to-target (i.e. links) file name (or directory, in case of BLINKER format)

--is,--i_s,--i_source FILENAME

Input source (words) file name. Not applicable in GIZA Format.

--it,--i_t,--i_target FILENAME

Input target (words) file name. Not applicable in GIZA Format.

--if,--i_format BLINKER|GIZA|NAACL

Input Alignment Set format (required if different from default, TALP).

OPTIONS

--rep, --representation enumLinks|matrix|drawLines

Type of visual represention (cf documentation for the AlignmentSet.pm module). Note that 'drawLines' representation is not available yet.

--format text|latex

Format of the output. If representation=matrix, format must be 'latex'. In this case, the latex output is best seen with a ps viewer (instead of a dvi viewer).

--its,--i_ts,--i_targetToSource FILENAME

Input target-to-source (i.e. links) file name (or directory, in case of BLINKER format)

--range BEGIN-END

Range of the input source-to-target file (BEGIN and END are the sentence pair numbers)

--alignMode as-is|no-null-align|null-align

Take alignment "as-is" or force NULL alignment or NO-NULL alignment (see AlignmentSet.pm documentation).

--mark STRING

Defines how a link is marked in the matrix.Common values are 'cross', 'ambiguity', 'confidence' (cf AlignmentSet.pm documentation). You can also write a latex-compatible mark, such as '$\blacksquare$'.

--maxRows INTEGER

The maximum number of rows (source words) allowed in a matrix. If the sentence pair contains more, the alignment is displayed as 'enumLinks' representation.

--maxCols INTEGER

The maximum number of columns (target words) allowed in a matrix. If the sentence pair contains more, the matrix is continued below.

--help, --?

Prints a help message and exits.

--man

Prints a help message and exits.

DESCRIPTION

Displays the aligned sentence pairs as a links enumeration or matrix. The command-line utility has been made for convenience. For full details, see the documentation of the AlignmentSet.pm module.

EXAMPLES

Visualising as an enumeration of links, in text format, the first 10 sentence pairs of a GIZA file:

perl visualise_alSet-version.pl -ist test-giza.spa2eng.giza -if=GIZA -range=-10 -rep enumLinks -format text

Visualising as a matrix the first 10 sentence pairs in a NAACL file, with a personalized mark (black squares), and redirecting the ouput to a .tex file:

perl visualise_alSet-version.pl -ist test-giza.spa2eng.naacl -is test.spa.naacl -it test.eng.naacl -rep matrix -format latex -range -10 -mark '$\blacksquare$' > matrix.tex

AUTHOR

Patrik Lambert <lambert@gps.tsc.upc.edu>

COPYRIGHT AND LICENSE

Copyright 2004 by Patrick Lambert

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License (version 2 or any later version).

3 POD Errors

The following errors were encountered while parsing the POD:

Around line 128:

You forgot a '=back' before '=head1'

Around line 130:

'=item' outside of any '=over'

Around line 174:

You forgot a '=back' before '=head1'