sta2moses - convert from Stockholm Tree Aligner format to Moses/GIZA++ (plain text)
sta2moses alignments.xml
This script reads a parallel treebank through its tree alignment file (alignments.xml) and produces sentence-aligned plain text files for use with Moses/GIZA++. The corpus is stored in alignments.src and alignments.trg.
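A quick sanity check after conversion is to confirm that the two output files have the same number of lines. The sketch below assumes the usual Moses convention of one sentence per line; check_parallel is a hypothetical helper name and is not part of sta2moses.

```shell
# Hypothetical helper (not part of sta2moses): compare the line counts
# of two files that are supposed to be sentence-aligned.
check_parallel() {
    # Both files must exist and be readable.
    if [ ! -r "$1" ] || [ ! -r "$2" ]; then
        echo "cannot read $1 or $2" >&2
        return 2
    fi
    # $(( ... )) strips the padding some wc implementations add.
    src_lines=$(( $(wc -l < "$1") ))
    trg_lines=$(( $(wc -l < "$2") ))
    if [ "$src_lines" -eq "$trg_lines" ]; then
        echo "ok: $src_lines sentence pairs"
    else
        echo "mismatch: $1 has $src_lines lines, $2 has $trg_lines" >&2
        return 1
    fi
}

# Intended use, after running "sta2moses alignments.xml":
# check_parallel alignments.src alignments.trg
```

A mismatch would suggest that the files are not line-parallel, which Moses and GIZA++ training scripts generally require.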
Lingua::Align::Corpus
Joerg Tiedemann
Copyright (C) 2009 by Joerg Tiedemann
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.8 or, at your option, any later version of Perl 5 you may have available.
To install Lingua::Align, copy and paste the appropriate command into your terminal.
cpanm
cpanm Lingua::Align
CPAN shell
perl -MCPAN -e shell
install Lingua::Align