Changes for version 0.02

  • some kind of pre-release with basic documentation and lots of experimental stuff

Documentation

convert a treebank from one format to another
count co-occurrence frequencies for arbitrary features of nodes in a parallel treebank (corpus).
a simple script for computing precision and recall scores for a tree alignment task
a script that converts an aligned parallel treebank to plain text format (Moses/Giza++ format). Alignment has to be in sta (Stockholm Tree Aligner) format.
This script extracts all aligned phrase pairs from a tree-aligned parallel treebank. The output is written to STDOUT in Moses (phrase extract) format.
a simple frontend for training and applying a tree aligner model using Lingua::Align::Trees and a standard binary classifier
a simple script for computing precision and recall scores for a tree alignment task
a script that converts an aligned parallel treebank to plain text format (Moses/Giza++ format). Sentence alignments have to be stored in OPUS (xces) format.
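The precision and recall scores mentioned in the script descriptions above can be illustrated with a minimal sketch. It is written in Python for illustration only and is not the distribution's own code: the function name `precision_recall` and the representation of alignment links as (source node, target node) pairs are assumptions, and the actual scripts may use a different scoring scheme.

```python
def precision_recall(predicted, gold):
    """Score predicted alignment links against gold-standard links.

    Both arguments are iterables of (source_id, target_id) pairs.
    This link representation is an assumption made for the example.
    """
    predicted, gold = set(predicted), set(gold)
    correct = len(predicted & gold)  # links found in both sets
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


if __name__ == "__main__":
    predicted = [("s1", "t1"), ("s2", "t2"), ("s3", "t4")]
    gold = [("s1", "t1"), ("s2", "t2"), ("s3", "t3"), ("s4", "t4")]
    p, r = precision_recall(predicted, gold)
    print(f"precision={p:.3f} recall={r:.3f}")
```

Two of the three predicted links are correct here, and two of the four gold links are recovered.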

Modules

Perl extension for Munkres' solution to the classical assignment problem for square and rectangular matrices. This module extends the solution for square matrices to rectangular matrices by padding: a rectangular matrix is converted to a square matrix by adding the necessary zeros.
Perl modules for the alignment of parallel corpora
A virtual Perl module that links to the local classifier that will be used for the alignment
A Perl module that prepares data instances for training a MaxEnt classifier with megam (http://www.cs.utah.edu/~hal/megam/) and that calls megam for classification
Perl extension for reading a tokenized plain text corpus, 1 sentence per line; can also be used as a virtual module to open other types of corpora (treebanks etc) using the "-type" attribute
Perl extension for reading plain text corpus files with token factors separated by '|' (Moses style); 1 sentence per line
Virtual Perl extension for reading parallel sentence-aligned corpora; default format is plain text (see Lingua::Align::Corpus::Parallel::Bitext)
Perl extension for reading a simple parallel corpus (two corpus files, one for the source language, one for the target language); the text on corresponding lines is aligned
Perl extension for reading tree-aligned parallel corpora in Dublin Subtree Aligner format
Perl extension to read Viterbi word alignments produced by GIZA++
Perl extension to read sentence-aligned parallel corpora in Moses format
Perl extension to read sentence-aligned parallel corpora in OPUS format (http://www.let.rug.nl/~tiedeman/OPUS/)
Perl extension for reading sentence-aligned parallel corpora aligned by ID
Perl extension to read tree-aligned parallel corpora in STockholm Tree Aligner Format
Perl extension to read sentence-aligned parallel corpora in Moses format
Virtual Perl extension to read treebanks; default format = Penn Treebank format
Perl extension to read treebanks in Alpino Treebank format http://www.let.rug.nl/vannoord/alp/
Perl extension to read treebanks in Penn Treebank format
Perl extension to read treebanks in Penn Treebank format
Perl extension to read treebanks in Penn Treebank format including dependency relations produced by the Stanford parser
Perl extension to read treebanks in TigerXML format
Perl modules for feature extraction for the Lingua::Align::Trees tree aligner
Virtual Perl extension for tree alignment (alignment inference using local classification scores as alignment costs)
A Perl extension that calls Algorithm::Munkres to assign links between tree nodes (Kuhn-Munkres algorithm http://en.wikipedia.org/wiki/Hungarian_algorithm)
Perl modules for Yet Another Discriminative Word Aligner
Perl modules for Yet Another Discriminative Word Aligner
Perl modules for Yet Another Discriminative Word Aligner
Perl modules for Yet Another Discriminative Word Aligner
Perl modules for Yet Another Discriminative Word Aligner
Perl modules for Yet Another Discriminative Word Aligner
Perl modules for Yet Another Discriminative Word Aligner
Perl modules for Yet Another Discriminative Word Aligner
Perl modules implementing a discriminative tree aligner
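The zero-padding step described for the Munkres module above (turning a rectangular cost matrix into a square one) can be sketched as follows. This is an illustrative Python sketch, not the module's Perl implementation or its API; the function name `pad_to_square` is hypothetical.

```python
def pad_to_square(matrix):
    """Return a square copy of a rectangular cost matrix, padded with zeros.

    Missing columns are appended to every row, and missing rows are
    appended at the bottom, so the result is size x size where
    size = max(number of rows, number of columns).
    """
    rows = len(matrix)
    cols = max((len(row) for row in matrix), default=0)
    size = max(rows, cols)
    # Pad each existing row out to the target width.
    padded = [list(row) + [0] * (size - len(row)) for row in matrix]
    # Add all-zero rows until the matrix is square.
    padded.extend([0] * size for _ in range(size - rows))
    return padded


if __name__ == "__main__":
    for row in pad_to_square([[1, 2, 3], [4, 5, 6]]):
        print(row)
```

An assignment solver for square matrices can then be run on the padded matrix; assignments that fall on a padded row or column correspond to items left unassigned in the original rectangular problem.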

Provides

in lib/Lingua/Align/Classifier/Diagonal.pm
in lib/Lingua/Align/Classifier/Clues.pm
in lib/Lingua/Align/Classifier/LibSVM.pm
in lib/Lingua/Align/Features/Alignment.pm
in lib/Lingua/Align/Features/Cooccurrence.pm
in lib/Lingua/Align/Features/History.pm
in lib/Lingua/Align/Features/Lexical.pm
in lib/Lingua/Align/Features/Orthography.pm
in lib/Lingua/Align/Features/Tree.pm
in lib/Lingua/Align/Words.pm