doc::index River stage zero No dependents

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC

Lingua::Align - Perl modules for the alignment of parallel corpora River stage zero No dependents

Lingua::Align contains modules for automatic tree alignment based on discriminative classification and alignment inference. More details about the tree aligner can be found in Lingua::Align::Trees. The following gives a general overview and motivatio...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC

Lingua::Align::Trees - Perl modules implementing a discriminative tree aligner River stage zero No dependents

This module implements a discriminative tree aligner based on binary classification. Alignment features are extracted for each candidate node pair to be used in a standard binary classifier. As a default we use a MaxEnt learner using a log-linerar co...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC

Lingua::Align::Features - Feature extraction for tree alignment River stage zero No dependents

Extract features from a pair of nodes from two given syntactic trees (source and target language). The trees should be complex hash structures as produced by Lingua::Align::Corpus::Treebank::TigerXML. The returned features are given as simple key-val...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC

coocfreq - count co-occurrence frequencies for arbitrary features of nodes in a parallel treebank River stage zero No dependents

This script counts frequencies and co-occurrence frequencies of source and target language features. It runs through the sentence aligned treebank and combines all node pairs. Note that co-occurrence frequencies in a sentence are " max( srcfreq(srcfe...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC

treealign - training tree alignment classifiers and aligning syntactic trees River stage zero No dependents

This script allows you to train a tree alignment model and to apply them to parallel treebanks. Tree alignment is based on local binary classification and rich feature sets. Currently, training data has to be in Stockholm Tree Aligner format. The out...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC
6 results (0.048 seconds)