Joerg Tiedemann
and 1 contributor

Documentation

  • convert_treebank - convert a treebank from one format to another
  • coocfreq - count co-occurrence frequencies for arbitrary features of nodes in a parallel treebank (corpus).
  • treealigneval - a simple script for computing precision and recall scores for a tree alignment task
  • sta2moses - a script that converts an aligned parallel treebank to plain text format (Moses/Giza++ format). The alignment has to be in sta (Stockholm Tree Aligner) format.
  • sta2phrases - a script that extracts all aligned phrase pairs from a tree-aligned parallel treebank and writes them to STDOUT in Moses (phrase extract) format.
  • treealign - a simple frontend for training and applying a tree aligner model using Lingua::Align::Trees and a standard binary classifier
  • treebank2moses.pl - a script that converts an aligned parallel treebank to plain text format (Moses/Giza++ format). Sentence alignments have to be stored in OPUS (xces) format.
  • doc::index

Modules

Provides