The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Lingua::Ogmios - Perl extension for configurable Natural Language Processing (NLP) platform

SYNOPSIS

use Lingua::Ogmios;

my $ogmios = Lingua::Ogmios::new();

DESCRIPTION

This module implements the alpha version of the configurable Natural Language Processing (NLP) platform named Ogmios. It provides overall methods for the linguistic annotation of textual documents. Linguistic annotations depend on the configuration variables and dependencies between linguistic steps. This is a new version of the module Alvis::NLPPlatform

The Omgios NLP platform annotates textual documents with existing NLP toos such Part-of-speech taggers (TreeTagger, GeniaTagge, Flemm), term recognizer and term extractor (Lingua::YaTeA). Textual documents are loaded in XML format and internally manipulated through data structures representing the annotation levels: textual elements (tokens, words, sentences), the properties associated with these elements (Part-of-Speech categories, semantics categories) and relations between elements (syntactic, semantic and anaphoric relations).

Each NLP tool is integrated in the platform through a wrapper. Wrappers are specific module which prepares the input for the NLP tool (based on the information in the internal data structures) and parse the output to add computed information to the data structures.

METHODS

XMLout()

    $self->XMLout();

addDocumentCollection

    $self->addDocumentCollection($documentCollection);

getConfig

    $self->getConfig;

getDocumentCollection

    $self->getDocumentCollection;

getTimer

    $self->getTimer;

linguisticProcessing

    $self->linguisticProcessing;

loadData

    $self->loadData($data);

loadDocuments

    $self->loadDocuments($files);

load_config

    $self->load_config($rcfile);

new

    Lingua::Ogmios::new("rcfile" => $rcfile);

printConfig

    $self->printConfig;

printConfigDOT

    $self->printConfigDOT;

tokenisation

    $self->tokenisation;

SEE ALSO

Thierry Hamon et Adeline Nazarenko. "Le développement d'une plate-forme pour l'annotation spécialisée de documents web: retour d'expérience", Traitement Automatique des Langues (TAL). 2008. 49(2). pages 127-154. (the most detailed presentation of the platform but in French)

Thierry Hamon et Adeline Nazarenko et Thierry Poibeau et Sophie Aubin et Julien Derivière "A Robust Linguistic Platform for Efficient and Domain specific Web Content Analysis". Proceedings of RIAO 2007 - Session Poster. 30 may - 1 june 2007. Pittsburgh, USA.

AUTHORS

Thierry Hamon <thierry.hamon@limsi.fr>

LICENSE

Copyright (C) 2013 by Thierry Hamon

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.