NAME

Lingua::YaTeA::Island - Perl extension for island of reliability

SYNOPSIS

  use Lingua::YaTeA::Island;
  Lingua::YaTeA::Island->new($index,$type,$source);

DESCRIPTION

This module represents an island of reliability and provides related methods for manipulating it. An island of reliability is a subsequence (contiguous or not) of a Maximal Noun Phrase (MNP) that corresponds to a shorter term candidate parsed during the first step of the parsing process.

An island is defined by a list of parsed phrases (i.e. the sequence of Part-of-Speech tags) corresponding to the current island (field SOURCE), the index set for the parsed phrase corresponding to the current island (field INDEX_SET), and the origin of the island (field TYPE; the value is endogenous if the island comes from the parsing of the current text, exogenous if it comes from an input resource or a previous text parsing).

An identifier (recorded in the field ID) is associated with the island. Whether the island is used in the parsing of a wider parsed phrase or island is recorded in the field INTEGRATED (the default value is 0).
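As a rough illustration of the fields described above, an island can be pictured as a plain hash. This is only a sketch: the helper make_island, the hash layout, the counter used for ID, and the sample values (positions 1, 2, 4 and the tag string 'NN IN NN') are assumptions made for illustration and are not taken from the module's source.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT part of Lingua::YaTeA::Island: it builds a plain
# hash that mirrors the documented fields.
my $next_id = 0;

sub make_island {
    my ($index, $type, $source) = @_;
    die "type must be 'endogenous' or 'exogenous'\n"
        unless $type eq 'endogenous' || $type eq 'exogenous';
    return {
        ID         => $next_id++,    # assumed: a simple running counter
        INDEX_SET  => [ @$index ],   # word positions covered by the island
        TYPE       => $type,         # origin of the island
        SOURCE     => $source,       # made-up Part-of-Speech sequence
        INTEGRATED => 0,             # documented default value
    };
}

my $island = make_island([1, 2, 4], 'endogenous', 'NN IN NN');
printf "island %d (%s) covers positions %s\n",
    $island->{ID}, $island->{TYPE}, join(',', @{ $island->{INDEX_SET} });
```

The positions 1, 2, 4 show a non-contiguous island: position 3 lies inside the span but is not part of the island.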

METHODS

new()

    new($index,$type,$source);

The method creates a new island. $source is the list of parsed phrases (i.e. the concatenation of Part-of-Speech tags or the key of the phrases) corresponding to the island. $index is the index set for the parsed phrase corresponding to the current island. $type is the origin of the island (endogenous if the island comes from the parsing of the current text, exogenous if it comes from an input resource or a previous text parsing).

getIndexSet()

    getIndexSet();

The method returns the index set for the parsed phrase corresponding to the current island.

getType()

    getType();

The method returns the origin of the island (endogenous or exogenous).

getParsingMethod()

    getParsingMethod();

The method returns the parsing method associated with the parsed phrase corresponding to the island of reliability.

getIF()

    getIF();

The method returns the inflected form of the parsed phrase corresponding to the island of reliability.

getSource()

    getSource();

The method returns the parsed phrase (i.e. the sequence of Part-of-Speech tags) corresponding to the island of reliability.

getID()

    getID();

The method returns the identifier of the island.

importNodeSets()

    importNodeSets();

The method returns a copy of the node sets corresponding to the island. It also updates the index set of the island.

gapSize()

    gapSize();

The method returns the number of words that lie within the word sequence delimited by the island but do not belong to the island.
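The gap size can be pictured with a small stand-alone computation over word positions. This is a sketch under stated assumptions, not the module's implementation: the function gap_size is hypothetical, it takes a plain list of distinct integer positions rather than an island object, and the sample positions are invented.

```perl
use strict;
use warnings;
use List::Util qw(min max);

# Hypothetical stand-alone function; the module's own gapSize() is a method
# on an island object and may be implemented differently.
# Assumes the positions are distinct integers.
sub gap_size {
    my @indexes = @_;
    return 0 unless @indexes;
    my $span = max(@indexes) - min(@indexes) + 1;   # words delimited by the island
    return $span - scalar(@indexes);                # minus the words in the island
}

# Positions 2, 3 and 6 delimit the span 2..6 (5 words), 3 of them in the island.
print gap_size(2, 3, 6), "\n";
```

A contiguous island, such as positions 4, 5, 6, has a gap size of 0 under this definition.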

print()

    print($fh);

The method prints the island to the stream $fh.
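Because the method takes a stream as argument, the same call can target standard output or a string buffer. The sketch below only illustrates that filehandle-passing pattern in plain Perl: print_island is a hypothetical stand-in, the hash layout is assumed, and the output format is invented rather than the one the module produces.

```perl
use strict;
use warnings;

# Hypothetical stand-in for an island's print method; the output format
# shown here is made up for the example.
sub print_island {
    my ($island, $fh) = @_;
    print {$fh} "island ", join(',', @{ $island->{INDEX_SET} }),
        " (", $island->{TYPE}, ")\n";
}

my %island = (INDEX_SET => [2, 3, 6], TYPE => 'exogenous');

# Write to standard output.
print_island(\%island, \*STDOUT);

# Capture the same text in a string through an in-memory filehandle.
my $buffer = '';
open(my $mem, '>', \$buffer) or die "cannot open in-memory handle: $!";
print_island(\%island, $mem);
close($mem);
```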

SEE ALSO

Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.

AUTHOR

Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Thierry Hamon and Sophie Aubin

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.