Author image Thierry Hamon
and 1 contributors

NAME

Lingua::YaTeA::ForbiddenStructureSet - Perl extension for managing the forbiddent structures.

SYNOPSIS

  use Lingua::YaTeA::ForbiddenStructureSet;
  Lingua::YaTeA::ForbiddenStructureSet->new();

DESCRIPTION

This module gathers forbidden structures used while the chunking step. The set of forbidden structures is composed of five fields:

  • ANY

    It lists the strings corresponding to forbidden structures used in ANY position of a chunk

  • START

    It lists the strings corresponding to forbidden structures used in the START position of a chunk

  • END

    It lists the strings corresponding to forbidden structures used in the END position of a chunk

  • START_TRIGGERS

    This field contains the triggers defining the beginning of the forbidden structure.

  • END_TRIGGERS

    This field contains the triggers defining the end of the forbidden structure.

METHODS

new()

    new($file_path)

The method creates a forbidden structure set and loads the forbidden structures from the file $file_path.

loadStructures()

    loadStructures($file_path);

The method loads the forbidden structures from the file $file_path, and set the triggers.

getTriggerSet()

    getTriggerSet($position);

This methid returns the trigger set that can be used in the position ANY, START or END.

getSubset()

    getSubset($name);

This methid returns the forbidden structure subset according the given position $name (i.e. ANY, START or END).

cleanInfos()

    cleanInfos($infos_a);

The internal method is used to chomp the informations read in the file. infos_a is the reference to the array containing the information related to a forbidden structure.

sort()

    sort($subset);

the method sorts the forbidden structure subset according their length.

SEE ALSO

Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.

AUTHOR

Thierry Hamon <thierry.hamon@lipn.univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Thierry Hamon and Sophie Aubin

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.