Lingua::YaTeA::TagSet - Perl extension for managing the set of Part-of-Speech tags and inflected forms that can be accepted in the terms.
use Lingua::YaTeA::TagSet;
Lingua::YaTeA::TagSet->new();
This module provides methods for managing the set of Part-of-Speech tags or inflected forms that can be accepted in the terms to extract. These lists of tags or inflected forms are used by the Parsing Patterns during the syntactic analysis of the maximal noun phrases. This information is generally stored in the TagSet configuration file. Four classes are defined: CANDIDATES (the content words), DETERMINERS (the determiner words), PREPOSITIONS (the preposition words) and COORDINATIONS (the coordination words).
$file = "/home/thierry/YaTeAconfig/EN/TagSet"; new($file);
This method creates an object with 5 fields (CANDIDATES, PREPOSITIONS, DETERMINERS, COORDINATIONS and ANY) and loads the tags contained in the file $file. The field ANY contains all the tags. Each field designates a class of tags and is a hash table.
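The layout described above (one hash table per class, plus ANY holding every tag) can be pictured with plain Perl hashes. The following is only a minimal sketch under that description: the helpers add_tag and exist_tag and the example tags are hypothetical illustrations, not the module's own code.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the object described above: one hash per class,
# plus ANY, which collects every tag regardless of its class.
my %tagset;
$tagset{$_} = {} for qw(CANDIDATES PREPOSITIONS DETERMINERS COORDINATIONS ANY);

# Illustrative helper (not the module's addTag): record a tag in its class
# and in ANY.
sub add_tag {
    my ($set, $subset, $tag) = @_;
    die "unknown class: $subset\n" unless exists $set->{$subset};
    $set->{$subset}{$tag} = 1;
    $set->{ANY}{$tag}     = 1;
    return;
}

# Illustrative helper (not the module's existTag): 1 if present, 0 otherwise.
sub exist_tag {
    my ($set, $subset, $tag) = @_;
    return exists $set->{$subset}{$tag} ? 1 : 0;
}

# Example tags are assumptions chosen for illustration.
add_tag(\%tagset, 'CANDIDATES',   $_) for qw(NN NNS JJ);
add_tag(\%tagset, 'PREPOSITIONS', 'IN');

print exist_tag(\%tagset, 'CANDIDATES', 'NN'), "\n";    # prints 1
print exist_tag(\%tagset, 'CANDIDATES', 'IN'), "\n";    # prints 0
print exist_tag(\%tagset, 'ANY',        'IN'), "\n";    # prints 1
```

Keeping each class as a hash makes the membership test a single key lookup, which is presumably why the fields are described as hash tables.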
loadTags($file);
This method opens the file $file and loads all the tagsets.
addTag($subset,$tag);
This method stores the tag $tag in the class $subset.
getSubset($subset);
This method returns the field of the current object that contains tags of the class $subset.
getTagList($subset);
This method returns all the tags of the class $subset.
existTag($subset,$tag);
This method indicates whether the tag $tag exists in the subset of tags $subset: it returns 1 if the tag exists and 0 otherwise. The field of the object corresponding to the structure is set.
parseSubset($line);
This method parses a line containing the definition of a subset of tags. It sets the corresponding field in the object calling the method. If the line is badly formatted, the method dies.
makeALL($subset, \@tags);
The method sets the field ALL of the subset $subset from the tags contained in the array \@tags, given by reference. This field is a string containing a regex in which each tag is an alternative.
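As an illustration of a regex string in which each tag is an alternative, the sketch below joins a list of tags with "|" and stores the result under an ALL key. The helper make_all, the use of a plain hash for the subset, and the example tags are all assumptions made for this example; the module's makeALL may differ in its details.

```perl
use strict;
use warnings;

# Hypothetical helper, not the module's makeALL: join the tags into one
# alternation string and store it under the ALL key of the subset.
sub make_all {
    my ($subset, $tags) = @_;
    # quotemeta escapes characters such as "$" (as in the tag "PRP$")
    # that would otherwise be read as regex syntax.
    $subset->{ALL} = join '|', map { quotemeta($_) } @$tags;
    return $subset->{ALL};
}

my %candidates;
make_all(\%candidates, [qw(NN NNS JJ PRP$)]);
print "$candidates{ALL}\n";

# Anchoring the alternation makes the test match whole tags only.
for my $tag (qw(NNS PRP$ VB)) {
    my $ok = $tag =~ /^(?:$candidates{ALL})$/ ? 1 : 0;
    printf "%s => %d\n", $tag, $ok;
}
```

Escaping each tag before joining is a defensive choice in this sketch, so that a tag containing regex metacharacters is still matched literally.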
sort(\@tags);
The method returns a regex in which each element of the array given by reference is an alternative; the elements are sorted.
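The description does not say which order is used. The sketch below assumes a longest-first order (then alphabetical), a common choice because Perl tries alternatives from left to right, so a short tag listed first can shadow a longer one that starts with it. The helper sorted_alternation and the example tags are hypothetical.

```perl
use strict;
use warnings;

# Hypothetical helper, not the module's sort: order the tags longest first,
# then alphabetically (this order is an assumption), and join them as
# alternatives.
sub sorted_alternation {
    my ($tags) = @_;
    my @ordered = sort { length($b) <=> length($a) or $a cmp $b } @$tags;
    return join '|', map { quotemeta($_) } @ordered;
}

my $re = sorted_alternation([qw(NN NNS JJ JJR)]);
print "$re\n";       # prints JJR|NNS|JJ|NN

# With the longer tag first, an unanchored match captures the whole tag.
my ($hit) = 'NNS' =~ /($re)/;
print "$hit\n";      # prints NNS

# With the shorter tag first, the same input is only partially matched.
my ($naive) = 'NNS' =~ /(NN|NNS)/;
print "$naive\n";    # prints NN
```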
Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.
Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>
Copyright (C) 2005 by Thierry Hamon and Sophie Aubin
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.