Author image Thierry Hamon
and 1 contributors

NAME

Lingua::YaTeA::LexiconItem - Perl extension for representing word

SYNOPSIS

  use Lingua::YaTeA::LexiconItem;
  Lingua::YaTeA::LexiconItem->($form);

DESCRIPTION

The module implements the representation of the word occuring in the lexicon of corpus. The word is described with its inflected form (field $IF), its Part-of-Speech tag (field POS), it lemmatised form (field LF), the size in characters (field LENGTH), its frequency in the corpus (field FREQUENCY).

METHODS

new()

    new($form);

The method creates a new lexicon item from the form $form(the concatenation of the inflected form, the Part-of-Speech tag, and the lemmatised form).

setLF()

    setLF($LF,$IF);

The method set the field LF (lemmatised form). If the $LF is equal to unknown or @card@ (some default lemma from TreeTagger), the inflected form is considered as the lemmatised form.

setLength()

    setLength();

The method computes the size in characters of the inflected form.

incrementFrequency()

    incrementFrequency();

The method increments the frequency of the lexicon item.

getID()

    getID();

The method returns the identifier of the lexicon item.

getIF()

    getIF();

The method returns the inflected form of the lexicon item.

getPOS()

    getPOS();

The method returns the Part-of-Speech of the lexicon item.

getLF()

    getLF();

The method returns the lemmatised form of the lexicon item.

getLength()

    getLength();

The method returns the size, in characters, of the inflected form of the lexicon item.

getFrequency()

    getFrequency()

The method returns the frequency of the lexicon item.

getAny()

    getAny($field);

The method returns the value of the field $field.

isCleaningFrontier()

    isCleaningFrontier($chunking_data);

The method indicates if the lexicon item apprears in one of the cleaning frontier $chunking_data.

isCleaningException()

    isCleaningException($chunking_data);

The method indicates if the lexicon item apprears in one of the cleaning exception $chunking_data.

SEE ALSO

Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.

AUTHOR

Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Thierry Hamon and Sophie Aubin

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.