
NAME

Lingua::YaTeA::LexiconItem - Perl extension for managing the lexicon of the corpus.

SYNOPSIS

  use Lingua::YaTeA::LexiconItem;
  Lingua::YaTeA::LexiconItem->new();

DESCRIPTION

The module manages the lexicon of the corpus, i.e. the list of the words appearing in the corpus. Each word, or lexicon unit, is stored in the field ITEM (a reference to a hash table). The key of a lexicon unit is the concatenation of its inflected form, its Part-Of-Speech tag, and its lemmatized form.
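As an illustration of the storage scheme described above, here is a minimal, self-contained sketch that does not use the module itself. The helper build_key, the "~" separator, and the fields of the stored unit are assumptions made for the example; the module may join the three parts differently.

```perl
use strict;
use warnings;

# Assumed separator; the real module may concatenate the fields differently.
my $SEP = '~';

# Hypothetical helper: build a key from inflected form, POS tag, and lemma.
sub build_key {
    my ($form, $pos, $lemma) = @_;
    return join $SEP, $form, $pos, $lemma;
}

# The lexicon keeps its units in ITEM, a reference to a hash table.
my %lexicon = (ITEM => {});

my $key = build_key('cells', 'NNS', 'cell');
$lexicon{ITEM}{$key} = { form => 'cells', pos => 'NNS', lemma => 'cell' };

print "$key\n";
```

Because the key combines all three parts, two words that share an inflected form but differ in tag or lemma end up as distinct lexicon units.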

METHODS

new()

    new();

The method creates a new lexicon object.

addItem()

    addItem($item,$key);

The method adds the lexicon unit (or item) $item to the lexicon, stored under the key $key.

addOccurrence()

    addOccurrence($form);

The method adds a new occurrence of the lexicon unit having the form $form. If the unit doesn't already exist, the lexicon item is created; otherwise its frequency is incremented.
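The create-or-increment behaviour can be sketched as follows. This is a stand-alone approximation, not the module's code: the function add_occurrence, the tab-separated layout of $form, the "~" separator, and the choice to start a new unit at frequency 1 are all assumptions.

```perl
use strict;
use warnings;

my %items;    # stands in for the ITEM hash of the lexicon

# Hypothetical stand-in for addOccurrence(); not the module's code.
sub add_occurrence {
    my ($form) = @_;

    # Assumed input layout: inflected form, POS tag, lemma, tab-separated.
    my ($inflected, $pos, $lemma) = split /\t/, $form, 3;
    my $key = join '~', $inflected, $pos, $lemma;    # assumed separator

    if (exists $items{$key}) {
        $items{$key}{frequency}++;    # known unit: count one more occurrence
    }
    else {
        # New unit: create it with an initial frequency of 1 (assumption).
        $items{$key} = {
            form      => $inflected,
            pos       => $pos,
            lemma     => $lemma,
            frequency => 1,
        };
    }
    return $items{$key};
}

add_occurrence("cells\tNNS\tcell");
add_occurrence("cells\tNNS\tcell");
add_occurrence("cell\tNN\tcell");

printf "%s: %d\n", $_, $items{$_}{frequency} for sort keys %items;
```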

getItem()

    getItem($key);

The method returns the lexicon item given the key $key.

itemExists()

    itemExists($key);

The method checks whether a lexicon item exists for the key $key. It returns 1 if the item exists and 0 otherwise.
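A typical pattern is to test for a key before fetching the item. The sketch below mimics that pattern over a plain hash reference; item_exists and get_item are hypothetical helpers written for this example, and the "~" separated keys are an assumed format.

```perl
use strict;
use warnings;

# Hypothetical lookup helpers over a plain hash reference.
sub item_exists {
    my ($items, $key) = @_;
    return exists($items->{$key}) ? 1 : 0;    # 1 if present, 0 otherwise
}

sub get_item {
    my ($items, $key) = @_;
    return $items->{$key};                    # undef when the key is unknown
}

my $items = { 'cells~NNS~cell' => { frequency => 2 } };

for my $key ('cells~NNS~cell', 'tissue~NN~tissue') {
    if (item_exists($items, $key)) {
        print "$key seen ", get_item($items, $key)->{frequency}, " time(s)\n";
    }
    else {
        print "$key is not in the lexicon\n";
    }
}
```

Using exists rather than testing the stored value keeps the check independent of what the item contains.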

buildKey()

    buildKey($item);

The method builds the key of the lexicon item $item.

SEE ALSO

Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.

AUTHOR

Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Thierry Hamon and Sophie Aubin

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.