NAME

Lingua::YaTeA::TestifiedTermParser - Perl extension for parsing testified term files (based on Parse::Yapp)

SYNOPSIS

  use FileHandle;
  use Lingua::YaTeA::TestifiedTermParser;

  my $fh = FileHandle->new("<$file_path");

  my $parser = Lingua::YaTeA::TestifiedTermParser->new();

  $parser->YYData->{TTS} = $this;
  $parser->YYData->{WORD} = $word_characters_regexp;
  $parser->YYData->{TAGSET} = $tag_set;
  $parser->YYData->{MATCH} = $match_type;
  $parser->YYData->{FH} = $fh;
  $parser->YYData->{FILTERING_LEXICON} = $filtering_lexicon_h;

  $parser->YYParse(yylex   => \&Lingua::YaTeA::TestifiedTermParser::_Lexer,
                   yyerror => \&Lingua::YaTeA::TestifiedTermParser::_Error);

DESCRIPTION

This module implements a parser for testified term files.

The parser relies on several pieces of information: the word character list (field WORD), i.e. all the characters that may appear in a word; the Part-of-Speech tagset (field TAGSET); the type of matching (field MATCH); the filehandle to read from (field FH); and the lexicon of the corpus (field FILTERING_LEXICON).

METHODS

_Error()

    _Error($error_object);

This method handles parsing errors: it prints a message explaining the error.

_Lexer()

    _Lexer($parser_info);

This method applies the parser to the data contained in the structure $parser_info (field INPUT).
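
As a rough illustration of the lexer contract that Parse::Yapp expects (the lexer is called with the parser object and returns a token name and a value, with an empty token name at end of input), here is a minimal sketch. It is not this module's actual _Lexer: the token names WORD and NEWLINE, the regular expressions, the sketch_lexer function, and the Stub::Parser class (a stand-in so the example runs without Parse::Yapp installed) are all assumptions made for illustration. Only the YYData fields FH and INPUT are taken from the documentation above.

```perl
use strict;
use warnings;

# Hypothetical stand-in for a Parse::Yapp parser object. It only provides
# YYData, which returns a hash reference reserved for user data.
package Stub::Parser;
sub new    { my ($class) = @_; return bless { data => {} }, $class; }
sub YYData { my ($self)  = @_; return $self->{data}; }

package main;

# Sketch of a Parse::Yapp-style lexer (assumed token set, not the module's).
# Returns (TOKEN_NAME, value), or ('', undef) once the input is exhausted.
sub sketch_lexer {
    my ($parser) = @_;
    my $data = $parser->YYData;

    while (1) {
        # Refill the INPUT buffer from the filehandle when it is empty.
        if (!defined $data->{INPUT} || $data->{INPUT} eq '') {
            my $fh   = $data->{FH};
            my $line = <$fh>;
            return ('', undef) unless defined $line;    # end of input
            $data->{INPUT} = $line;
        }
        $data->{INPUT} =~ s/^[^\S\n]+//;    # skip whitespace except newlines
        return ('NEWLINE', "\n") if $data->{INPUT} =~ s/^\n//;
        return ('WORD', $1)      if $data->{INPUT} =~ s/^(\S+)//;
    }
}

# Usage: read from an in-memory filehandle and collect the token names.
my $text = "cell wall\nprotein\n";
open my $fh, '<', \$text or die "cannot open in-memory file: $!";

my $parser = Stub::Parser->new;
$parser->YYData->{FH} = $fh;

my @tokens;
while (1) {
    my ($type, $value) = sketch_lexer($parser);
    last if $type eq '';
    push @tokens, $type;
}
print join(' ', @tokens), "\n";    # prints "WORD WORD NEWLINE WORD NEWLINE"
```

Keeping the read buffer in YYData rather than in a global variable means each parser instance carries its own lexer state, which is why the synopsis stores the filehandle there.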

SEE ALSO

Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds), Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006), pages 380-387. LNAI 4139. August 2006.

AUTHOR

Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Thierry Hamon and Sophie Aubin

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.