NAME
Plucene::SearchEngine::Index::DOC - a Plucene backend for indexing Microsoft Word documents
VERSION
version 0.001
DESCRIPTION
This backend analyzes a DOC file for its textual content (using antiword
).
This code is not currently actively maintained.
METHODS
gather_data_from_file
Overrides the method from Plucene::SearchEngine::Index::Text to provide DOC parsing.
AVAILABILITY
The latest version of this module is available from the Comprehensive Perl Archive Network (CPAN). Visit http://www.perl.com/CPAN/ to find a CPAN site near you, or see https://metacpan.org/module/Plucene::SearchEngine::Index::MSOffice/.
SOURCE
The development version is on github at http://github.com/doherty/Plucene-SearchEngine-Index-MSOffice and may be cloned from git://github.com/doherty/Plucene-SearchEngine-Index-MSOffice.git
BUGS AND LIMITATIONS
You can make new bug reports, and view existing ones, through the web interface at https://github.com/doherty/Plucene-SearchEngine-Index-MSOffice/issues.
AUTHORS
Sopan Shewale <sopan.shewale@gmail.com>
Mike Doherty <doherty@pythian.com>
COPYRIGHT AND LICENSE
This software is copyright (c) 2012 by Sopan Shewale <sopan.shewale@gmail.com>.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.