
Changes for version 0.06

  • Deep linking to the correct page of a PDF
  • We now do our own snippet extraction and highlighting for context
  • ICal importer
  ! Elasticsearch 2.x support removed. There were too many breaking API changes between Elasticsearch 2.x and 5.x, so I removed the support for 2.x. Sorry.
  ! Elasticsearch 5.x required

Modules

connect to Apache::Tika
A simple local search engine
a search index entry
metadata extractors
HTML snippet extractor
schema definition for the Elasticsearch index
helper routines
filter HTML to a set of allowed tags and attributes
clean up mail parts

Provides

in lib/CORION/Apache/Tika/Connection.pm
in lib/CORION/Apache/Tika/Connection/AEHTTP.pm
in lib/CORION/Apache/Tika/Connection/LWP.pm
in lib/CORION/Apache/Tika/DocInfo.pm
in lib/CORION/Apache/Tika/Server.pm
in lib/Dancer/SearchApp/Defaults.pm
in lib/Dancer/SearchApp/Extractor/Audio.pm
in lib/Dancer/SearchApp/Extractor/Image.pm
in lib/Dancer/SearchApp/Extractor/PDF.pm
in lib/Mail/Email/IMAP.pm
in lib/Search/Elasticsearch/Plugin/Langdetect.pm
in lib/Search/Elasticsearch/Plugin/Langdetect/API.pm
in lib/Search/Elasticsearch/Plugin/Langdetect.pm