NAME

Dancer::SearchApp::Extractor - metadata extractors

METHODS

->extract( %options )

  my $info = $tika->get_meta( $content );
  my $done = $extractor->extract(
              url => $url,
              info => $info,
              #content => \$content, # if we have it
              filename => $file, # if we have it
              folder => $res{ folder }, # if we have it
  )->then(sub {
      my( @entries ) = @_;
      # Do something with the hashrefs we get back,
      # like insert the first one into Elasticsearch
  });

This method offers the file to every installed plugin for inspection. The $info parameter contains the metadata and content extracted by Apache Tika, so the MIME type in particular is available to each plugin.

The method returns a promise so that analysis can happen in the background. The promise is resolved with the list of items the plugins found, with undef results left out. Currently no ranking is performed, and all plugins are treated as equally applicable.
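
The "offer to every plugin, keep what is defined" behaviour described above can be sketched in plain Perl. This is only an illustration and not the module's actual implementation: the %plugins table, the extract_all function and the plain hash standing in for the Tika $info object (with a made-up mime_type key) are all hypothetical, and the promise wrapper is left out so that the example runs with core Perl alone.

```perl
use strict;
use warnings;

# Hypothetical plugins: each receives the options and returns a hashref
# when it recognizes the file, or undef when it does not apply.
my %plugins = (
    audio => sub {
        my (%opt) = @_;
        return undef unless $opt{info}{mime_type} =~ m{^audio/};
        return { type => 'audio', url => $opt{url} };
    },
    pdf => sub {
        my (%opt) = @_;
        return undef unless $opt{info}{mime_type} eq 'application/pdf';
        return { type => 'pdf', url => $opt{url} };
    },
);

# Offer the file to every plugin and keep only the defined results.
# No ranking is applied; results simply follow plugin name order.
sub extract_all {
    my (%opt) = @_;
    return grep { defined } map { $plugins{$_}->(%opt) } sort keys %plugins;
}

my @found = extract_all(
    url  => 'file:///tmp/example.pdf',
    info => { mime_type => 'application/pdf' },   # stand-in for Tika output
);
print scalar(@found), " item(s), first type: $found[0]{type}\n";
```

In this sketch a file that no plugin recognizes yields an empty list, which matches the description of undef results being dropped before the caller sees them.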