The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version 0.03

  • Added filters! Thanks to Tatsuhiko Miyagawa for the inspiration and advice
  • Fix F::E::Plain's return value (should have been F::E::Result)
  • Make sure recode() is called before creating F::E::Result
  • Add register_filter()
  • Backward Incompatible Changes ***
  • rename register() to register_processor().
  • Change F::E::Result API. Now results have a metadata field, which can contain anything other than the text field. Also, now F::E::Result has mandatory fields 'filename', 'mime_type', 'text'.

Modules

Extract Text From Arbitrary File Types
Extract Text From HTML Files
Extract Text From MP3 Files
Extract Text From Plain Text Files
Extract Text From RTF Files
Extraction Result Object

Provides

in lib/File/Extract/Base.pm