The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version 0.011 - 2014-06-20

  • Dist::Zilla maintenance (Stanislaw Pusep)
  • Documentation update (Stanislaw Pusep)
  • drop PWP::Encoding dependency (Sergey Romanov)

Documentation

compute cosine similarity between two documents
uses MinHash & SpeedyFx to compare large text data
efficiently count unique tokens from a file

Modules

tokenize/hash large amount of strings efficiently

Examples