Changes for version 0.013 - 2021-06-12

  • Regenerated README.pod (Stanislaw Pusep)
  • Bump copyright year (Stanislaw Pusep)
  • Update nedtrie.h to the latest version (Stanislaw Pusep)
  • Silence Test::Pod::LinkCheck (Stanislaw Pusep)
  • Compatibility with Perl >= 5.32 (fix for issue #3) (Stanislaw Pusep)

Documentation

Compute the cosine similarity between two documents
Compare large text data using MinHash and SpeedyFx
Efficiently count the unique tokens in a file

Modules

Tokenize and hash large numbers of strings efficiently

Examples