Станислав Пусеп (Stanislaw Pusep)
and 1 contributor
  • Sergey Romanov
cosine_cmp - compute cosine similarity between two documents
minhash_cmp - uses MinHash & SpeedyFx to compare large text data
uniq_wc - efficiently count unique tokens from a file
Text::SpeedyFx - tokenize/hash large amount of strings efficiently
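
For readers unfamiliar with the measure that cosine_cmp reports, here is a minimal Python sketch of cosine similarity between two documents. It is not the distribution's implementation: the regex tokenizer and the function names `tokenize` and `cosine_similarity` are assumptions made for illustration, and plain token counts stand in for whatever hashing Text::SpeedyFx applies internally.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase runs of word characters.
    # The real tool's tokenization rules may differ.
    return re.findall(r"\w+", text.lower())


def cosine_similarity(doc_a, doc_b):
    # Represent each document as a sparse vector of token counts.
    a = Counter(tokenize(doc_a))
    b = Counter(tokenize(doc_b))

    # Dot product over the tokens of doc_a; Counter yields 0 for
    # tokens that are absent from doc_b.
    dot = sum(count * b[token] for token, count in a.items())

    # Euclidean norms of the two count vectors.
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))

    # An empty document has no direction; treat similarity as 0.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Identical documents score 1, documents with no tokens in common score 0, and partial overlap falls in between.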
Changes for version 0.011
    • Dist::Zilla maintenance (Stanislaw Pusep)
    • Documentation update (Stanislaw Pusep)
    • drop PWP::Encoding dependency (Sergey Romanov)
