String::Similarity::Group - take a list of strings and group them by similarity within a threshold

Imagine you have a list of filenames, and you want to group them by similarity. You can simply pass at list of strings, the min similarity to match, and you get an array of groups ( array refs of similar elements). Or if you have a list of strings, a...

LEOCHARRE/String-Similarity-Group-1.16 - 20 Aug 2009 14:59:02 GMT - Search in distribution

CONFIG - [documentation] Description of all configuration options for measures

The following is a list of options supported by the measures of semantic relatedness. This is intended to serve as a "master list" of options so that descriptions can be copied from here and pasted into the documentation for specific modules. trace T...

TPEDERSE/WordNet-Similarity-2.05 - 16 Jun 2008 23:05:03 GMT - Search in distribution

TPath - general purpose path languages for trees

TPath provides an xpath-like language for arbitrary trees. You implement a minimum of two methods -- "children" and "tag" -- and then you can explore your trees via concise, declarative paths. In tpath, "attributes" are node attributes of any sort an...

DFH/TPath-1.007 - 05 Aug 2014 16:26:44 GMT - Search in distribution

Lucene - API to the C++ port of the Lucene search engine

Like it or not Lucene has become the de-facto standard for open-source high-performance search. It has a large user-base, is well documented and has plenty of committers. Unfortunately until recently Lucene was entirely written in Java and therefore ...

TBUSCH/Lucene-0.18   (3 reviews) - 27 Sep 2007 18:15:28 GMT - Search in distribution

README

1. Introduction The Ngram Statistics Package (NSP) is a suite of programs that aids in analyzing Ngrams in text files. We define an Ngram as a sequence of 'n' tokens that occur within a window of at least 'n' tokens in the text; what constitutes a "t...

TPEDERSE/Text-NSP-1.27 - 16 Feb 2013 21:32:39 GMT - Search in distribution

intro.pod - introduction to WordNet::SenseRelate::TargetWord

This package consists of a set of Perl modules along with supporting Perl programs that perform the task of Word Sense Disambiguation. The program(s) attempt to disambiguate the sense of a single target word in a given context as described by Banerje...

SID/WordNet-SenseRelate-TargetWord-0.09 - 24 Dec 2006 13:13:56 GMT - Search in distribution

Crypt::FNA

FNA stands for Fractal Numerical Algorithm, the symmetrical encryption method based on two algorithms that I developed for: 1. the construction of a family of fractal curves (F) 2. a encryption based on these curves. A precise description of this alg...

ANAK/Crypt-FNA-0.65 - 19 Jun 2013 02:22:40 GMT - Search in distribution

Archive::Ar - Interface for manipulating ar archives

Archive::Ar is a pure-perl way to handle standard ar archives. This is useful if you have those types of archives on the system, but it is also useful because .deb packages for the Debian GNU/Linux distribution are ar archives. This is one building b...

JBAZIK/Archive-Ar-2.02 - 14 Jul 2014 04:51:07 GMT - Search in distribution

Acme::Tools - Lots of more or less useful subs lumped together and exported into your namespace

Subs created and collected since the mid-90s....

KJETIL/Acme-Tools-0.16   (1 review) - 14 Feb 2015 00:56:38 GMT - Search in distribution

Bio::DB::GFF - Storage and retrieval of sequence annotation data

Bio::DB::GFF provides fast indexed access to a sequence annotation database. It supports multiple database types (ACeDB, relational), and multiple schemas through a system of adaptors and aggregators. The following operations are supported by this mo...

CJFIELDS/BioPerl-1.6.924 - 10 Jul 2014 20:22:23 GMT - Search in distribution

Prima::Widget - window management

Prima::Widget is a descendant of Prima::Component, a class, especially crafted to reflect and govern properties of a system-dependent window, such as its position, hierarchy, outlook etc. Prima::Widget is mapped into the screen space as a rectangular...

KARASIK/Prima-1.43   (3 reviews) - 10 Apr 2015 19:21:06 GMT - Search in distribution

UMLS::Interface - Perl interface to the Unified Medical Language System (UMLS)

This package provides a Perl interface to the Unified Medical Language System (UMLS). The UMLS is a knowledge representation framework encoded designed to support broad scope biomedical research queries. There exists three major sources in the UMLS. ...

BTMCINNES/UMLS-Interface-1.43 - 23 Jun 2015 18:31:45 GMT - Search in distribution

Regexp::English - Perl module to create regular expressions more verbosely

Regexp::English provides an alternate regular expression syntax, one that is slightly more verbose than the standard mechanisms. In addition, it adds a few convenient features, like incremental expression building and bound captures. You can access a...

CHROMATIC/Regexp-English-1.01 - 05 Apr 2011 06:01:30 GMT - Search in distribution

WordNet::SenseKey - convert WordNet sense keys to sense numbers, and v.v.

The WordNet::Similarity package is designed to work with words in the form of lemma#pos#num where "lemma" is the word lemma, "pos" is the part of speech, and "num" is the sense number. Unfortuantely, the sense numbering is not stable from one WordNet...

LINAS/WordNet-SenseKey-1.03 - 14 Jan 2009 22:13:10 GMT - Search in distribution

Bio::Das::Feature - A genomic annotation

A Bio::Das::Segment::Feature object contains information about a feature on the genome retrieve from a DAS server. Each feature -- also known as an "annotation" -- has a start and end position on the genome relative to a reference sequence, as well a...

LDS/Bio-Das-1.17 - 29 Jun 2010 19:43:55 GMT - Search in distribution

Text::FastTemplate - Class that compiles text templates into subroutines.

Text::FastTemplate compiles templates that are written in a line-oriented syntax that resembles the C-preprocessor syntax into Perl subroutines. As much as possible, it is designed to be: Simple the API and the template syntax are very simple. Fast t...

BOZZIO/Text-FastTemplate-0.95 - 07 Nov 2001 06:42:15 GMT - Search in distribution

umls-similarity.pl - This program returns a semantic similarity score between two concepts.

BTMCINNES/UMLS-Similarity-1.45 - 25 Jun 2015 13:50:38 GMT - Search in distribution
  • create-icfrequency.pl - This program sums the frequency counts of the CUIs from a specified set of sources in plain text.
  • create-icpropagation.pl - This program determines the probability of the CUIs in a specified set of sources and relations.

Ace::Sequence::Homol - Temporary Sequence Homology Class

*Ace::Sequence::Homol* is a subclass of Ace::Object (not Ace::Sequence) which is specialized for returning information about a DNA or protein homology. This is a temporary placeholder for a more sophisticated homology class which will include support...

LDS/AcePerl-1.92 - 11 Nov 2008 16:47:31 GMT - Search in distribution

UI::KeyboardLayout - Module for designing keyboard layouts

In this section, a "keyboard" has a certain "character repertoir" (which characters may be entered using this keyboard), and a mapping associating a character in the repertoir to a keypress or to several (sequential or simultaneous) keypresses. A sma...

ILYAZ/UI-KeyboardLayout-0.68 - 11 Nov 2014 03:01:08 GMT - Search in distribution

SystemC::SystemPerl - SystemPerl Language Extension to SystemC

SystemPerl is a version of the SystemC language. It is designed to expand text so that needless repetition in the language is minimized. By using sp_preproc, SystemPerl files can be expanded into C++ files at compile time, or expanded in place to mak...

WSNYDER/SystemPerl-1.344 - 06 Nov 2014 03:24:47 GMT - Search in distribution