The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

umls-association.pl This program calculates the assocation between two concepts or sets of concepts

SYNOPSIS

This utility takes two sets of concepts and returns their assocation score

USAGE

Usage: umls-assocation.pl [OPTIONS] CUI_set1 CUI_set2 --matrix Matrix_File --measure Assoc_Measure

INPUT

[CUI_set1] [CUI_set2]

Two sets of CUIs in the UMLS. Each CUI in the CUI set is comma seperated

[Matrix_File]

File name containing co-occurrence data in sparse matrix format

[Assoc_Measure]

A string specifying the association measure to use The measure used to calculate the assocation. Recommended = x2

The package uses the Text::NSP package to do the calculation. The measure included within this package are:

    1.  Frequency
    2.  Random
    3.  Dice Coefficient
    4.  Fishers exact test - left sided
    5.  Fishers exact test - right sided
    6.  Fishers twotailed test - right sided
    7.  Jaccard Coefficient
    8.  Log-likelihood ratio
    9.  Mutual Information
    10.  Odds Ratio
    11.  Pointwise Mutual Information
    12. Phi Coefficient
    13. Pearson's Chi Squared Test
    14. Poisson Stirling Measure
    15. T-score

OPTIONS

Optional command line arguements

General Options:

Displays the quick summary of program options.

--noorder

If selected, the order in which CUIs appear will be disregarded when the association score is calculated.

--lta

Linking Term Association - Calculates the association scores using implicit or intermediate relationships between the specified CUIs, and the count of unique shared co-occurrences.

--mwa

Minimum Weight Association - Calculates the association scores using implicit or intermediate relationships between the specified CUIs, and the minimum co-occurrence count between shared co-occurrences.

--lsa

Linking Set Association - Calculates the association scores using the association between the sets of co-occurring terms of the original terms

--sbc

Shared B to C association - Calculates the association scores using the association between the set of A co-occuring terms, and the term C.

--wsa

Weighted Set Association - Same as linking set association, but weights the members of the linking set based on their association with the original term. The association measure used for weighting is the same as specified for quantifying association overall (--measure)

--nonorm

Indicates that the weights in WSA will NOT be normalized between 0 and 1 and instrad the direct association score will be used

--precision N

Displays values up to N places of decimal. (DEFAULT: 4)

--help

Displays the quick summary of program options.

--version

Displays the version information.

OUTPUT

The association between the two concepts (or terms)

SYSTEM REQUIREMENTS

  • Perl (version 5.8.5 or better) - http://www.perl.org

  • Text::NSP - http://search.cpan.org/dist/Text-NSP

CONTACT US

  If you have any trouble installing and using UMLS-Assocation, 
  please contact us via the users mailing list :
    
      umls-association@yahoogroups.com
     
  You can join this group by going to:
    
      http://tech.groups.yahoo.com/group/umls-assocation/
     
  You may also contact us directly if you prefer :
    
      Sam Henry: henryst at vcu.edu 

AUTHOR

 Bridget T. McInnes, Virginia Commonwealth University 
 Alexander D. McQuilkin, Virginia Commonwealth University
 Sam Henry, Virginia Commonwealth University

COPYRIGHT

Copyright (c) 2015

 Sam Henry, Virginia Commonwealth University 
 henryst at vcu.edu

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to:

 The Free Software Foundation, Inc.,
 59 Temple Place - Suite 330,
 Boston, MA  02111-1307, USA.