Sam Henry
and 1 contributors

NAME This program calculates the assocation a dataset of term pairs


This utility takes a file of line seperated term pairs as input. The file is of the form: "cui1<>cui2\n" with each line containing a new cui pair. It outputs a line seperated list of association score and term pair of the form: "score<>cui1<>cui2". Each line contains a different cui pair and their score


Usage: [OPTIONS] CUI_LIST_FILE OUTPUT_FILE --measure Assoc_Measure --matrix Matrix_FileName



the input file containing line seperated cui pairs of the form: "cui1<>cui2"


the output file, where each score and cui pair are output of the form: score<>cui1<>cui2


File name containing co-occurrence data in sparse matrix format


A string specifying the association measure to use The measure used to calculate the assocation. Recommended = x2

The package uses the Text::NSP package to do the calculation. The measure included within this package are: 1. Dice Coefficient 2. Fishers exact test - left sided 3. Fishers exact test - right sided 4. Fishers twotailed test - right sided 5. Jaccard Coefficient 6. Log-likelihood ratio 7. Mutual Information 8. Odds Ratio 9. Pointwise Mutual Information 10. Phi Coefficient 11. Pearson's Chi Squared Test 12. Poisson Stirling Measure 13. T-score


Optional command line arguements. These options are identical to Please see for descriptions.


The association between the each concept pair of the input file written to a new line of the output file.


  • Perl (version 5.8.5 or better) -

  • Text::NSP -


  If you have any trouble installing and using UMLS-Assocation, 
  please contact us via the users mailing list :
  You can join this group by going to:
  You may also contact us directly if you prefer :
      Sam Henry: henryst at 


 Sam Henry, Virginia Commonwealth University
 Bridget T. McInnes, Virginia Commonwealth University 
 Alexander D. McQuilkin, Virginia Commonwealth University


Copyright (c) 2015

 Bridget T. McInnes, Virginia Commonwealth University 
 btmcinnes at

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to:

 The Free Software Foundation, Inc.,
 59 Temple Place - Suite 330,
 Boston, MA  02111-1307, USA.