NAME

PDL::Gaussian -- Gaussian distributions.

SYNOPSIS

$x = PDL::Gaussian->new([3],[5]);
$x->set_covariance(...)

DESCRIPTION

This package provides a set of standard routines to handle sets gaussian distributions.

A new set of gaussians is initialized by

$x = PDL::Gaussian->new(xdims,gdims);

Where xdims is a reference to an array containing the dimensions in the space the gaussian is in and gdimslist is a reference to an array containing the dimensionality of the gaussian space. For example, after

$x = PDL::Gaussian->new([2],[3,4]);
$y = PDL::Gaussian->new([],[]);

The variable $x contains set of 12 (=3*4) 2-Dimensional gaussians and $y is the simplest form: one 1D gaussian. Currently, xdims may containe either zero or one dimensions due to limitations of PDL::PP.

To set the distribution parameters, you can use the routines

$x->set_covariance($cv);     # covariance matrices
$x->set_icovariance($icv);   # inverse covariance matrices
$x->set_mu($mu);	      # centers

The dimensions of $cv and $icv must be (@xdims,@xdims,@gdims) and the dimensions of $mu must be (@xdims,@gdims).

Alternatively you can use the routines

$cv = $x->get_covariance();  # cv = reference to covariance matrix
...			      # Fuzz around with cv
$x->upd_covariance();	      # update

and similarly for icovariance (inverse covariance). The last sub call is important to update the other parts of the object.

To get a string representation of the gaussians (most useful for debugging) use the routine

$string = $x->asstr();

It is possible to calculate the probability or logarithm of probability of each of the distributions at some points.

$x->calc_value($x,$p);
$x->calc_lnvalue($x,$p);

Here, $x must have dimensions (ndims,...) and $p must have dimensions (gdimslist, ...) where the elipsis represents the same dimensions in both variables. It is usually advisable to work with the logarithms of probabilities to avoid numerical problems.

It is possible to generate the parameters for the gaussians from data. The function

$x->fromweighteddata($data,$wt,$small_covariance);

where $data is of dimensions (ndims,npoints) and $wt is of dimensions (npoints,gdimslist), analyzes the data statistically and gives a corresponding gaussian distribution. The parameter $small_covariance is the smallest allowed covariance in any direction: if one or more of the eigenvalues of the covariance matrix are smaller than this, they are automatically set to $small_covariance to avoid singularities.

BUGS

Some of the routines (upd_covariance in particular, but likely others) cause segmentation faults and stack traces with current versions of PDL, which renders this module essentially unusable. That is why this module is no longer included in the main PDL distribution (but is available in the CVS version). Fixes are always welcome, so that we may re-include it.

Stupid interface.

Limitation to 1 x-dimensions is questionable (although it's hard to imagine a case when more is needed). Note that this does not mean that you can only have 1-dimensional gaussians. It just means that if you want to have a 6-dimensional gaussian, your xs must be structured like (6) and not (2,3). So clumping the dimensions should make things workable.

Also, it limits you so that even if you have one variable, you need to have the '1' dimensions explicitly everywhere.

Singular distributions are not handled. This should use SVD and be able to handle both infinitely narrow and wide dimensions, preferably so that infinitely narrow dimensions can be queried like $x-relations()> or something like that.

The routines should, if the user requests for it, check all the dimensions of the given arguments for reasonability.

AUTHOR

Copyright (C) 1996 Tuomas J. Lukka (lukka@fas.harvard.edu) All rights reserved. There is no warranty. You are allowed to redistribute this software / documentation under certain conditions. For details, see the file COPYING in the PDL distribution. If this file is separated from the PDL distribution, the copyright notice should be included in the file.