The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

DiaColloDB::Profile::Multi - diachronic collocation db, (co-)frequency profile, by date-slice

SYNOPSIS

 ##========================================================================
 ## PRELIMINARIES
 
 use DiaColloDB::Profile::Multi;
 
 ##========================================================================
 ## Constructors etc.
 
 $mp = CLASS_OR_OBJECT->new(%args);
 $mp2 = $mp->clone();
 
 ##========================================================================
 ## I/O: JSON
 
 $obj = $CLASS_OR_OBJECT->loadJsonData( $data,%opts);
 
 ##========================================================================
 ## I/O: Text
 
 undef = $CLASS_OR_OBJECT->saveTextHeader($fh, hlabel=>$hlabel, titles=>\@titles);
 $bool = $mp->saveTextFh($fh,%opts);
 
 ##========================================================================
 ## I/O: HTML
 
 $bool = $mp->saveHtmlFile($filename_or_handle, %opts);
 
 ##========================================================================
 ## Compilation and Trimming
 
 $mp_or_undef = $mp->compile($func,%opts);
 $mp = $mp->uncompile();
 $class = $CLASS_OR_OBJECT->pclass();
 $prf = $mp->sumover();
 $mp_or_undef = $mp->trim(%opts);
 $mp = $mp->stringify( $obj);
 
 @ppairs = $CLASS_OR_OBJECT->align($mp1,$mp2);
 @mps    = $CLASS_OR_OBJECT->xfill(\@mps,%opts)
 @xkd    = $CLASS_OR_OBJECT->xkeys(\@mps,%opts)
 
 ##========================================================================
 ## Binary operations
 
 $mp = $mp->_add($mp2,%opts);
 $mp3 = $mp1->add($mp2,%opts);
 $diff = $mp1->diff($mp2);

DESCRIPTION

DiaColloDB::Profile::Multi is the top-level class for representing diachronic collocate frequency profile data over multiple date-slices as retrieved e.g. from a native index or DDC back-end. The API closely mirrors that of the DiaColloDB::Profile class used to represent slice-wise sub-profiles.

Globals & Constants

Variable: @ISA

DiaColloDB::Profile::Multi inherits from DiaColloDB::Persistent.

Constructors etc.

new
 $mp = CLASS_OR_OBJECT->new(%args);

%args, object structure:

 profiles => \@profiles,   ##-- ($profile, ...) : sub-profiles, with {label} key
 titles   => \@titles,     ##-- item group titles (default:undef: unknown)
 qinfo    => \%qinfo,      ##-- query info (optional)

%qinfo structure:

 q12 => $q12,              ##-- collocation-pair (w1,w2) count-query string (DDC)
 q1  => $q1,               ##-- collocation-item (w1) count-query string (DDC)
 q2  => $q2,               ##-- collocation-item (w2) count-query string (DDC)
 qN  => $qN,               ##-- total frequency count-query string (DDC)
 fcoef => $fcoef,          ##-- item count coefficient (DDC)
 qtemplate => $qtemplate,  ##-- template query string (replace '__W2.i__' with w2 item property #i (e.g. 0:date, 1:lemma, ...))
clone
 $mp2 = $mp->clone();
 $mp2 = $mp->clone($keep_compiled)

clones %$mp; if $keep_score is true, compiled data is cloned too.

I/O: JSON

loadJsonData
 $obj = $CLASS_OR_OBJECT->loadJsonData( $data,%opts);

guts for loadJsonString(), loadJsonFile()

I/O: Text

saveTextHeader
 undef = $CLASS_OR_OBJECT->saveTextHeader($fh, hlabel=>$hlabel, titles=>\@titles);

wraps DiaColloDB::Profile::saveTextHeader().

saveTextFh
 $bool = $mp->saveTextFh($fh,%opts);

save flat text representation to a filehandle. %opts:

 header => $bool,     ##-- include header-row? (default=1)
 ...                  ##-- other options are passed to DiaColloDB::Profile::saveTextFh()

I/O: HTML

saveHtmlFile
 $bool = $mp->saveHtmlFile($filename_or_handle, %opts);

save flat HTML table representation to a file. %opts:

 table  => $bool,     ##-- include <table>..</table> ? (default=1)
 body   => $bool,     ##-- include <html><body>..</html></body> ? (default=1)
 qinfo  => $varname,  ##-- include <script> for qinfo data? (default='qinfo')
 header => $bool,     ##-- include header-row? (default=1)
 format => $fmt,      ##-- printf score formatting (default="%.2f")

Compilation and Trimming

compile
 $mp_or_undef = $mp->compile($func,%opts);

compile all sub-profiles for score-function $func; just calls $prf->compile($func,%opts) for each sub-profile $prf.

uncompile
 $mp = $mp->uncompile();

un-compiles all scores for $mp.

pclass
 $class = $CLASS_OR_OBJECT->pclass();

sub-profile class for use by psum()

sumover
 $prf = $mp->sumover();
 $prf = $CLASS_OR_OBJECT->sumover(\@profiles);

sum of sub-profiles, compiled as for $profiles[0]. used for global trimming.

trim
 $mp_or_undef = $mp->trim(%opts);

trim sub-profiles; calls $prf->trim(%opts) for each sub-profile $prf. %opts: as for DiaColloDB::Profile::trim(), also:

 empty => $bool,   ##-- remove empty sub-profiles? (default=true)
 global => $bool,  ##-- trim sub-profiles globally (default=false)
stringify
 $mp = $mp->stringify( $obj);
 $mp = $mp->stringify(\@key2str)
 $mp = $mp->stringify(\&key2str)
 $mp = $mp->stringify(\%key2str)

stringifies multi-profile (destructive) via $obj->i2s($key2), $key2str->($i2) or $key2str->{$i2}; calls $prf->stringify(...) for each sub-profile $prf.

align
  @ppairs = $CLASS_OR_OBJECT->align($mp1,$mp2);
 \@ppairs = $CLASS_OR_OBJECT->align($mp1,$mp2);
  • aligns subprofile-pairs from $mp1 and $mp2

  • $mp1, $mp2 are either:

    HASH-refs

    with a 'profiles' key (e.g. DiaColloDB::Profile::Multi objects), or

    ARRAY-refs

    of DiaColloDB::Profile-like objects to align

  • subprofiles are aligned in stored order

  • arguments must be EITHER singletons (1 subprofile) OR of same size; this lets you compare e.g. a global profile with a sliced one by something like PDL's "implicit threading".

xfill
  @mps = $CLASS_OR_OBJECT->xfill(\@mps, %opts);
 \@mps = $CLASS_OR_OBJECT->xfill(\@mps, %opts);

Ensure sub-profile labels for all MultiProfiles in \@mps are identical by inserting dummy objects for every sub-profile label (slice) represented by any MultiProfile object in the argument array \@mps. After calling this method, any pair of profiles from the array \@mps can safely be passed to align().

Options %opts:

 class => $class,  ##-- null profile class; default='DiaColloDB::Profile'
xkeys

Binary operations

_add
 $mp = $mp->_add($mp2,%opts);

adds $mp2 frequency data to $mp (destructive); implicitly un-compiles sub-profiles. %opts: passed to DiaColloDB::Profile::_add().

add
 $mp3 = $mp1->add($mp2,%opts);

returns new multi-profile representing sum of $mp1 and $mp2 frequency data. %opts: passed to Profile::_add().

diff
 $diff = $mp1->diff($mp2);

returns score-diff of multi-profiles $mp1 and $mp2; wraps DiaColloDB::Profile::MultiDiff->new($mp1,$mp2)

AUTHOR

Bryan Jurish <moocow@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2015-2020 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.14.2 or, at your option, any later version of Perl 5 you may have available.

SEE ALSO

DiaColloDB::Profile(3pm), DiaColloDB::Profile::MultiDiff(3pm), DiaColloDB(3pm), perl(1), ...