MARC::Record::Stats - scans one or many MARC::Record and gives a statistics on the tags and subtags


Version 0.0.4


This module provides functionality for script. Description of the module interface follows.

    use MARC::Record::Stats;
                my $records = [];
                # code skipped ...
                my $stats = MARC::Record::Stats->new;
                # $records is array of MARC::Record
                for my $r ( @$records ) {
                        $stats->add_record_to_stats( $r );
                $stats->report( *STDOUT, { dots => 1 } );
        ### Some useless features:
                my $record;
                my $records = [];
                # code skipped ...
                # single record statistics
                # $record is a MARC::Record
        my $stats1 = Marc::Record::Stats->new( $record );
        # merge $stats1 and statistics for $records
        # $records is a reference to an array of MARC::Record
        my $stats2 = Marc::Record::Stats->new( $records, $stats1 );
        # ...
        $stats1->report( *STDOUT );
#       $stats2->report( *STDOUT );


new $records [, $stats]

Builds statistics on $records, appends $stats if given.

$records A MARC::Record or a reference to an array of MARC::Record
$stats Marc::Record::Stats object that contains accumulated statistics.

report $fh, $config

Prints out a report on the collected statistics to a filehandle $fh. $config keeps configuretion for the reporter. See MARC::Record::Stats::Report for details


Returns a hashref that contains the statistics:

        <stats_hash> = {
                nrecords        => <int>, # the number of records
                tags            => {
                        <tag>   => <tagstat>, # for every tag found in records
        <tag>       = \d{3} # a tag, three digits
        <tag_stat>  = {
                occurence       => <int>,                       # how many records contain this tag
                subtags         => <subtag_stat>,
        <subtag_stat> = {
                <subtag>        => {
                        occurence       => <int>,               # how many records contain this subtag
                        repeatable  => <0|1>,       # whether or not is repeatable
        <subtag> = [a-z0-9] # alphanum, subtag

add_record_to_stats $record

Add $record to statistics.

get_record_stats $record

returns a reference to a hash: { <tag> => <tag_data> } where <tag_data> is a reference to a hash with the keys occurence - how many times the field with the tag <tag> was found in the record, subtags - result of subtag_stats.

subtag_stats $field

returns a reference to a hash { <subtag letter> => <occurence> } where <occurence> is the number of times the subfield with the code <subtag letter> was found in the fied $field.

$field is MARC::Field


Andrei V. Toutoukine, <tut at>


Please report any bugs or feature requests to bug-marc-record-stats at, or through the web interface at I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.


You can find documentation for this module with the perldoc command.

    perldoc Marc::Record::Stats

You can also look for information at:



Copyright 2011 Andrei V. Toutoukine.

This program is free software; you can redistribute it and/or modify it under the terms of either: the GNU General Public License as published by the Free Software Foundation; or the Artistic License.

See for more information.