NAME

File::FormatIdentification::RandomSampling::Model - methods to identify files using random sampling

VERSION

version 0.005

SYNOPSIS

This module is an extension of File::FormatIdentification::RandomSampling that estimates the MIME type of media (or files). It reads randomly sampled sectors and derives heuristics about their content type.
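
To illustrate what random sampling of sectors means, here is a minimal sketch. It is not taken from the module: the helper name sample_random_sectors, its parameters, and the 512-byte default sector size are assumptions made for this example only.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of the module): read $count randomly chosen
# sectors of $sector_size bytes each from the file at $path.
sub sample_random_sectors {
    my ($path, $count, $sector_size) = @_;
    $sector_size //= 512;    # assumed default sector size

    open my $fh, '<:raw', $path or die "cannot open $path: $!";
    my $size    = -s $fh;
    my $sectors = int($size / $sector_size);    # number of complete sectors

    my @buffers;
    if ($sectors > 0) {
        for (1 .. $count) {
            my $index = int(rand($sectors));    # pick a sector uniformly at random
            seek($fh, $index * $sector_size, 0) or die "seek failed: $!";
            my $got = read($fh, my $buffer, $sector_size);
            die "read failed: $!" unless defined $got;
            push @buffers, $buffer;
        }
    }
    close $fh;
    return @buffers;    # empty list if the file is smaller than one sector
}
```

Each returned buffer is a binary string of the kind that calc_mimetype expects. Sampling with replacement keeps the sketch short; a real scanner might prefer distinct sectors.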

To determine the MIME type of a given binary string:

  use File::FormatIdentification::RandomSampling;

  my $ff = File::FormatIdentification::RandomSampling->new(); # basic instantiation
  my $type = $ff->calc_mimetype($buffer); # estimate the MIME type of the given binary string

The model was learned from training data and is implemented as a decision tree. The module is at a very early stage. To create your own models, see the scripts cfi_create_training_data.pl and cfi_learn_model.pl.
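
To show what a decision tree over a byte buffer can look like, here is a toy sketch. It is not the learned model shipped with this module: the features, the thresholds, and the labels are all invented for illustration, and the function names toy_features and toy_classify are hypothetical.

```perl
use strict;
use warnings;

# Toy features (assumed, not the module's): share of printable ASCII bytes
# and number of distinct byte values in the buffer.
sub toy_features {
    my ($buffer) = @_;
    my $length = length $buffer;
    return { printable => 0, distinct => 0 } if $length == 0;

    my %seen;
    my $printable = 0;
    for my $byte (unpack 'C*', $buffer) {
        $seen{$byte} = 1;
        # printable ASCII plus tab, line feed and carriage return
        $printable++
            if ($byte >= 0x20 && $byte <= 0x7e)
            || $byte == 0x09 || $byte == 0x0a || $byte == 0x0d;
    }
    return {
        printable => $printable / $length,
        distinct  => scalar keys %seen,
    };
}

# Hand-written toy tree; thresholds and labels are invented for illustration.
sub toy_classify {
    my ($buffer) = @_;
    return 'empty' if length($buffer) == 0;

    my $f = toy_features($buffer);
    return 'text'          if $f->{printable} > 0.95;
    return 'sparse binary' if $f->{distinct} < 16;
    return 'dense binary';
}
```

A learned tree has the same shape, a cascade of threshold tests on features, but its splits are derived from training data rather than written by hand.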

SOURCE

The current development version is available at https://art1pirat.spdns.org/art1/crazy-fast-image-scan

METHODS

calc_mimetype

Returns a string indicating the MIME type of the given buffer.

AUTHOR

Andreas Romeyke <pause@andreas-romeyke.de>

COPYRIGHT AND LICENSE

This software is Copyright (c) 2020 by Andreas Romeyke.

This is free software, licensed under:

  The GNU General Public License, Version 3, June 2007