- COPYRIGHT AND LICENSE
File::FormatIdentification::RandomSampling - methods to identify content of device o media files using random sampling
This module is suitable to get a good estimation about the content of media (or files). It uses random sampling of sectors to obtain heuristics about the content types.
To check the base type of a given binary string:
my $ff = File::FormatIdentification::RandomSampling->new(); # basic instantiation my $type = $ff->calc_type($buffer); # calc type of given binary string
The following tools are supplied with this module and are presented below:
This script scans devices or images very fast using random sampling and reports wht kind of content could be found.
For a detailed documentation use the included POD there.
This script scans a bunch of files and calcs most frequent one- and bigrams and stores them in a CSV file.
The actual development version is available at https://art1pirat.spdns.org/art1/crazy-fast-image-scan
resets the internal bytegram state. Also called if object will be instantiated
uses the most significant first 8 bytegram entries to from a histogram, returned as hash reference
returns true, if 1-byte bytegrams are uniform
returns true, if 1-byte bytegrams indicating empty buffers
returns true, if 1-byte bytegrams are typical for texts
returns true, if 1-byte bytegrams are typical for MPEG/Quicktime Videos
returns string indicating type of a given buffer
Andreas Romeyke <firstname.lastname@example.org>
This software is Copyright (c) 2020 by Andreas Romeyke.
This is free software, licensed under:
The GNU General Public License, Version 3, June 2007