Nicolas Franck
and 1 contributors

NAME

Catmandu::Importer::PDFPages - Catmandu importer to extract text data per page from one pdf

SYNOPSIS

    # From the command line

    # Export pdf pages with their text and coördinates

    $ catmandu convert PDFPages --file input.pdf to YAML

    #In a script

    use Catmandu::Sane;

    use Catmandu::Importer::PDFPages;

    my $importer = Catmandu::Importer::PDFPages->new( file => "/tmp/input.pdf" );

    $importer->each(sub{

        my $page = $_[0];
        #..

    });

EXAMPLE OUTPUT IN YAML

    - label: Cover Page
      height: 878
      width: 595
      text: "Hello world"

AUTHORS

Nicolas Franck <nicolas.franck at ugent.be>

SEE ALSO

Catmandu, Catmandu::Importer , Poppler