NAME

SWISH::Filters::PDF2XML - Perl extension for filtering PDF documents

DESCRIPTION

This is a plug-in module that uses the CAM::PDF package to convert PDF documents to XML. Each info tag found in the PDF document is emitted as a meta tag.

You may pass to SWISH::Filter's filter method, via user_data, the name of the PDF info tag to use as the XML <title>:

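    use SWISH::Filter;

    my $filter = SWISH::Filter->new;

    # Name of the PDF info tag to use for <title>
    my %user_data;
    $user_data{pdf}{title_tag} = 'title';

    # $filename is the path of the PDF to filter
    my $was_filtered = $filter->filter(
        document  => $filename,
        user_data => \%user_data,
    );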

If the PDF has an info tag with that name (here, "title"), its value is used as the XML <title>. If no title_tag is passed, "title" is used as the default tag name.
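
The fallback rule can be illustrated with a short, self-contained sketch. This is not the module's actual code: title_tag_for and pick_title are hypothetical helpers, the %info hash is made-up data standing in for a PDF's info tags, and exact (case-sensitive) name matching is an assumption.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of SWISH::Filters::PDF2XML: choose the
# info-tag name to use for <title>, falling back to 'title'.
sub title_tag_for {
    my ($user_data) = @_;
    my $tag = $user_data && $user_data->{pdf}
        ? $user_data->{pdf}{title_tag}
        : undef;
    return defined $tag ? $tag : 'title';
}

# Hypothetical helper: look up the title text among a PDF's info tags.
# Assumes an exact, case-sensitive match on the tag name.
sub pick_title {
    my ( $info, $user_data ) = @_;
    return $info->{ title_tag_for($user_data) };
}

# Made-up info tags standing in for what a PDF might carry.
my %info = ( title => 'Annual Report', Subject => 'Finance' );

# No user_data: the default tag name 'title' is used.
print pick_title( \%info, undef ), "\n";

# A title_tag of 'Subject' selects that info tag instead.
print pick_title( \%info, { pdf => { title_tag => 'Subject' } } ), "\n";
```

With the sample data, the first call selects the value stored under "title" and the second selects the value stored under "Subject".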

AUTHOR

Peter Karman

SEE ALSO

SWISH::Filter