The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Documentation

ocr
read an image file and turn into text
get text content of pdf document images within
get text from pdf and resort to ocr as needed

Modules

read an image with tesseract and get output
get images from pdf document
get ocr and images out of a pdf file
extract text fom pdf document resorting to ocr as needed
save ocr to text file for easy retrieval