-
-
15 Aug 2013 02:11:22 UTC
- Distribution: CAM-PDF
- Source (raw)
- Browse (raw)
- Changes
- How to Contribute
- Issues (51)
- Testers (6443 / 3 / 0)
- Kwalitee
Bus factor: 0- 54.93% Coverage
- License: perl_5
- Perl: v5.6.0
- Activity
24 month- Tools
- Download (749.66KB)
- MetaCPAN Explorer
- Permissions
- Subscribe to distribution
- Permalinks
- This version
- Latest version
and 1 contributors-
Clotho Advanced Media, Inc.
- Dependencies
- Crypt::RC4
- Digest::MD5
- Text::PDF
- and possibly others
- Reverse dependencies
- CPAN Testers List
- Dependency graph
NAME
getpdftext.pl - Extracts and print the text from one or more PDF pages
SYNOPSIS
getpdftext.pl [options] infile.pdf [<pagenums>] Options: -c --check just validates the page instead of printing it -g --geometry just computes geometry, prints nothing -v --verbose print diagnostic messages -h --help verbose help message -V --version print CAM::PDF version <pagenums> is a comma-separated list of page numbers. Ranges like '2-6' allowed in the list Example: 4-6,2,12,8-9
DESCRIPTION
Extracts all of the text from the specified PDF page(s) and prints them to STDOUT. If no pages are specified, all pages are processed.
The
--check
and--geometry
modes are distinctly different. They are used primarily for debugging.SEE ALSO
CAM::PDF
renderpdf.pl
AUTHOR
See CAM::PDF
Module Install Instructions
To install CAM::PDF, copy and paste the appropriate command in to your terminal.
cpanm CAM::PDF
perl -MCPAN -e shell install CAM::PDF
For more information on module installation, please visit the detailed CPAN module installation guide.