The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

App::pdfgrep - Print lines matching text in PDF files

VERSION

This document describes version 0.015 of App::pdfgrep (from Perl distribution App-PDFUtils), released on 2023-02-10.

FUNCTIONS

pdfgrep

Usage:

 pdfgrep(%args) -> [$status_code, $reason, $payload, \%result_meta]

Print lines matching text in PDF files.

This is a wrapper for pdftotext + grep-like utility that is based on AppBase::Grep. The unique features include multiple patterns and --dash-prefix-inverts.

This function is not exported.

Arguments ('*' denotes required arguments):

  • all => true

    Require all patterns to match, instead of just one.

  • color => str (default: "auto")

    Specify when to show color (never, always, or auto/when interactive).

  • count => true

    Supress normal output, return a count of matching lines.

  • dash_prefix_inverts => bool

    When given pattern that starts with dash "-FOO", make it to mean "^(?!.*FOO)".

    This is a convenient way to search for lines that do not match a pattern. Instead of using -v to invert the meaning of all patterns, this option allows you to invert individual pattern using the dash prefix, which is also used by Google search and a few other search engines.

  • files => array[filename]

    If not specified, will search for all PDF files recursively from the current directory.

  • ignore_case => bool

    If set to true, will search case-insensitively.

  • invert_match => bool

    Invert the sense of matching.

  • line_number => true

    Show line number along with matches.

  • pattern => str

    Specify *string* to search for.

  • quiet => true

    Do not print matches, only return appropriate exit code.

  • regexps => array[str]

    Specify additional *regexp pattern* to search for.

Returns an enveloped result (an array).

First element ($status_code) is an integer containing HTTP-like status code (200 means OK, 4xx caller error, 5xx function error). Second element ($reason) is a string containing error message, or something like "OK" if status is 200. Third element ($payload) is the actual result, but usually not present when enveloped result is an error response ($status_code is not 2xx). Fourth element (%result_meta) is called result metadata and is optional, a hash that contains extra information, much like how HTTP response headers provide additional metadata.

Return value: (any)

HOMEPAGE

Please visit the project's homepage at https://metacpan.org/release/App-PDFUtils.

SOURCE

Source repository is at https://github.com/perlancar/perl-App-PDFUtils.

AUTHOR

perlancar <perlancar@cpan.org>

CONTRIBUTING

To contribute, you can send patches by email/via RT, or send pull requests on GitHub.

Most of the time, you don't need to build the distribution yourself. You can simply modify the code, then test via:

 % prove -l

If you want to build the distribution (e.g. to try to install it locally on your system), you can install Dist::Zilla, Dist::Zilla::PluginBundle::Author::PERLANCAR, Pod::Weaver::PluginBundle::Author::PERLANCAR, and sometimes one or two other Dist::Zilla- and/or Pod::Weaver plugins. Any additional steps required beyond that are considered a bug and can be reported to me.

COPYRIGHT AND LICENSE

This software is copyright (c) 2023, 2022, 2021, 2020, 2017 by perlancar <perlancar@cpan.org>.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.

BUGS

Please report any bugs or feature requests on the bugtracker website https://rt.cpan.org/Public/Dist/Display.html?Name=App-PDFUtils

When submitting a bug or request, please include a test-file or a patch to an existing test-file that illustrates the bug or desired feature.