NAME

webreaper -- download a web page and its links

SYNOPSIS

        webreaper [-v] [-r referer] [-u username] [-p password] URL

DESCRIPTION

THIS IS ALPHA SOFTWARE

The webreaper program downloads web sites. In the current working directory it creates a directory named after the host of the URL given on the command line, and saves the downloaded files under it.
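The directory-naming rule described above might be sketched like this (a sketch only, not the actual webreaper source; the URL is a placeholder, and a real implementation would use the URI module rather than this simplified regex):

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Sketch only: derive the save directory from the host of the starting
# URL, as the DESCRIPTION says. The URL is a placeholder; the regex is a
# simplification of what a real URL parser (e.g. the URI module) does.
my $start  = 'http://www.example.com/index.html';
my ($host) = $start =~ m{^https?://([^/:?#]+)}
	or die "cannot parse host from $start\n";

mkdir $host unless -d $host;    # created in the current working directory
print "$host\n";                # prints "www.example.com"
```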

Command line switches

-r --- Referer header to send with the first request
-u --- username for basic auth
-p --- password for basic auth
-v --- verbose output

FEATURES SO FAR

limits itself to the starting domain
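The domain restriction might look like the following hypothetical helper (the function names and the simplified host regex are assumptions, not the actual webreaper internals):

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper illustrating the feature above: only follow links
# whose host matches the host of the starting URL. Names and the
# simplified host regex are assumptions, not the actual webreaper code.
sub host_of {
	my ($url) = @_;
	my ($host) = $url =~ m{^https?://([^/:?#]+)};
	return $host;
	}

sub same_domain {
	my ($start, $link) = @_;
	my ($ha, $hb) = (host_of($start), host_of($link));
	return defined $ha && defined $hb && lc($ha) eq lc($hb);
	}

print same_domain('http://www.example.com/', 'http://www.example.com/a.html')
	? "follow\n" : "skip\n";   # prints "follow"
print same_domain('http://www.example.com/', 'http://other.example.org/')
	? "follow\n" : "skip\n";   # prints "skip"
```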

WISH LIST

limit directory level
limit content types, file names
specify a set of patterns to ignore
do conditional GETs
Tk or curses interface?
create an error log, report, or something
download stats (clock time, storage space, etc.)
multiple levels of verbosity for output
read items from a config file
allow user to add/delete allowed domains during runtime
specify directory where to save downloads
optional sleep time between requests
ensure that path names are safe (i.e. no "..")
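The last wish-list item, path safety, might be sketched like this (an assumed approach using the core File::Spec module, not webreaper's actual code):

```perl
#!/usr/bin/perl
use strict;
use warnings;
use File::Spec;

# Sketch of the path-safety wish-list item: reject absolute paths and any
# path containing an updir ("..") component, so a saved file can never
# escape the download directory. An assumed approach, not webreaper's.
sub path_is_safe {
	my ($path) = @_;
	return 0 if File::Spec->file_name_is_absolute($path);
	my @parts = File::Spec->splitdir($path);
	return 0 if grep { $_ eq File::Spec->updir } @parts;
	return 1;
	}

print path_is_safe('www.example.com/a/b.html') ? "safe\n" : "unsafe\n";
print path_is_safe('../../etc/passwd')         ? "safe\n" : "unsafe\n";
```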

SOURCE AVAILABILITY

This source is part of a SourceForge project which always has the latest sources in CVS, as well as all of the previous releases.

        https://sourceforge.net/projects/brian-d-foy/

If, for some reason, I disappear from the world, one of the other members of the project can shepherd this module appropriately.

AUTHOR

brian d foy, <bdfoy@cpan.org>

COPYRIGHT

Copyright 2003, brian d foy, All rights reserved.

You may use this program under the same terms as Perl itself.