grepurl - print links in HTML
grepurl [-bdv] [-e extension[,extension]] [-E extension[,extension]] [-h host[,host]] [-H host[,host]] [-p regex] [-P regex] [-s scheme[,scheme]] [-S scheme[,scheme]] [-u URL]
The grepurl program searches through the document at the URL specified with the -u switch and prints the URLs that satisfy the given set of options. It applies the options roughly in order of which part of the URL each option affects (scheme, host, path, extension).
So far, grepurl expects to search through HTML, although I want to add other content types, especially plain text, RSS feeds, and so on.
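The filtering order described above (scheme, host, path, extension) can be sketched roughly as follows. This is an illustrative Python sketch, not grepurl's actual Perl implementation: the function name keep_url, its parameter names, and the way the extension is taken from the last path segment are all assumptions made for the example.

```python
import re
from urllib.parse import urlsplit


def keep_url(url, schemes=None, no_schemes=None, hosts=None, no_hosts=None,
             path_re=None, no_path_re=None, exts=None, no_exts=None):
    """Return True if url passes every filter that was supplied.

    Hypothetical helper: the parameters loosely mirror -s/-S, -h/-H,
    -p/-P and -e/-E, but grepurl itself may behave differently.
    """
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    path = parts.path

    # Scheme filters (modeled on -s / -S).
    if schemes and scheme not in schemes:
        return False
    if no_schemes and scheme in no_schemes:
        return False

    # Host filters (modeled on -h / -H).
    if hosts and host not in hosts:
        return False
    if no_hosts and host in no_hosts:
        return False

    # Path regex filters (modeled on -p / -P).
    if path_re and not re.search(path_re, path):
        return False
    if no_path_re and re.search(no_path_re, path):
        return False

    # Extension filters (modeled on -e / -E); the extension is assumed
    # to be whatever follows the last dot in the last path segment.
    last = path.rsplit("/", 1)[-1]
    ext = last.rsplit(".", 1)[-1].lower() if "." in last else ""
    if exts and ext not in exts:
        return False
    if no_exts and ext in no_exts:
        return False
    return True


links = [
    "http://www.example.com/images/cat.jpg",
    "ftp://ftp.example.com/pub/cat.jpg",
    "http://www.panix.com/index.html",
]

# Roughly what "-s http -e jpg" would keep under these assumptions.
selected = [u for u in links if keep_url(u, schemes={"http"}, exts={"jpg"})]
print(selected)
```

Checking the cheapest, coarsest parts of the URL first means a link can be rejected before any regex is run against its path.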
-b  turn relative URLs into absolute ones
-d  turn on debugging output
-e  select links with these extensions (comma separated)
-E  exclude links with these extensions (comma separated)
-h  select links with these hosts (comma separated)
-H  exclude links with these hosts (comma separated)
-p  select only paths that match this Perl regex
-P  exclude paths that match this Perl regex
-s  select only these schemes (comma separated)
-S  exclude these schemes (comma separated)
-t  extract URLs from plain text file (not implemented)
-u  extract URLs from URL (may be file://), expects HTML
-v  turn on verbose output
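To make the effect of -b concrete, here is a small Python sketch of pulling links out of HTML and optionally resolving them against a base URL. It is only an approximation under stated assumptions: the LinkCollector class is hypothetical, it looks only at the href attribute of a tags, and grepurl itself may collect links from other tags as well.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags, optionally made absolute."""

    def __init__(self, base_url=None):
        super().__init__()
        self.base_url = base_url  # set to mimic the -b behavior
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                if self.base_url:
                    # Resolve relative links against the page's URL.
                    value = urljoin(self.base_url, value)
                self.links.append(value)


page = '<a href="/about.html">About</a> <a href="http://www.perl.com/">Perl</a>'

# Without a base URL, links are reported as written in the page.
relative = LinkCollector()
relative.feed(page)

# With a base URL, relative links become absolute.
absolute = LinkCollector(base_url="http://www.example.com/")
absolute.feed(page)

print(relative.links)
print(absolute.links)
```

Links that are already absolute pass through urljoin unchanged, so resolving every link is safe.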
grepurl -u http://www.example.com/
grepurl -b -u http://www.example.com/
grepurl -e jpg -u http://www.example.com/
grepurl -e jpg,jpeg -u http://www.example.com/
grepurl -E cfm,asp -u http://www.example.com/
grepurl -h www.panix.com -u http://www.example.com/
grepurl -h www.panix.com,www.perl.com -u http://www.example.com/
grepurl -H www.microsoft.com -u http://www.example.com/
grepurl -p perl -u http://www.example.com
grepurl -p "pea?rl" -u http://www.example.com
grepurl -p "fred|barney" -u http://www.example.com
grepurl -P SCO -u http://www.example.com
grepurl -P "Micro.*" -u http://www.example.com
grepurl -s http -u http://www.example.com/
grepurl -s ftp,gopher -u http://www.example.com/
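The -p and -P examples above take regular expressions rather than literal strings. The Python sketch below shows which sample paths the patterns would select or exclude. Python's re module stands in for Perl's regex engine here, which behaves the same for patterns this simple. The sample paths and the helper names select and exclude are made up for illustration, and case-sensitive matching is an assumption.

```python
import re

# Hypothetical paths, chosen to exercise the patterns from the examples.
paths = [
    "/perl/index.html",
    "/pearl/ring.html",
    "/fred/bio.html",
    "/SCO/news.html",
]


def select(pattern, candidates):
    """Keep candidates that match pattern anywhere (modeled on -p)."""
    return [p for p in candidates if re.search(pattern, p)]


def exclude(pattern, candidates):
    """Drop candidates that match pattern anywhere (modeled on -P)."""
    return [p for p in candidates if not re.search(pattern, p)]


# "pea?rl" makes the "a" optional, so it matches both spellings.
print(select("pea?rl", paths))

# "fred|barney" matches either alternative.
print(select("fred|barney", paths))

# Excluding "SCO" keeps everything else.
print(exclude("SCO", paths))
```

Quoting the pattern on the command line, as the examples do, keeps the shell from interpreting characters such as ?, | and *.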
This source is part of a SourceForge project which always has the latest sources in CVS, as well as all of the previous releases.
http://sourceforge.net/projects/brian-d-foy/
If, for some reason, I disappear from the world, one of the other members of the project can shepherd this module appropriately.
brian d foy, <bdfoy@cpan.org>
Copyright 2004, brian d foy, All rights reserved.
You may use this program under the same terms as Perl itself.