wikicrawl - crawl Wikipedia to generate graph from the found article links
Crawl wikipedia and create a Graph::Easy text describing the inter-article links that were found during the crawl.
At least one argument must be given to start:
perl examples/wikicrawl.pl --lang=fr
Here are the options:
Print the full documentation, not just this short overview.
Write version info and exit.
Select the language of Wikipedia that we should crawl. Currently supported are 'de', 'en' and 'fr'. Default is 'en'.
Set the root node where the crawl should start. Default is of course 'Xkcd'.
The maximum depth the crawl should go. Please select small values under 10. Default is 4.
The maximum number of links we follow per article. Please select small values under 10. Default is 5.
The maximum number of nodes we crawl. Set to -1 (default) to disable.
http://forums.xkcd.com/viewtopic.php?f=2&t=21300&p=672184 and Graph::Easy.
This library is free software; you can redistribute it and/or modify it under the terms of the GPL.
See the LICENSE file of Graph::Easy for a copy of the GPL.
Copyright (C) 2008 by integral forum.xkcd.com Copyright (C) 2008 by Tels http://bloodgate.com
To install Graph::Easy, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Graph::Easy
CPAN shell
perl -MCPAN -e shell install Graph::Easy
For more information on module installation, please visit the detailed CPAN module installation guide.