WWW::Orkut::Spider - Perl extension for spidering the orkut community
use WWW::Orkut::Spider; my $orkut = WWW::Orkut::Spider->new; $orkut->login($user,$pass); $orkut->get_hisfriends($uid); print $orkut->get_xml_profile($uid);
WWW::Orkut::Spider uses WWW:Mechanize to scrape orkut.com. Output is a simple xml format containing friends, communities and profiles for a given Orkut UID. - Access to orkut.com via WWW::Mechanize - Collects UIDs - Fetches Profiles/Communities/Friends for a given UID - Output via simple xml format
You can specify a Proxy Server here i.e: http://www.proxy.de:8080/ or: undef
login orkut as user with pass return undef if unseccessful
logout of orkut
return name of given known uid
return array with all known uids
return a simple <tag>value</tag>
only after login follow the link to friendslist and get friends uids return 1 if success
parse uid friends page for more uids
follow through all friends pages called after GET of first friend page
parse html page for friends uids helper for follow friends used after GET FriendList
iterate n times over found uids to find more friends more than n=1 seems insane, unlikely to work don't let your script crash in this function, WWW::Mechanize may decide to die if orkut.com gets one of its server failures FIXME: logout/login all 50 requests may help
return profile of uid as simple xml
return communities of uid as simple xml
return friendslist of uid as simple xml
Net::Orkut ( using LWP directly )
mm-pause@manno.name
Copyright (C) 2004 by mm-pause@manno.name
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.2 or, at your option, any later version of Perl 5 you may have available.
To install WWW::Orkut::Spider, copy and paste the appropriate command in to your terminal.
cpanm
cpanm WWW::Orkut::Spider
CPAN shell
perl -MCPAN -e shell install WWW::Orkut::Spider
For more information on module installation, please visit the detailed CPAN module installation guide.