Fancazzista::Scrap
use Fancazzista::Scrap; my %config = ( 'websites' => [ { name => "Korben", url => "https://korben.info", selector => ".status-publish .entry-title", linkSelector => "a", textSelector => "a" limite => 10 # optionnal 5 by default } ], 'subreddits' => [ { "name" => "javascript", "limit" => 10 # optionnal 5 by default } ], 'devto' => [ { "tag" => "perl", "limit" => 10 # optionnal 5 by default } ] ); my @scrapped = Fancazzista::Scrap::scrapContent(\%config); @scrapped : [ { name => '<name>', url => '<url'>, articles => [ { link => '<article-url>', text => '<article-title>' } ], from_devto => 1 # if source is dev.to from_website => 1 # if source if a website from_reddit => 1 # if source if reddit } ]
Perl module for scrap reddit post, dev.to post, website content. It only scrap article/post link and link text.
Antoine MICELI, https://miceli.click
Copyright (C) 2020 by Antoine MICELI
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.18.4 or, at your option, any later version of Perl 5 you may have available.
To install Fancazzista::Scrap, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Fancazzista::Scrap
CPAN shell
perl -MCPAN -e shell install Fancazzista::Scrap
For more information on module installation, please visit the detailed CPAN module installation guide.