The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Fancazzista::Scrap

SYNOPSIS

    use Fancazzista::Scrap;

    my %config = (
        'websites' => [
            {
                name         => "Korben",
                url          => "https://korben.info",
                selector     => ".status-publish .entry-title",
                linkSelector => "a",
                textSelector => "a"
                limite       => 10 # optionnal 5 by default
            }
        ],
        'subreddits' => [
            {
                "name" => "javascript",
                "limit" => 10 # optionnal 5 by default
            }
        ],
        'devto' => [
            {
                "tag" => "perl",
                "limit" => 10 # optionnal 5 by default
            }
        ]
    );

    my @scrapped = Fancazzista::Scrap::scrapContent(\%config);

    @scrapped :
    [
        { 
            name => '<name>', 
            url => '<url'>, 
            articles => [
                { link => '<article-url>', text => '<article-title>' }
            ],
            from_devto => 1 # if source is dev.to
            from_website => 1 # if source if a website
            from_reddit => 1 # if source if reddit
        }
    ]   

DESCRIPTION

Perl module for scrap reddit post, dev.to post, website content. It only scrap article/post link and link text.

SEE ALSO

AUTHOR

Antoine MICELI, https://miceli.click

COPYRIGHT AND LICENSE

Copyright (C) 2020 by Antoine MICELI

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.18.4 or, at your option, any later version of Perl 5 you may have available.