The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version v0.6.0 - 2020-02-01

  • Reject the extracted journalist name if it happens to be one of the known newspaper name.
  • Improve the extraction of https://www.storm.mg

Modules

download and extract news articles from Internet.
A data class for containing news article.

Provides

in lib/NewsExtractor/CSSExtractor.pm
in lib/NewsExtractor/CSSRuleSet.pm
in lib/NewsExtractor/Constants.pm
in lib/NewsExtractor/Download.pm
in lib/NewsExtractor/Error.pm
in lib/NewsExtractor/Extractor.pm
in lib/NewsExtractor/GenericExtractor.pm
in lib/NewsExtractor/JSONLDExtractor.pm
in lib/NewsExtractor/SiteSpecificExtractor.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_tvbs_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_allnews_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ksnews_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ntdtv_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_peopo_org.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_rvn_com_tw.pm
in lib/NewsExtractor/TXExtractor.pm
in lib/NewsExtractor/TextUtil.pm
in lib/NewsExtractor/Types.pm