HTML::ContentExtractor - extract the main content from a web page by analysing the DOM tree

Web pages often contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article that distracts a user from the actual content. This module is used to reduce the noise content in web pages and thus identify the content...

JZHANG/HTML-ContentExtractor-0.03 - 23 Jun 2007 01:36:57 GMT
