HTML::TreeBuilder::XPath - add XPath support to HTML::TreeBuilder River stage three • 59 direct dependents • 203 total dependents

This module adds typical XPath methods to HTML::TreeBuilder, to make it easy to query a document....

MIROD/HTML-TreeBuilder-XPath-0.14 - 20 Sep 2011 01:46:15 UTC

lib/HTML/Robot/Scrapper/Parser/HTML/TreeBuilder/XPath.pm River stage one • 1 direct dependent • 1 total dependent

HERNAN/HTML-Robot-Scrapper-0.11 - 31 Oct 2013 12:12:41 UTC

HTML::TreeBuilder::LibXML - HTML::TreeBuilder and XPath compatible interface with libxml River stage two • 8 direct dependents • 20 total dependents

HTML::TreeBuilder::XPath is libxml based compatible interface to HTML::TreeBuilder, which could be slow for a large document. HTML::TreeBuilder::LibXML is drop-in-replacement for HTML::TreeBuilder::XPath. This module doesn't implement all of HTML::Tr...

TOKUHIROM/HTML-TreeBuilder-LibXML-0.26 - 19 Oct 2016 15:08:57 UTC

HTML::Linear - represent HTML::Tree as a flat list River stage zero No dependents

SYP/HTML-Untemplate-0.019 - 23 Jun 2014 08:41:42 UTC

HTML::Untemplate - web scraping assistant River stage zero No dependents

Suppose you have a set of HTML documents generated by populating the same template with the data from some kind of database. HTML::Untemplate is a set of command-line tools ("xpathify", "untemplate") and modules (HTML::Linear and it's dependencies) w...

SYP/HTML-Untemplate-0.019 - 23 Jun 2014 08:41:42 UTC

HTML::Seamstress - HTML::Tree subclass for HTML templating via tree rewriting River stage one • 1 direct dependent • 1 total dependent

TBONE/HTML-Seamstress-6.112830 - 10 Oct 2011 16:08:41 UTC

HTML::AsText::Fix - extends HTML::Element::as_text() to render text properly River stage zero No dependents

Consider the following HTML sample: <p> <span>AAA</span> BBB </p> <h2>CCC</h2> DDD <br> EEE "HTML::Element::as_text()" method stringifies it as *AAABBBCCCDDDEEE*. Despite being correct, this is far from the actual renderization within a "real" browse...

SYP/HTML-AsText-Fix-0.003 - 23 Jun 2014 10:25:38 UTC

HTML::Encapsulate - rewrites an HTML page as a self-contained set of files River stage zero No dependents

The main motivation for this module is for archiving and printing web pages: these typically come in various separate pieces and aren't simple to download as one chunk. However, it is possible to preserve the content of a web page, but to rewrite the...

NPW/HTML-Encapsulate-v0.3.0 - 13 Nov 2015 11:59:12 UTC

HTML::Linear::Path - represent paths inside HTML::Tree River stage zero No dependents

SYP/HTML-Untemplate-0.019 - 23 Jun 2014 08:41:42 UTC

HTML::Robot::Scrapper - Your robot to parse webpages River stage one • 1 direct dependent • 1 total dependent

This cralwer has been created to be extensible. Scalable with redis queue. The main idea is: i need a queue of urls to be crawled, it can be an array living during my instance (not scalable)... or it can be a Redis queue ( scallable ), being acessed ...

HERNAN/HTML-Robot-Scrapper-0.11 - 31 Oct 2013 12:12:41 UTC

HTML::Linear::Element - represent elements to populate HTML::Linear River stage zero No dependents

SYP/HTML-Untemplate-0.019 - 23 Jun 2014 08:41:42 UTC

HTML::Element::AbsoluteXPath - Add absolute XPath support to HTML::Element River stage zero No dependents

HTML::Element::AbsoluteXPath adds ABSOLUTE XPath support to HTML::Element by adding 'abs_xpath' method to HTML::Element package. It generates smarter XPaths with HINTS which are attributes name of HTML element, like 'class','id','width','name' and et...

KHS/HTML-Element-AbsoluteXPath-1.0 - 21 Jan 2016 02:13:39 UTC

HTML::TreeBuilder::LibXML::Node - HTML::Element compatible API for HTML::TreeBuilder::LibXML River stage two • 8 direct dependents • 20 total dependents

TOKUHIROM/HTML-TreeBuilder-LibXML-0.26 - 19 Oct 2016 15:08:57 UTC

13 results (0.046 seconds)