HTML::TokeParser - Alternative HTML::Parser interface River stage four • 538 direct dependents • 4191 total dependents

The "HTML::TokeParser" is an alternative interface to the "HTML::Parser" class. It is an "HTML::PullParser" subclass with a predeclared set of token types. If you wish the tokens to be reported differently you probably want to use the "HTML::PullPars...

OALDERS/HTML-Parser-3.76 - 04 Mar 2021 18:06:59 UTC - Search in distribution

HTML::TokeParser::Simple - Easy to use HTML::TokeParser interface River stage three • 32 direct dependents • 105 total dependents

"HTML::TokeParser" is an excellent module that's often used for parsing HTML. However, the tokens returned are not exactly intuitive to parse: ["S", $tag, $attr, $attrseq, $text] ["E", $tag, $text] ["T", $text, $is_data] ["C", $text] ["D", $text] ["P...

OVID/HTML-TokeParser-Simple-3.16 - 26 Jun 2013 06:55:44 UTC - Search in distribution

lib/HTML/Content/TokeParserTokenizer.pm River stage zero No dependents

JTAVERNI/HTML-Content-Extractor-0.01 - 22 Aug 2005 03:38:43 UTC - Search in distribution

Test::HTML::Content::NoXPath - HTML::TokeParser fallback for Test::HTML::Content River stage one • 1 direct dependent • 1 total dependent

This is the module that gets loaded when Test::HTML::Content can't find its prerequisites : XML::XPath HTML::Tidy EXPORT Nothing. It stomps over the Test::HTML::Content namespace....

CORION/Test-HTML-Content-0.09 - 06 Feb 2013 19:01:17 UTC - Search in distribution

html_fmt - Reformat HTML, indented according to structure River stage two • 39 direct dependents • 55 total dependents

Writes its input to "STDOUT", reformatted and indented according to the HTML structure. With no arguments, "html_fmt" looks for its input on "STDIN". If it has an argument that looks like a URI, "html_fmt" treats that argument as a URI. If it has an ...

JKEGL/Marpa-R2-8.000000 - 17 Dec 2018 15:40:25 UTC - Search in distribution
  • html_score - Show complexity metric and other stats for web page

SWISH - Perl interface to the SWISH-E search engine. River stage one • 1 direct dependent • 1 total dependent

NOTE: This is alpha code and is not to be used in a production environment and the interface is expected to change while swish 2.2 is being developed. Testing and feedback on using this module is gratefully appreciated. NOTE: This module is now depre...

HANK/SWISH-0.08 - 01 Nov 2003 23:00:07 UTC - Search in distribution

perlfaq6 - Regular Expressions River stage zero No dependents

This section is surprisingly small because the rest of the FAQ is littered with answers involving regular expressions. For example, decoding a URL and checking whether something is a number can be handled with regular expressions, but those answers a...

ETHER/perlfaq-5.20210520 - 20 May 2021 04:17:26 UTC - Search in distribution

Lexing River stage one • 2 direct dependents • 2 total dependents

OVID/HOP-Lexer-0.032 - 08 Mar 2007 07:46:05 UTC - Search in distribution

lwptut - An LWP Tutorial River stage four • 2120 direct dependents • 6040 total dependents

LWP (short for "Library for WWW in Perl") is a very popular group of Perl modules for accessing data on the Web. Like most Perl module-distributions, each of LWP's component modules comes with documentation that is a complete reference to its interfa...

OALDERS/libwww-perl-6.57 - 20 Sep 2021 20:29:02 UTC - Search in distribution

RSSycklr - (beta) Highly configurable recycling of syndication (RSS/Atom) feeds into tailored, guaranteed XHTML fragments. River stage zero No dependents

This is a more of a mini-app engine than a pure module. RSSycklr is a package that wraps up the best parts of XML::Feed and HTML::Truncate then filters it through XML::LibXML to guarantee valid XHTML and adds a side of Template for auto-formatted out...

ASHLEY/RSSycklr-0.15 - 03 Aug 2011 00:55:59 UTC - Search in distribution

WWW::phpBB - phpBB2 forum scraper River stage zero No dependents

This module can be used to scrape a phpBB2 instalation using the web interface. It requires a local phpBB2 setup (you can download the old 2.x versions from http://sourceforge.net/projects/phpbb/files/phpBB%202/ ) that will be overwritten and it can ...

STEFANT/WWW-phpBB-0.09 - 04 May 2011 20:56:40 UTC - Search in distribution

IMDB::Film - OO Perl interface to the movies database IMDB. River stage one • 1 direct dependent • 1 total dependent

Overview IMDB::Film is an object-oriented interface to the IMDB. You can use that module to retrieve information about film: title, year, plot etc....

STEPANOV/IMDB-Film-0.53 - 04 Jan 2013 16:59:02 UTC - Search in distribution
  • IMDB::Persons - Perl extension for retrieving movies persons from IMDB.com
  • IMDB::BaseClass - a base class for IMDB::Film and IMDB::Persons.

Bundle::OpenXPKI - A bundle to install modules required for OpenXPKI River stage zero No dependents

This bundle helps with the installation of modules required for OpenXPKI. OpenXPKI is an open source trust center software which aims to create an enterprise-scale PKI solution. For more information see http://www.openxpki.org...

ALECH/Bundle-OpenXPKI-0.06 - 21 Nov 2008 16:12:39 UTC - Search in distribution

refmunger - Mangles HREF links within HTML documents River stage zero No dependents

Mangles HREF and IMG links within HTML documents to conform to a given filename convention. For example, can mangle HTML document names to fit the MS-DOS convention of 8.3, the 32-character limit for MacOS and even the 14-character UNIX limitations....

ADESC/HTML-RefMunger-0.01 - 16 Feb 1999 22:43:40 UTC - Search in distribution

WWW::Lipsum - perl interface to www.lipsum.com River stage zero No dependents

Generate *Lorem Ipsum* place holder text from perl, using www.lipsum.com <http://www.lipsum.com/>...

ZOFFIX/WWW-Lipsum-1.001012 - 22 Mar 2015 03:47:37 UTC - Search in distribution

Class::Sniff - Look for class composition code smells River stage one • 2 direct dependents • 3 total dependents

ALPHA code. You've been warned. The interface is rather ad-hoc at the moment and is likely to change. After creating a new instance, calling the "report" method is your best option. You can then visually examine it to look for potential problems: my ...

NEILB/Class-Sniff-0.10 - 07 Jun 2014 02:15:13 UTC - Search in distribution

HTTP::GetImages - Spider to recover and store images from web pages. River stage zero No dependents

This module allow syou to automate the searching, recovery and local storage of images from the web, including those linked by anchor ("A"), mage ("IMG") and image map ("AREA") elements. Supply a URI or list of URIs to process, and "HTTP::GetImages" ...

LGODDARD/HTTP-GetImages-0.343 - 19 Sep 2002 06:44:50 UTC - Search in distribution

HTML::LinkAdd - Add hyperlinks to phrases in HTML documents River stage zero No dependents

A simple object that accepts a class reference, a path to a file, and a hash of text-phrase/link-URLs, and supplies a method to obtain the HTML with supplied hyperlinks interpolated. If the values of the supplied has are anonymous lists, the first va...

LGODDARD/HTML-LinkAdd-0.13 - 04 Jan 2011 11:13:38 UTC - Search in distribution

XML::Descent - Recursive descent XML parsing River stage one • 2 direct dependents • 6 total dependents

The conventional models for parsing XML are either DOM (a data structure representing the entire document tree is created) or SAX (callbacks are issued for each element in the XML). XML grammar is recursive - so it's nice to be able to write recursiv...

ANDYA/XML-Descent-1.04 - 29 May 2009 20:46:21 UTC - Search in distribution

Bundle::RT - CPAN Bundle for RT Dependencies River stage zero No dependents

RSPIER/Bundle-RT-0.030601 - 12 Aug 2006 04:53:25 UTC - Search in distribution
76 results (0.046 seconds)