XML::Catalogs::HTML - Catalog of HTML and XHTML DTDs
use XML::Catalogs::HTML -libxml; --- use XML::Catalogs::HTML; XML::Catalogs::HTML->notify_libxml(); --- use XML::Catalogs::HTML; my $url = XML::Catalogs::HTML->get_catalog_url(); my $path = XML::Catalogs::HTML->get_catalog_path();
To properly parse named entities in an XML document, the parser must have access to the XML subformat's DTDs.
XML::LibXML, for one, does not cache DTDs it downloads. Instead, it relies on them being in the system's XML catalog. This is not always configured properly for a number of reasons.
An XML catalog is simply a set of DTDs and a table of contents that associates DTD identifiers with the DTDs.
This module provides a catalog of HTML and XHTML DTDs in case they are not present in the system's catalog.
It works on all platforms, it works without requiring root priviledges, and it works with CPAN's dependency system.
Currently, only the DTDs for HTML 4.01 and XHTML 1.0 are included in this distribution. Please let me know if you need earlier versions.
use XML::Catalogs::HTML -libxml
This loads XML::Catalogs::HTML and calls
This method informs XML::LibXML of this catalog. XML::LibXML will use the local DTDs when parsing HTML and XHTML documents. This only affects the current process.
This mechanism does not stop working when XML::LibXML's
no_network => 1option is used.
Note that XML::LibXML version 1.53 is required for this features.
Returns a file:// URL to the catalog.
Returns the file path of the catalog.
http://en.wikipedia.org/wiki/XML_Catalog, Wikipedia's entry on XML Catalogs.
XML::LibXML, an excellent XML parser that supports catalogs.
XML::Catalogs, this module's base class.
HTML::DTD, an alternate source for HTML and XHTML DTDs.
http://www.w3.org/blog/systeam/2008/02/08/w3c_s_excessive_dtd_traffic, An example of the real world effects of not having local DTDs.
Please report any bugs or feature requests to
bug-XML-Catalogs-HTML at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=XML-Catalogs-HTML. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.
You can find documentation for this module with the perldoc command.
You can also look for information at:
RT: CPAN's request tracker
AnnoCPAN: Annotated CPAN documentation
The .dtd and .ent files included in this distrubution are covered by Copyright. See the individual files for the notice. The may be distributed unmodified. See http://www.w3.org/Consortium/Legal/2002/copyright-documents-20021231 for the exact terms.
For everything else, the following applies:
No rights reserved.
The author has dedicated the work to the Commons by waiving all of his or her rights to the work worldwide under copyright law and all related or neighboring legal rights he or she had in the work, to the extent allowable by law.
Works under CC0 do not require attribution. When citing the work, you should not imply endorsement by the author.
Feel free to use this module as a model for your own XML::Catalogs::* module.