The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

Documentation

main crawling machine in the Combine system
controls a Combine crawling job
export records in XML from Combine database
Initializations of MySQL and config directories
starts, monitors and restarts a combine harvesting process
various operations on the Combine database

Modules

HTML parser in combine package
TeX parser in combine package
class for interfacing to various web-index format translators
Normalise and validate URIs for harvesting

Provides

in Combine/Check_record.pm
in Combine/CleanXML2CanDoc.pm
in Combine/Config.pm
in Combine/DataBase.pm
in Combine/FromImage.pm
in Combine/LogSQL.pm
in Combine/MySQLhdb.pm
in Combine/PosCheck_record.pm
in Combine/UA.pm
in Combine/XWI2XML.pm
in Combine/Zebra.pm
in PlugIns/MPCA/PosCheck_MPCA_record.pm
Saa
in PlugIns/MPCA/Saa.pm
in PlugIns/MPCA/Tana.pm
in PlugIns/MPCA/classifyMPCA.pm
in templates/classifyPlugInTemplate.pm