The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Using PerlSAX

Working with PerlSAX involves using two classes (packages), a PerlSAX parser that generates parsing events and a class that you write that will receive those parsing events, the ``handler''. This guide will use the XML::Parser::PerlSAX parser that uses Clark Cooper's XML::Parser module.

The handler class implements the PerlSAX handler methods that you are interested in. The following example, MyHandler.pm, prints a message every time an element starts or ends:

    package MyHandler;

    sub new {
        my ($type) = @_;
        return bless {}, $type;
    }

    sub start_element {
        my ($self, $element) = @_;

        print "Start element: $element->{Name}\n";
    }

    sub end_element {
        my ($self, $element) = @_;

        print "End element: $element->{Name}\n";
    }

    1;

To use your handler you will need to have a script, myhandler.pl, that loads and creates your handler and the parser, and then calls the parser to parse the XML instance and send events to your handler:

    use XML::Parser::PerlSAX;
    use MyHandler;

    my $my_handler = MyHandler->new;
    my $parser = XML::Parser::PerlSAX->new( Handler => $my_handler );

    foreach my $instance (@ARGV) {
        $parser->parse(Source => { SystemId => $instance });
    }

Given this XML instance, myhandler.xml:

    <?xml version="1.0"?>

    <article>
    <title>Using PerlSAX</title>
    <paragraph>Working with PerlSAX ...</paragraph>
    </article>

Running myhandler.pl like this:

    perl myhandler.pl myhandler.xml

will produce this output:

    Start element: article
    Start element: title
    End element: title
    Start element: paragraph
    End element: paragraph
    End element: article

For More Information

PerlSAX.pod describes the PerlSAX interface. Each parser module describes it's individual capabilities. XML::Parser::PerlSAX is the most commonly used PerlSAX implementation.

The files described in this doc are in the `examples' directory. A more complete implementation of the very simple handler above is in the module XML::Handler::Sample. Other, more complex handlers are in the XML::Handler directory as well.

Another hands-on doc for PerlSAX is the XML-Parser-and-PerlSAX.pod. This doc describes the difference between and the purpose of PerlSAX with respect to XML::Parser.

This document was inspired by and uses the code examples from David Megginson's ``Quick Start for SAX Application Writers.'' <http://www.megginson.com/SAX/quickstart.html>