The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Catmandu::Importer::XML - Import serialized XML documents

DESCRIPTION

This Catmandu::Importer reads XML and transforms it into a data structure.

See Catmandu::Importer, Catmandu::Iterable, Catmandu::Logger and Catmandu::Fixable for methods and options derived from these modules.

The importer can also be used internally for custom importers that need to parse XML data.

CONFIGURATION

type

By default (type "simple"), elements and attributes and converted to keys in a key-value structure. For instance this document:

    <doc attr="value">
      <field1>foo</field1>
      <field1>bar</field1>
      <bar>
        <doz>baz</doz>
      </bar>
    </doc>
     

is imported as

    {
        attr => 'value',
        field1 => [ 'foo', 'bar' ],
        field2 => { 'doz' => 'baz' },
    }

With type "ordered" elements are preserved in the order of their appereance. For instance the sample document above is imported as:

    [ 
        doc => { attr => "value" }, [
            [ field1 => { }, ["foo"] ],
            [ field1 => { },  ["bar"] ],
            [ field2 => { }, [ [ doz => { }, ["baz"] ] ] ]
        ]
    ] 
depth

Maximum depth for type "simple". For instance with depth 1, the sample document above would be imported as:

    {
        attr => 'value',
        field1 => [ 'foo', 'bar' ],
        field2 => { 
            doz => [ [ doz => { }, ["baz"] ] ]
        }
    }
attributes

Include XML attributes. Enabled by default.

path

Path expression to select XML elements. If not set the root element is selected.

root

Include root element name, if enabled. Disabled by default, unless the path option is set.

ns

Set to strip for stripping namespace prefixes and xmlns-attributes.

whitespace

Include ignoreable whitespace. Disabled by default.

transform

Optional (list of) XSLT stylesheets to process records. This option is not fully implemented yet!

SEE ALSO

This module is just a thin layer on top of XML::Struct::Reader. Have a look at XML::Struct to implement Importers and Exporters for more specific XML-based data formats.