Benjamin Habegger
and 1 contributors

NAME

WebSource::Extract - Extract parts of the input

DESCRIPTION

An Extract operator allows to extract sub parts of its input. There exists different flavors of such an operator. The main one consists in querying the input using an XPath expression.

Such an operator is described by a DOM Node having the following form :

<ws:extract name="opname" forward-to="ops"> <path>//an/xpath/expression</path> </ws:extract>

The operator queries any input with the expression found in the path sub-element an returns the found results.

To use a different flavor of the Extract operator (for example xslt) it is necessary to add a type attribut to the ws:extract element. The parameters (sub-elements of ws:extract) depend on the type of operator used.

Each flavor of the Extract operator is implemented by a perl module named WebSource::Extract::flavor (eg. WebSource::Extract::xslt). See the corresponding man page for a full description.

Current existing flavors include :

xslt : apply an XSL stylesheet to the input
form : extract form data
regexep : extract data using a regular expression

SYNOPSIS

$exop = WebSource::Extract->new(wsdnode => $desc);

METHODS

See WebSource::Module

SEE ALSO

WebSource, WebSource::Extract::xslt, WebSource::Extract::form, WebSource::Extract::regexp