The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

WWW::FetchStory::Fetcher::Owl - fetching module for WWW::FetchStory

VERSION

version 0.2307

DESCRIPTION

This is the Owl story-fetching plugin for WWW::FetchStory.

info

Information about the fetcher.

$info = $self->info();

priority

The priority of this fetcher. Fetchers with higher priority get tried first. This is useful where there may be a generic fetcher for a particular site, and then a more specialized fetcher for particular sections of a site. For example, there may be a generic Owl fetcher, and then refinements for particular Owl community, such as the sshg_exchange community. This works as either a class function or a method.

This must be overridden by the specific fetcher class.

$priority = $self->priority();

$priority = WWW::FetchStory::Fetcher::priority($class);

allow

If this fetcher can be used for the given URL, then this returns true. This must be overridden by the specific fetcher class.

    if ($obj->allow($url))
    {
        ....
    }

Private Methods

extract_story

Extract the story-content from the fetched content.

    my ($story, $title) = $self->extract_story(content=>$content,
        title=>$title);

parse_toc

Parse the table-of-contents file.

    %info = $self->parse_toc(content=>$content,
                         url=>$url,
                         urls=>\@urls);

This should return a hash containing:

chapters

An array of URLs for the chapters of the story. In the case where the story only takes one page, that will be the chapter. In the case where multiple URLs have been passed in, it will be those URLs.

title

The title of the story.

It may also return additional information, such as Summary.

parse_chapter_urls

Figure out the URLs for the chapters of this story.

parse_title

Get the title from the content

parse_author

Get the author from the content

parse_summary

Get the summary from the content