Plucene::SearchEngine::Index::RSS - Index RSS files
    my @articles = Plucene::SearchEngine::Index::URL->examine(
        "http://planet.perl.org/rss10.xml"
    );
    $indexer->index($_->document) for @articles;
This examines RSS files and creates document hashes for individual items in the feed. The objects have the following Plucene fields:
The date that this article was published.
The creator, if one was specified.
The name of the feed from which this was taken.
The URL that the article links to, and the URL of the feed.
The text of the article.
The title of the article.
Because Plucene::SearchEngine::Index uses MIME types to determine the type of a file, this module doesn't work particularly well with the File frontend. It works OK with the URL frontend if the web server sends the right Content-Type header. If not, you may have to fudge it by registering your own handlers:
    Plucene::SearchEngine::Index::RSS->register_handler("text/xml"); # For instance
Simon Cozens, <email@example.com>
Copyright (C) 2004 by Simon Cozens
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.