Plucene::SearchEngine::Index::RSS - Index RSS files
my @articles = Plucene::SearchEngine::Index::URL->( "http://planet.perl.org/rss10.xml" ); $indexer->index($_->document) for @articles;
This examines RSS files and creates document hashes for individual items in the feed. The objects have the following Plucene fields:
The date that this article was published.
The creator, if one was specified.
The name of the feed from which this was taken.
The URL that the article links to, and the URL of the feed.
The text of the article.
The title of the article.
Since Plucene::SearchEngine::Index uses MIME types to determine the type of a file, this module doesn't work particularly well using the File frontend. It works OK with the URL frontend if the webserver sends the right content type header. If not, you may have to fudge it by registering your own handlers:
Plucene::SearchEngine::Index
File
URL
Plucene::SearchEngine::Index::RSS->register_handler("text/xml"); # For instance
Plucene::SearchEngine::Index.
Simon Cozens, <simon@cpan.org>
Copyright (C) 2004 by Simon Cozens
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
To install Plucene::SearchEngine::Index::RSS, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Plucene::SearchEngine::Index::RSS
CPAN shell
perl -MCPAN -e shell install Plucene::SearchEngine::Index::RSS
For more information on module installation, please visit the detailed CPAN module installation guide.