NAME

Plucene::SearchEngine::Index::PPT - a Plucene backend for indexing Microsoft Powerpoint presentations

VERSION

version 0.001

DESCRIPTION

This backend analyzes a PPT file. The module uses the ppthtml tool, which is provided by the xlhtml package, available from http://chicago.sourceforge.net/xlhtml/ or through your operating system's package manager.

This code is not currently actively maintained.
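
Because ppthtml is an external program rather than a Perl dependency, it can be useful to check that it is installed before indexing. The following is a minimal sketch of such a check; find_tool is a hypothetical helper written for this illustration and is not part of the module.

```perl
use strict;
use warnings;
use File::Spec;

# Hypothetical helper: return the full path of the first executable
# named $tool found on PATH, or undef if there is none.
sub find_tool {
    my ($tool) = @_;
    for my $dir (File::Spec->path) {
        my $candidate = File::Spec->catfile($dir, $tool);
        # -f: regular file; -x _: executable, reusing the previous stat
        return $candidate if -f $candidate && -x _;
    }
    return;
}
```

A caller might write find_tool('ppthtml') or die "ppthtml not found; install xlhtml\n" before creating the indexer.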

The backend produces the following Plucene fields:

text

The text part of the PPT

link

A list of links in the HTML that ppthtml generates from the PPT

Additionally, any META tags are turned into Plucene fields.
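
To make the field list above concrete, here is a deliberately naive sketch of how text, link and META-derived fields could be pulled out of a piece of HTML. It uses plain regular expressions so that it stays self-contained; extract_fields is a hypothetical helper, and nothing here is claimed to match how the module or its HTML parent class actually parses documents.

```perl
use strict;
use warnings;

# Hypothetical helper: map field names to array refs of values.
sub extract_fields {
    my ($html) = @_;
    my %fields;

    # META tags: <meta name="X" content="Y"> becomes field "x" => "Y".
    # Only this attribute order and double quotes are handled.
    while ($html =~ /<meta\s+name="([^"]+)"\s+content="([^"]*)"/gi) {
        push @{ $fields{ lc $1 } }, $2;
    }

    # link: every double-quoted href on an anchor tag.
    while ($html =~ /<a\s[^>]*href="([^"]+)"/gi) {
        push @{ $fields{link} }, $1;
    }

    # text: replace tags with spaces, collapse whitespace, trim the ends.
    (my $text = $html) =~ s/<[^>]+>/ /g;
    $text =~ s/\s+/ /g;
    $text =~ s/^ | $//g;
    $fields{text} = [$text];

    return \%fields;
}
```

A real implementation would use a proper HTML parser; regular expressions like these break on unusual attribute order, single quotes, comments and scripts.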

METHODS

gather_data_from_file

Overrides the method from Plucene::SearchEngine::Index::HTML to provide PPT parsing.
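
The override pattern can be sketched with stand-in classes. Everything in the Sketch:: namespace below is hypothetical: the parent only mimics the role of Plucene::SearchEngine::Index::HTML, the method signature (object plus file name) is an assumption, and so is the idea that ppthtml prints its HTML on standard output and that a temporary file is used in between.

```perl
use strict;
use warnings;

# Stand-in for the HTML backend (hypothetical, not the real class).
package Sketch::Index::HTML {
    sub new { return bless { html => undef }, shift }

    # Parent behaviour: read an HTML file and keep its contents.
    sub gather_data_from_file {
        my ($self, $filename) = @_;
        open(my $fh, '<', $filename) or die "Cannot read $filename: $!";
        local $/;    # slurp the whole file
        $self->{html} = <$fh>;
        close $fh;
        return $self;
    }
}

package Sketch::Index::PPT {
    use File::Temp qw(tempfile);
    our @ISA = ('Sketch::Index::HTML');

    # Conversion command; assumed to write HTML to standard output.
    our @CONVERTER = ('ppthtml');

    # Override: convert the PPT to a temporary HTML file, then hand
    # that file to the parent class for parsing.
    sub gather_data_from_file {
        my ($self, $filename) = @_;
        open(my $in, '-|', @CONVERTER, $filename)
            or die "Cannot run $CONVERTER[0]: $!";
        my ($out, $tmp) = tempfile(SUFFIX => '.html', UNLINK => 1);
        print {$out} $_ while <$in>;
        close $in  or die "$CONVERTER[0] failed on $filename";
        close $out or die "Cannot write $tmp: $!";
        return $self->SUPER::gather_data_from_file($tmp);
    }
}
```

With ppthtml installed, Sketch::Index::PPT->new->gather_data_from_file('slides.ppt') would leave the converted HTML in the object's html slot. The point of the design is that the subclass only handles conversion and leaves all HTML parsing to the parent.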

AVAILABILITY

The latest version of this module is available from the Comprehensive Perl Archive Network (CPAN). Visit http://www.perl.com/CPAN/ to find a CPAN site near you, or see https://metacpan.org/module/Plucene::SearchEngine::Index::MSOffice/.

SOURCE

The development version is on GitHub at http://github.com/doherty/Plucene-SearchEngine-Index-MSOffice and may be cloned from git://github.com/doherty/Plucene-SearchEngine-Index-MSOffice.git

BUGS AND LIMITATIONS

You can make new bug reports, and view existing ones, through the web interface at https://github.com/doherty/Plucene-SearchEngine-Index-MSOffice/issues.

AUTHORS

COPYRIGHT AND LICENSE

This software is copyright (c) 2012 by Sopan Shewale <sopan.shewale@gmail.com>.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.
