SWISH::Filters::Pdf2HTML - Perl extension for filtering PDF documents with Swish-e
This is a plug-in module that uses the xpdf package to convert PDF documents to HTML for indexing by Swish-e. Any info tags found in the PDF document are emitted as HTML meta tags.
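To illustrate the mapping from info tags to meta tags, here is a minimal sketch. It is not the module's actual code. It assumes the info tags arrive as "Key: value" lines, which is the format the xpdf pdfinfo tool prints. The helper names (escape_html, parse_info, info_to_meta) and the sample input are hypothetical.

```perl
use strict;
use warnings;

# Escape the characters that are unsafe inside HTML attribute values.
sub escape_html {
    my ($text) = @_;
    my %entity = ( '&' => '&amp;', '<' => '&lt;', '>' => '&gt;', '"' => '&quot;' );
    $text =~ s/([&<>"])/$entity{$1}/g;
    return $text;
}

# Turn "Key: value" lines into a hash with lowercased keys.
# Only the first colon separates key from value, so values may contain colons.
sub parse_info {
    my ($raw) = @_;
    my %info;
    for my $line ( split /\n/, $raw ) {
        next unless $line =~ /^\s*([^:]+?)\s*:\s*(.*?)\s*$/;
        $info{ lc $1 } = $2;
    }
    return \%info;
}

# Build one <meta> tag per info entry, sorted by name for stable output.
sub info_to_meta {
    my ($info) = @_;
    return join "\n",
        map { sprintf '<meta name="%s" content="%s">', escape_html($_), escape_html( $info->{$_} ) }
        sort keys %$info;
}

# Made-up sample input, not taken from a real document.
my $sample = "Title:  Example Report\nAuthor: A. Writer\n";
print info_to_meta( parse_info($sample) ), "\n";
```

Lowercasing the keys is a choice made for this sketch; the real filter may keep the original case.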
This filter plug-in requires the xpdf package available at:
http://www.foolabs.com/xpdf/
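Before enabling the filter, it can help to confirm that the xpdf command-line tools are installed. The sketch below looks for pdftotext and pdfinfo, two programs shipped with xpdf, on the current PATH. That these are the exact programs the filter invokes is an assumption, and find_program is a hypothetical helper written for Unix-like systems.

```perl
use strict;
use warnings;
use File::Spec;

# Return the full path of the first executable file called $name
# found in a PATH directory, or undef if there is none.
sub find_program {
    my ($name) = @_;
    for my $dir ( File::Spec->path ) {
        my $candidate = File::Spec->catfile( $dir, $name );
        return $candidate if -f $candidate && -x _;
    }
    return;
}

# Assumption: these are the xpdf tools the filter relies on.
for my $program (qw(pdftotext pdfinfo)) {
    my $path   = find_program($program);
    my $status = defined($path) ? $path : 'not found on PATH';
    print "$program: $status\n";
}
```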
You may pass a tag to use as the HTML <title>, if found in the PDF info tags, through the user_data argument of SWISH::Filter's filter method:
    my %user_data;
    $user_data{pdf}{title_tag} = 'title';

    $was_filtered = $filter->filter(
        document  => $filename,
        user_data => \%user_data,
    );
If the PDF has an info tag named "title", its value is used as the HTML <title>. If no tag is passed, "title" is used as the default tag.
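The fallback rule can be sketched as follows. This is an illustration of the behavior described here, not the module's implementation. choose_title is a hypothetical helper, the info hash holds made-up values, and the sketch assumes info tag names are stored as lowercase hash keys.

```perl
use strict;
use warnings;

# Pick the info entry named by $user_data->{pdf}{title_tag},
# falling back to "title" when the caller passed no tag.
sub choose_title {
    my ( $info, $user_data ) = @_;
    my $tag = 'title';
    if (   ref($user_data) eq 'HASH'
        && ref( $user_data->{pdf} ) eq 'HASH'
        && defined $user_data->{pdf}{title_tag} )
    {
        $tag = $user_data->{pdf}{title_tag};
    }
    return $info->{$tag};    # undef when the document has no such tag
}

# Made-up info tags for illustration.
my %info = ( title => 'Example Report', subject => 'Quarterly numbers' );

my $default = choose_title( \%info, undef );
my $custom  = choose_title( \%info, { pdf => { title_tag => 'subject' } } );
print "default: $default\n";
print "custom:  $custom\n";
```

Checking each level of the hash before reading it avoids autovivifying a pdf entry in the caller's user data.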
Author: Bill Moseley
See also: SWISH::Filter