Catmandu::Importer::PDFPages - Catmandu importer that extracts text data per page from a single PDF
    # From the command line

    # Export PDF pages with their text and coördinates
    $ catmandu convert PDFPages --file input.pdf to YAML

    # In a script
    use Catmandu::Sane;
    use Catmandu::Importer::PDFPages;

    my $importer = Catmandu::Importer::PDFPages->new( file => "/tmp/input.pdf" );

    $importer->each(sub {
        my $page = $_[0];
        #..
    });
    - label: Cover Page
      height: 878
      width: 595
      text: "Hello world"
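As a rough illustration of what the callback passed to `each` might do with one page, here is a minimal sketch that works on a record shaped like the sample above. It uses a hard-coded hash instead of a real PDF so that it runs without Poppler installed. The helper `summarize_page` is hypothetical and not part of Catmandu::Importer::PDFPages, and the sketch assumes that `text` is a plain string, as in the sample.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of Catmandu::Importer::PDFPages:
# summarise one page record shaped like the sample output.
sub summarize_page {
    my ($page) = @_;

    # Assumption: "text" is a plain string, as in the sample record.
    my $text  = defined $page->{text} ? $page->{text} : '';
    my @words = split ' ', $text;

    return {
        label       => $page->{label},
        orientation => $page->{width} > $page->{height} ? 'landscape' : 'portrait',
        words       => scalar @words,
    };
}

# Stand-in for a record delivered to the each() callback;
# the values are copied from the sample output.
my $page = {
    label  => 'Cover Page',
    height => 878,
    width  => 595,
    text   => 'Hello world',
};

my $summary = summarize_page($page);
printf "%s: %s, %d word(s)\n", @{$summary}{qw(label orientation words)};
```

In a real script the same helper could be called from inside the `each` callback, with `$_[0]` as its argument.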
Nicolas Franck <nicolas.franck at ugent.be>
Catmandu, Catmandu::Importer, Poppler
To install Catmandu::Importer::PDF, copy and paste the appropriate command into your terminal.

cpanm

    cpanm Catmandu::Importer::PDF

CPAN shell

    perl -MCPAN -e shell
    install Catmandu::Importer::PDF