The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

Documentation

ocr
read an image file and turn into text
get text content of pdf document images within
get text from pdf and resort to ocr as needed

Modules

read an image with tesseract and get output
get images from pdf document
get ocr and images out of a pdf file
extract text fom pdf document resorting to ocr as needed
save ocr to text file for easy retrieval