Chris Dolan > CAM-PDF-1.60 > getpdftext.pl

NAME

getpdftext.pl - Extracts and prints the text from one or more PDF pages

SYNOPSIS

 getpdftext.pl [options] infile.pdf [<pagenums>]

 Options:
   -c --check          just validates the page instead of printing it
   -g --geometry       just computes geometry, prints nothing
   -v --verbose        print diagnostic messages
   -h --help           verbose help message
   -V --version        print CAM::PDF version

 <pagenums> is a comma-separated list of page numbers.
      Ranges like '2-6' allowed in the list
      Example: 4-6,2,12,8-9

DESCRIPTION

Extracts all of the text from the specified PDF page(s) and prints it to STDOUT. If no pages are specified, all pages are processed.
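
To illustrate the <pagenums> syntax, here is a minimal sketch in Python of how a list such as '4-6,2,12,8-9' could be expanded into individual page numbers. It is not the script's actual implementation: the function name expand_page_list is hypothetical, and keeping the pages in the order given, keeping duplicates, and rejecting out-of-range pages are all assumptions that this page does not confirm.

```python
def expand_page_list(spec, num_pages):
    """Expand a page list like '4-6,2,12,8-9' into a list of page numbers.

    Illustrative only. Order is preserved and duplicates are kept; both
    choices are assumptions, not documented behavior of getpdftext.pl.
    """
    if not spec:
        # No page list given: fall back to every page in the document.
        return list(range(1, num_pages + 1))

    pages = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            # A range such as '2-6' covers both endpoints.
            first, last = (int(n) for n in part.split("-", 1))
        else:
            first = last = int(part)
        if first > last:
            raise ValueError(f"descending range: {part!r}")
        if first < 1 or last > num_pages:
            # Assumption: out-of-range pages are treated as an error.
            raise ValueError(f"page out of range: {part!r}")
        pages.extend(range(first, last + 1))
    return pages


print(expand_page_list("4-6,2,12,8-9", num_pages=12))
# [4, 5, 6, 2, 12, 8, 9]
```

With an empty page list the sketch returns every page, mirroring the default described above.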

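The --check and --geometry modes behave differently from normal extraction: --check validates each page instead of printing its text, and --geometry only computes the page geometry and prints nothing. Both are used primarily for debugging.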

SEE ALSO

CAM::PDF

renderpdf.pl

AUTHOR

See CAM::PDF
