Module Version: 0.18

NAME

SWISH::Filters::xls2txt - convert Excel docs to text using xls2csv

DESCRIPTION

This is a plug-in module that uses the xls2csv program to convert MS Excel documents to text for indexing by Swish-e. xls2csv is part of the catdoc package and can be downloaded from:

    http://www.45.free.net/~vitus/software/catdoc/

The program xls2csv must be installed and in your PATH.
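A quick way to confirm that requirement before indexing is to search PATH for the executable. The following is a minimal sketch using only core modules; find_in_path is a hypothetical helper written for this illustration and is not part of SWISH::Filter.

```perl
use strict;
use warnings;
use File::Spec;

# Hypothetical helper (not part of SWISH::Filter): return the full path of
# the first executable file named $prog found in PATH, or undef if none.
sub find_in_path {
    my ($prog) = @_;
    for my $dir ( File::Spec->path ) {
        my $candidate = File::Spec->catfile( $dir, $prog );
        return $candidate if -f $candidate && -x _;
    }
    return;
}

my $xls2csv = find_in_path('xls2csv');
print defined $xls2csv
    ? "xls2csv found at $xls2csv\n"
    : "xls2csv not found in PATH; install the catdoc package\n";
```

If the program is reported as missing, install catdoc or add its directory to PATH before running the indexer.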

BUGS

This filter does not specify input or output character encodings.

A minor optimization during spidering (i.e., when docs are in memory instead of on disk) would be to use an open2() call to let catdoc read from stdin instead of from a file.
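The open2() idea can be sketched roughly as follows, using the core IPC::Open2 module. filter_via_stdin is a hypothetical helper, not code from this filter. Whether xls2csv itself accepts a document on stdin is an assumption that should be checked against the installed catdoc version, so the demonstration uses cat as a stand-in command.

```perl
use strict;
use warnings;
use IPC::Open2;

# Hypothetical helper: send an in-memory document to an external command's
# stdin and return everything the command writes to stdout.
sub filter_via_stdin {
    my ( $doc, @cmd ) = @_;

    my $pid = open2( my $from_child, my $to_child, @cmd );
    binmode $to_child;      # Excel files are binary data
    print {$to_child} $doc;
    close $to_child;        # signal EOF so the child can finish

    local $/;               # slurp mode: read all output at once
    my $text = <$from_child>;
    close $from_child;
    waitpid $pid, 0;        # reap the child process
    return $text;
}

# Stand-in demonstration with cat, which copies stdin to stdout.
print filter_via_stdin( "a,b,c\n", 'cat' );
```

Because this sketch writes the whole document before reading any output, it can block if the child fills its output pipe first. It is suitable only for small documents; larger ones would need interleaved reads and writes.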

AUTHOR

Peter Karman <perl@peknet.com>

SEE ALSO

SWISH::Filter
