The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

App::Zapzi::Transformers::HTML - process HTML without doing readability transforms

VERSION

version 0.014

DESCRIPTION

This class takes HTML and returns the body without doing additional readable transforms - so tags such as script are removed but no text should be changed. Use this if HTMLExtractMain does not provide the desired results.

METHODS

name

Name of transformer visible to user.

handles($content_type)

Returns true if this module handles the given content-type

transform(input)

Converts input to readable text. Returns true if converted OK.

AUTHOR

Rupert Lane <rupert@rupert-lane.org>

COPYRIGHT AND LICENSE

This software is copyright (c) 2014 by Rupert Lane.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.