Catmandu::HTML - Modules for handling HTML data within the Catmandu framework
Command line client catmandu:
catmandu
catmandu convert HTML to JSON < ex/test.html > data/test.json catmandu convert JSON to HTML < data/test.json > ex/test.html # Keep the text curl -s http://librecat.org | catmandu convert HTML to Text --fix "html_text()" # Filter out the anchors curl -s http://librecat.org | catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")" # Filter out the meta tags as YAML curl -s http://librecat.org | catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"
See documentation of modules for more examples.
Serialize HTML data
Parse HTML data
Filter HTML by name and optional group them
Filter HTML by type
Select only the literal text values from the HTML
This module is based on Catmandu
patrick.hochstenbach at ugent.be
The Perl software is copyright (c) 2018 by Patrick Hochstenbach. This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.
To install Catmandu::HTML, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Catmandu::HTML
CPAN shell
perl -MCPAN -e shell install Catmandu::HTML
For more information on module installation, please visit the detailed CPAN module installation guide.