htmlclean - Html syntax reformatter and cleaner filter for human beings.
Argument is stdin, url, or file on disk (that has html). Outputs cleaned html to stdout. This is basically something to get rid of messy html, and same some of what might be of use.
-i do not rip out img and link tags -o path output file destination (instead of stdout) -h help -d debug -v version
htmlclean ./infile.html > outfile.html htmlclean http://thisthat.com/page.html > outfile.html cat file.html | htmlclean > outfile.html htmlclean -o output.html http://news.bbc.co.uk
You can provide multiple inputs at the same time, but it would be messy.
HTML::Clean::Human - parent package
Leo Charre leocharre at cpan dot org
Copyright (c) 2010 Leo Charre. All rights reserved.
This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, i.e., under the terms of the "Artistic License" or the "GNU General Public License".
This package is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
See the "GNU General Public License" for more details.