Jean Tavernier > HTML-Content-Extractor-0.01

HTML-Content-Extractor-0.01

This Release HTML-Content-Extractor-0.01  21 Aug 2005
Latest Release HTML-Content-Extractor-0.17  03 Nov 2013
CPAN Testers PASS (2)   FAIL (39)   NA (60)   UNKNOWN (3)
License Unknown
Special Files
Changes
Makefile.PL
MANIFEST
README

Modules

HTML::Content::ContentExtractor       Perl module for extracting content from HTML documents.  0.01
HTML::Content::HTMLTokenizer          Perl module to tokenize HTML documents.
HTML::Content::TokeParserTokenizer
HTML::WordTagRatio::ExponentialRatio  Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::NormalizedRatio   Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::Ratio             Default module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::RelativeRatio     Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::SmoothedRatio     Default module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::WeightedRatio     Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.

Documentation

ContentExtractorDriver.pl Driver for HTML Content Extractor 

Other Files

test.htm