Author: Jean Tavernier

HTML-Content-Extractor-0.01

This Release: HTML-Content-Extractor-0.01 (21 Aug 2005)
Reported Bugs: 1
CPAN Testers: PASS (2), FAIL (4), NA (5)
Rating: 0 reviews
License: Unknown
Special Files
Changes
MANIFEST
Makefile.PL
README

Modules

HTML::Content::ContentExtractor       Perl module for extracting content from HTML documents.  0.01
HTML::Content::HTMLTokenizer          Perl module to tokenize HTML documents.
HTML::Content::TokeParserTokenizer
HTML::WordTagRatio::ExponentialRatio  Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::NormalizedRatio   Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::Ratio             Default module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::RelativeRatio     Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::SmoothedRatio     Default module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::WeightedRatio     Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
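
The descriptions above mention a ratio of words to tags over a range of tokens. As a rough illustration of that idea only, the Python sketch below flattens a document into TAG and WORD tokens and scores a token range. It does not use or reproduce the API of the Perl modules listed here. The names `TagWordTokenizer`, `tokenize`, and `word_tag_ratio` are hypothetical, and adding 1 to the tag count (to avoid division by zero) is an assumption of this sketch, not something the listing specifies.

```python
from html.parser import HTMLParser


class TagWordTokenizer(HTMLParser):
    """Hypothetical helper: flattens HTML into 'TAG' and 'WORD' tokens."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.tokens = []

    def handle_starttag(self, tag, attrs):
        self.tokens.append("TAG")

    def handle_endtag(self, tag):
        self.tokens.append("TAG")

    def handle_startendtag(self, tag, attrs):
        # Count a self-closing tag such as <br/> once, not twice.
        self.tokens.append("TAG")

    def handle_data(self, data):
        # Every whitespace-separated chunk of text counts as one word.
        self.tokens.extend("WORD" for _ in data.split())


def tokenize(html):
    """Return the TAG/WORD token sequence for an HTML string."""
    parser = TagWordTokenizer()
    parser.feed(html)
    parser.close()
    return parser.tokens


def word_tag_ratio(tokens, start, end):
    """Words per tag in tokens[start:end].

    Assumption of this sketch: the tag count is incremented by 1 so that
    a tag-free range does not divide by zero.
    """
    window = tokens[start:end]
    words = window.count("WORD")
    tags = window.count("TAG")
    return words / (tags + 1)


if __name__ == "__main__":
    page = "<div><a href='#'>Home</a></div><p>The quick brown fox jumps</p>"
    tokens = tokenize(page)
    print(word_tag_ratio(tokens, 0, 5))            # navigation-like range
    print(word_tag_ratio(tokens, 5, len(tokens)))  # paragraph range
```

In this sketch a range dominated by markup (such as a navigation bar) scores low, while a range of running text scores high, which is the kind of signal a content extractor could use to pick the main body of a page.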

Documentation

ContentExtractorDriver.pl             Driver for HTML Content Extractor

Other Files

test.htm