The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Search results for "module:Text::NSP"

Text::NSP - Extract collocations and Ngrams from text River stage one • 2 direct dependents • 3 total dependents

The Ngram Statistics Package (NSP) is a collection of perl modules that aid in analyzing Ngrams in text files. We define an Ngram as a sequence of 'n' tokens that occur within a window of at least 'n' tokens in the text; what constitutes a "token" ca...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures - Perl modules for computing association scores of Ngrams. This module provides the basic framework for these measures. River stage one • 2 direct dependents • 3 total dependents

Introduction These modules provide perl implementations of mathematical functions (association measures) that can be used to interpret the co-occurrence frequency data for Ngrams. We define an Ngram as a sequence of 'n' tokens that occur within a win...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D - Perl module that provides basic framework for building measure of association for bigrams. River stage one • 2 direct dependents • 3 total dependents

This module is to be used as a foundation for building 2-dimensional measures of association. The methods in this module retrieve observed bigram frequency counts, marginal totals, and also compute expected values. They also provide error checks for ...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::4D - Perl module that provides basic framework for building measure of association for 4-grams. River stage one • 2 direct dependents • 3 total dependents

The methods in this module retrieve observed 4-gram frequency counts and marginal totals, and also compute expected values. They also provide support for error checking of the output produced by count.pl. These methods are used in all the 4-gram (4d)...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D - Perl module that provides basic framework for building measure of association for trigrams. River stage one • 2 direct dependents • 3 total dependents

The methods in this module retrieve observed trigram frequency counts and marginal totals, and also compute expected values. They also provide support for error checking of the output produced by count.pl. These methods are used in all the trigram (3...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::4D::MI - Perl module that provides error checks and framework to implement Loglikelihood for 4-grams. River stage one • 2 direct dependents • 3 total dependents

This module is the base class for the Loglikelihood and the True Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computations that are common to these m...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D::MI - Perl module that provides error checks and framework to implement Loglikelihood, Total Mutual Information, Pointwise Mutual Information and Poisson Stirling Measure for trigrams. River stage one • 2 direct dependents • 3 total dependents

This module is the base class for the Loglikelihood and the True Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computations that are common to these m...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::MI - Perl module that provides error checks for Loglikelihood, Total Mutual Information, Pointwise Mutual Information and Poisson-Stirling Measure. River stage one • 2 direct dependents • 3 total dependents

This module is the base class for the Loglikelihood, Total Mutual Information and the Pointwise Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computat...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::CHI - Perl module that provides error checks for the Pearson's chi squared, phi coefficient and the Tscore measures. River stage one • 2 direct dependents • 3 total dependents

This module is the base class for the Chi-squared and Phi coefficient measures. This module provides error checks specific for these measures, it also implements the computations that are common to these measures. Pearson's Chi-Squared x2 = 2 * [((n1...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::odds - Perl module to compute the Odds ratio for bigrams. River stage one • 2 direct dependents • 3 total dependents

Assume that the frequency count data associated with a bigram <word1><word2> is stored in a 2x2 contingency table: word2 ~word2 word1 n11 n12 | n1p ~word1 n21 n22 | n2p -------------- np1 np2 npp where n11 is the number of times <word1><word2> occur ...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::Dice - Perl module that provides the framework to implement the Dice and Jaccard coefficients. River stage one • 2 direct dependents • 3 total dependents

Assume that the frequency count data associated with a bigram <word1><word2> is stored in a 2x2 contingency table: word2 ~word2 word1 n11 n12 | n1p ~word1 n21 n22 | n2p -------------- np1 np2 npp where n11 is the number of times <word1><word2> occur ...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::Fisher - Perl module that provides methods to compute the Fishers exact tests. River stage one • 2 direct dependents • 3 total dependents

Assume that the frequency count data associated with a bigram <word1><word2> is stored in a 2x2 contingency table: word2 ~word2 word1 n11 n12 | n1p ~word1 n21 n22 | n2p -------------- np1 np2 npp where n11 is the number of times <word1><word2> occur ...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::Fisher2 - Perl module that provides methods to compute the Fishers exact tests. River stage one • 2 direct dependents • 3 total dependents

This module provides a framework for the naive implementation of the fishers exact tests. That is the implementation does not have any optimizations for performance. This will compute the factorials for the hypergeometric probabilities using direct m...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D::MI::ll - Perl module that implements Loglikelihood measure of association for trigrams. River stage one • 2 direct dependents • 3 total dependents

The log-likelihood ratio measures the devitation between the observed data and what would be expected if <word1>, <word2> and <word3> were independent. The higher the score, the less evidence there is in favor of concluding that the words are indepen...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::MI::ps - Perl module that implements Poisson-Stirling measure of association for bigrams. River stage one • 2 direct dependents • 3 total dependents

The log-likelihood ratio measures the deviation between the observed data and what would be expected if <word1> and <word2> were independent. The higher the score, the less evidence there is in favor of concluding that the words are independent. Assu...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D::MI::ps - Perl module that implements Poisson Stirling Measure for trigrams. River stage one • 2 direct dependents • 3 total dependents

The log-likelihood ratio measures the devitation between the observed data and what would be expected if <word1>, <word2> and <word3> were independent. The higher the score, the less evidence there is in favor of concluding that the words are indepen...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::4D::MI::ll - Perl module that implements Loglikelihood measure of association for 4-grams. River stage one • 2 direct dependents • 3 total dependents

The log-likelihood ratio measures the devitation between the observed data and what would be expected if <word1>, <word2>, <word3> and <word4> were independent.The higher the score, the less evidence there is in favor of concluding thatthe words are ...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::MI::ll - Perl module that implements Loglikelihood measure of association for bigrams. River stage one • 2 direct dependents • 3 total dependents

The log-likelihood ratio measures the deviation between the observed data and what would be expected if <word1> and <word2> were independent. The higher the score, the less evidence there is in favor of concluding that the words are independent. Assu...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D::MI::tmi - Perl implementation for True Mutual Information for trigrams. River stage one • 2 direct dependents • 3 total dependents

True Mutual Information (tmi) is defined as the weighted average of the pointwise mutual informations for all the observed and expected value pairs. tmi = [n111/nppp * log(n111/m111) + n112/nppp * log(n112/m112) + n121/nppp * log(n121/m121) + n122/np...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::CHI::x2 - Perl module that implements Pearson's chi squared measure of association for bigrams. River stage one • 2 direct dependents • 3 total dependents

Pearson's Chi-squred test measures the devitation between the observed data and what would be expected if <word1> and <word2> were independent. The higher the score, the less evidence there is in favor of concluding that the words are independent. As...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC
35 results (0.039 seconds)