The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

Search results for "dist:Text-SenseClusters FREQUENCY"

frequency.pl - Compute the distribution of senses in a Senseval-2 data file River stage zero No dependents

Displays distribution of senses in a given Senseval-2 file to STDOUT. This information can be used to better understand the data, and also to decide to filter low frequency senses (using filter.pl) or balance the distribution of senses (using balance...

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

README.Toolkit - SenseClusters Toolkit directory structure with links to all program documentation River stage zero No dependents

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

README.samples - How to run SenseCluster sample scripts River stage zero No dependents

The samples directory allows a user to run various sample experiments with the SenseClusters system. Sample data is provided in the /Data directory, and there are scripts available that show how to exercise some of the major functionality of the pack...

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

filter.pl - Remove the instances of low frequency sense tags from a Senseval-2 data file River stage zero No dependents

This program will remove low frequency sense tags from a Senseval-2 data set by specifying a percentage or rank threshhold. By default it removes any sense tag associated with less than 1% of the total instances. Output is to STDOUT, so the original ...

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

setup.pl - Preprocess Senseval-2 data for sample experiments River stage zero No dependents

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

wordvec.pl - Construct word vectors from bigram or co-occurrence matrices River stage zero No dependents

Constructs word vectors from the given WORD_PAIRS....

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

order1vec.pl - Convert Senseval-2 format contexts into first order feature vectors in Cluto format River stage zero No dependents

Convert a context into a first order feature vector which shows how which features occured in the contexts. The possible features are identified via Perl regular expressions of the form created by nsp2regex.pl....

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

order2vec.pl - Convert Senseval-2 contexts into second order context vectors in Cluto format River stage zero No dependents

Creates second order context vectors by averaging word or feature vectors of the contextual features....

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

discriminate.pl River stage zero No dependents

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

reduce-count.pl - Reduce size of feature space by removing words not in evaluation data River stage zero No dependents

This program removes all bigrams from the given BIGRAM file that do not include at least one constituent word from the UNIGRAM file. Note that this can also be applied on a co-occurrence file. The intent of this in SenseClusters is to allow a user to...

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

clusterstopping.pl - Predict the optimal number of clusters in a data set River stage zero No dependents

Predicts the optimal number of clusters for the given data. This script tries to find the optimal number of clusters for the given INPUTFILE....

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
11 results (0.034 seconds)