Search results for "distribution:Text-SenseClusters FREQUENCY"
frequency.pl - Compute the distribution of senses in a Senseval-2 data file
Displays distribution of senses in a given Senseval-2 file to STDOUT. This information can be used to better understand the data, and also to decide to filter low frequency senses (using filter.pl) or balance the distribution of senses (using balance...
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
README.Toolkit - SenseClusters Toolkit directory structure with links to all program documentation
TPEDERSE/Text-SenseClusters-1.05
-
03 Oct 2015 14:33:01 UTC
README.samples - How to run SenseCluster sample scripts
The samples directory allows a user to run various sample experiments with the SenseClusters system. Sample data is provided in the /Data directory, and there are scripts available that show how to exercise some of the major functionality of the pack...
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
filter.pl - Remove the instances of low frequency sense tags from a Senseval-2 data file
This program will remove low frequency sense tags from a Senseval-2 data set by specifying a percentage or rank threshhold. By default it removes any sense tag associated with less than 1% of the total instances. Output is to STDOUT, so the original ...
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
setup.pl - Preprocess Senseval-2 data for sample experiments
TPEDERSE/Text-SenseClusters-1.05
-
03 Oct 2015 14:33:01 UTC
wordvec.pl - Construct word vectors from bigram or co-occurrence matrices
Constructs word vectors from the given WORD_PAIRS....
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
order1vec.pl - Convert Senseval-2 format contexts into first order feature vectors in Cluto format
Convert a context into a first order feature vector which shows how which features occured in the contexts. The possible features are identified via Perl regular expressions of the form created by nsp2regex.pl....
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
order2vec.pl - Convert Senseval-2 contexts into second order context vectors in Cluto format
Creates second order context vectors by averaging word or feature vectors of the contextual features....
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
reduce-count.pl - Reduce size of feature space by removing words not in evaluation data
This program removes all bigrams from the given BIGRAM file that do not include at least one constituent word from the UNIGRAM file. Note that this can also be applied on a co-occurrence file. The intent of this in SenseClusters is to allow a user to...
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC
clusterstopping.pl - Predict the optimal number of clusters in a data set
Predicts the optimal number of clusters for the given data. This script tries to find the optimal number of clusters for the given INPUTFILE....
TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC