NAME

calculate-corpus-stats.pl - Perl script that reports corpus statistics for a semcor-reformatted corpus file.

SYNOPSIS

calculate-corpus-stats.pl --file FILE

DESCRIPTION

This script reports the distribution of instances by part of speech, and the distribution of instances by the number of senses available for each instance. For example, the instance winter#n has only 1 sense associated with it, so it is counted among the instances with exactly 1 sense.
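
The two tallies described above can be sketched as follows. This is a minimal illustration, not the script's actual code: it assumes instances are already available as word#pos tokens, and it takes sense counts from a hypothetical %sense_count hash rather than from a real sense inventory. The corpus_stats sub and every count except the one for winter#n are invented for the example.

```perl
use strict;
use warnings;

# Hypothetical sense inventory: word#pos => number of senses.
# Only winter#n => 1 comes from the description; the rest is made up.
my %sense_count = (
    'winter#n' => 1,
    'run#v'    => 3,
    'cold#a'   => 2,
);

# Tally instances by part of speech and by number of available senses.
sub corpus_stats {
    my ($instances, $senses) = @_;
    my (%by_pos, %by_senses);
    for my $instance (@$instances) {
        my ($word, $pos) = split /#/, $instance;
        next unless defined $pos;          # skip tokens without a POS tag
        $by_pos{$pos}++;
        my $n = $senses->{"$word#$pos"};
        next unless defined $n;            # skip words missing from the inventory
        $by_senses{$n}++;
    }
    return (\%by_pos, \%by_senses);
}

# Sample instances (assumed input, not taken from a real corpus).
my @instances = ('winter#n', 'run#v', 'winter#n', 'cold#a');
my ($by_pos, $by_senses) = corpus_stats(\@instances, \%sense_count);

print "POS $_: $by_pos->{$_}\n" for sort keys %$by_pos;
print "$_ sense(s): $by_senses->{$_}\n"
    for sort { $a <=> $b } keys %$by_senses;
```

Each occurrence counts as a separate instance, so the two occurrences of winter#n both land in the bucket for instances with 1 sense.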

AUTHORS

 Varada Kolhatkar, University of Minnesota, Duluth
 <kolha002 at d.umn.edu>

 Ted Pedersen, University of Minnesota, Duluth
 <tpederse at d.umn.edu>

This document was last modified by: $Id: calculate-corpus-stats.pl,v 1.3 2009/04/30 22:08:49 kvarada Exp $

SEE ALSO

 semcor-reformat.pl

COPYRIGHT

Copyright (C) 2005-2008 by Jason Michelizzi and Ted Pedersen

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.