
CHANGES Changelog for the Ngram Statistics Package (Text-NSP)

Revision history for Perl module Text::NSP

Released Feb 16, 2013 all changes by BTM
Released Jan 15, 2012 all changes by BTM
Release March 31, 2011 all changes by YL
Replaced the tail hash of huge-merge.pl by without use hash.
Released November 12, 2010 all changes by BTM
>>>>>>> 1.24 =item 1.19
Released November 1, 2010 all changes by YL
Released April 26, 2010 all changes by YL
Released April 7, 2010 all changes by YL
Released March 5, 2010 all changes by TDP and YL
Reported here : <L http://tech.groups.yahoo.com/group/ngram/message/280>
This was not added in 1.11 due to failure to rebuilt MANIFEST.
Released Nov 5, 2009 all changes by TDP
Reported here : <L http://tech.groups.yahoo.com/group/ngram/message/240>
Reported here : <L http://tech.groups.yahoo.com/group/ngram/message/280>
Reported here : <L http://tech.groups.yahoo.com/group/ngram/message/230>
Reported here : <L http://tech.groups.yahoo.com/group/ngram/message/290>
Released March 26, 2008 all changes by TDP
Released March 24, 2008 all changes by TDP
Released March 20, 2008 all changes by TDP

There is a limitation in huge-count.pl. When the size of the corpus is very large (>16G) and the some of the terms of the bigrams is very long (>30 chars), the program could run out of memory at huge-merge.pl step. This is because huge-merge use two hashes to count the frequencies of the first and second term of the bigrams. These two hashes could use up the memory with the increase of the length of the terms and the increase of the number of the terms. If just for normal text, terms are within limited length and numbers, the software won't use up the memory.

Ying Liu, University of Minnesota, Twin Cities liux0395 at umn.edu Ted Pedersen, University of Minnesota, Duluth tpederse at d.umn.edu
This document last modified by : $Id: CHANGES.pod,v 1.34 2013/02/16 21:23:27 tpederse Exp $


Copyright (c) 2004-2011 Ted Pedersen
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts.
Note: a copy of the GNU Free Documentation License is available on the web at http://www.gnu.org/copyleft/fdl.html and is included in this distribution as FDL.txt. Ted Pedersen, University of Minnesota, Duluth tpederse at d.umn.edu
This document last modified by : $Id: CHANGES.pod,v 1.34 2013/02/16 21:23:27 tpederse Exp $


Copyright (c) 2004-2011 Ted Pedersen
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts.
Note: a copy of the GNU Free Documentation License is available on the web at http://www.gnu.org/copyleft/fdl.html and is included in this distribution as FDL.txt.