Moot-2.0.10_006 - Perl interface to the libmoot HMM tagger library

Changes for version v2.0.10_006 - 2015-09-25

waste: improved handling of negative mode selectors (e.g. -N)
updated perl bindings
- added Moot::Waste::Annotator class
- updated Moot::TokPP to use Moot::Waste::Annotator

fixed wasteScanner choking on long utf-8-encoded characters (e.g. U+1D1A3 : MUSICAL SYMBOL ORNAMENT STROKE-9 : \xf0\x9d\x86\xa3 in bach_versuch02_1762)
- wasteScanner should now handle even non-utf8 more or less gracefully
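
The byte sequence cited above can be reproduced with Perl's core Encode module. This is only an illustrative sketch: it does not call wasteScanner, and the malformed input string and the lenient-decoding step are assumptions chosen to show one way of handling non-utf8 bytes, not the scanner's actual behavior.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# U+1D1A3 lies outside the Basic Multilingual Plane, so UTF-8 needs 4 bytes.
my $char  = chr(0x1D1A3);
my $bytes = encode('UTF-8', $char);
my $hex   = join ' ', map { sprintf '%02x', ord } split //, $bytes;
print length($bytes), " bytes: $hex\n";   # prints "4 bytes: f0 9d 86 a3"

# Lenient decoding (assumption, for illustration only): with FB_DEFAULT,
# malformed bytes are replaced by U+FFFD instead of raising an error.
my $bad     = "abc\xff\xfexyz";           # hypothetical non-UTF-8 input
my $decoded = decode('UTF-8', $bad, Encode::FB_DEFAULT);
print "decoded ", length($decoded), " characters\n";
```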

v2.0.10-1: workaround for probability underflow error propagation in mootHMM::tag_stream()
- once underflowed, no more differentiation was made, since no nodes qualified as flushable until EOF
- workaround flushes nodes whenever 'unsafe' probabilities (< -1e37) are encountered
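
The idea behind the workaround can be sketched as a toy loop in plain Perl. Everything here is hypothetical apart from the -1e37 threshold taken from the entry above: the names UNSAFE_LOGP and flush_points, the running-sum model, and the reset after each flush are assumptions for illustration, not the code in mootHMM::tag_stream().

```perl
use strict;
use warnings;

# Threshold from the changelog entry; the constant name is hypothetical.
use constant UNSAFE_LOGP => -1e37;

# Toy model (assumption): accumulate per-step log-probabilities and record
# the step indices at which pending nodes would be flushed because the
# running score has dropped below the 'unsafe' threshold.
sub flush_points {
    my @step_logps = @_;
    my $running = 0;
    my @flush_at;
    for my $i (0 .. $#step_logps) {
        $running += $step_logps[$i];
        if ($running < UNSAFE_LOGP) {
            push @flush_at, $i;   # flush pending nodes here
            $running = 0;         # restart accumulation after the flush
        }
    }
    return @flush_at;
}

my @flush_at = flush_points(-4e36, -4e36, -4e36, -1, -2e37);
print "flush after steps: @flush_at\n";   # prints "flush after steps: 2 4"
```

Flushing before the score underflows keeps later candidates distinguishable, rather than waiting until EOF.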
encoding tweaks for Moot::TokPP::analyze_buffer()
tokpp improvements / fixes
fixed to jibe with kmw's wasteLexer changes
wasteTrainWriter: basically working, but links are being dropped (scanner bug)
waste training prototype in testme.perl
added Moot::TokPP, moot-tokpp.perl : drop-in replacement for dwds_tomasotath tokenizer-supplied pseudo-morphology
documented Waste::Lexer::dehyphenate()
make distcheck fixes
got Moot::Waste::Decoder working, including buffer-level access
added Waste::Decoder to perl
Waste::Lexer seems to be working
- including get/set on underlying scanner, using lexer->tr_data to hold an SV
removed WasteLexerPerl class
- was WIP for simultaneous support of both standalone and embedded wasteLexicon objects, now abandoned
Waste::Lexicon : now only accessible via Waste::Lexer
- avoids ref-counting madness for embedded objects
added TokenReader, TokenWriter hierarchy wrappers
- WIP on wasteLexer, wasteLexicon
wrapped wasteTokenScanner as Moot::Waste::Scanner
added scanner, lexer type constants (why? they're not actually _used_ ... we should probably remove them again)
wrapper uses PerlIO layer
TokenReader bugfixes (check for null tr_istream in from_filename())

added re2c_ucl.py (re2c char-class generator)
added wasteScannerScan.* templates for waste generation
added moot(lookup|merge)-(lex|123).perl to MANIFEST
added mootlookup-lex.perl
fixes for weird DynaLoader bug on perl v5.14 / 32-bit i686 / debian wheezy if CCFLAGS is set in Makefile.PL
- strangely, x86_64 machine was unaffected
- bad: Linux plato 3.2.0-4-686-pae #1 SMP Debian 3.2.41-2 i686 GNU/Linux
added command-line utils mootmerge*.perl
updated version for 2.0.9-2