Modules

abstract ancestor for parallel-corpora document readers
abstract ancestor for parallel-corpora document readers
split bundles which contain more sentences
rule based segmentation to sentences
segment text on new lines
universal block for PoS tagging and lemmatization
language independent rule based tokenizer
Base tokenizer, splits on whitespaces, fills no_space_after
Rule based pseudo language-independent sentence segmenter
base class for Featurama PoS taggers
wrapper for Ufal::MorphoDiTa
role for PoS taggers
collection of blocks parametrized by language and language independent

Provides

in lib/Treex/Tool/ProcessUtils.pm