NAME

Treex::Block::W2A::ResegmentSentences

VERSION

version 0.08170

DESCRIPTION

If the sentence segmenter says that the current sentence is actually composed of two or more sentences, then new bundles are inserted after the current bundle, each containing just one piece of the resegmented original sentence.

All zones are processed. The number of bundles created is determined by the number of subsegments in the "current" zone (the zone specified by the language and selector parameters). If a zone contains fewer subsegments than the current one, the remaining bundles will contain an empty sentence for that zone. If a zone contains more subsegments than the current one, the surplus subsegments will be joined together in the last bundle.

In other words, it is guaranteed that the current zone will not contain empty sentences.
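
The alignment rule described above can be illustrated with a small, library-independent sketch. This is not the module's Perl implementation and does not use the Treex API. The function names, the zone labels ("en", "cs", "de"), and the choice of a single space for joining surplus subsegments are all assumptions made for illustration.

```python
def distribute_segments(segments, n_bundles, joiner=" "):
    """Spread one zone's subsegments over exactly n_bundles slots.

    Hypothetical helper: pads with empty sentences when the zone has too
    few subsegments, and joins the surplus into the last slot when it has
    too many. The joiner (a single space) is an assumption.
    """
    if n_bundles < 1:
        raise ValueError("n_bundles must be at least 1")
    # All slots except the last take one subsegment each, if available.
    head = list(segments[:n_bundles - 1])
    # Pad the missing slots with empty sentences.
    head += [""] * (n_bundles - 1 - len(head))
    # The last slot takes everything that is left (possibly nothing).
    head.append(joiner.join(segments[n_bundles - 1:]))
    return head


def resegment(zones, current):
    """Return one dict per bundle, mapping zone label -> sentence.

    `zones` maps a zone label to its list of subsegments; `current` is the
    label of the zone that determines the number of bundles.
    """
    n_bundles = len(zones[current])
    columns = {
        label: distribute_segments(segs, n_bundles)
        for label, segs in zones.items()
    }
    return [
        {label: columns[label][i] for label in zones}
        for i in range(n_bundles)
    ]


zones = {
    "en": ["Hello.", "How are you?"],           # current zone: 2 subsegments
    "cs": ["Ahoj."],                            # fewer: padded with ""
    "de": ["Hallo.", "Wie geht's?", "Gut."],    # more: surplus joined
}
bundles = resegment(zones, current="en")
```

With this input the sketch yields two bundles. In the second one the "cs" zone holds an empty sentence and the "de" zone holds "Wie geht's? Gut.", while the current zone "en" has a non-empty sentence in every bundle.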

AUTHOR

Zdeněk Žabokrtský <zabokrtsky@ufal.mff.cuni.cz>

Martin Popel <popel@ufal.mff.cuni.cz>

COPYRIGHT AND LICENSE

Copyright © 2011 by Institute of Formal and Applied Linguistics, Charles University in Prague

This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
