Search results for "Lingua Treebank"
Lingua::Treebank - Perl extension for manipulating the Penn Treebank format
This class knows how to read two treebank formats, the Penn format and the Chomsky Normal Form (CNF) format. These formats differ in how they handle terminal nodes. The Penn format places pre-terminal part of speech tags in the left-hand position of ...
KAHN/Lingua-Treebank-0.16 - 28 Aug 2008 20:08:52 UTC
- Lingua::Treebank::Const - Object modeling constituent from a treebank
- Lingua::Treebank::HeadFinder - Head-finding in Lingua::Treebank
- get_words - given collapsed treebank, print words only
Lingua::Align::Corpus::Treebank - Factory class for reading treebanks
Factory class of modules for reading treebanks in different formats. The default format is the Penn Treebank format. Other supported formats are the format produced by the Berkeley parser, the Stanford parser (including typed dependencies), TigerXML ...
TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC
- Lingua::Align::Corpus::Treebank::Penn - Read the Penn Treebank format
- Lingua::Align::Corpus::Treebank::TigerXML - Read the TigerXML format
- Lingua::Align::Corpus::Treebank::Stanford - Read output from the Stanford parser
Text::StemTagPOS - Computes stemmed/POS tagged lists of text.
"Text::StemTagPOS" uses the modules Lingua::Stem::Snowball and Lingua::EN::Tagger to do part-of-speech tagging and stemming of English text. It was developed to pre-process text for other modules. Encoding of all text should be in Perl's internal for...
KUBINA/Text-StemTagPOS-0.61 - 31 Dec 2011 13:41:21 UTC
Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped.
DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
- Lingua::Interset::Atom - Atomic driver for a surface feature.
- Lingua::Interset::Tagset::HI::Conll - Driver for the Hindi tagset of the shared tasks at ICON 2009, ICON 2010 and COLING 2012, as used in the CoNLL data format.
- Lingua::Interset::Tagset::DE::Smor - Driver for the German tagset of SMOR (Stuttgart Morphology)
Treex::Block::W2A::EN::TagLinguaEn
Each node in the analytical tree is tagged using "Lingua::EN::Tagger" (Penn Treebank POS tags). Because Lingua::EN::Tagger does its own tokenization, it checks if the tokenization is the same....
VARISD/Treex-EN-2.20151102 - 02 Nov 2015 20:29:13 UTC