Search results for "dist:Lingua-Interset Lingua::Features::Value"
Lingua::Interset::Trie - A trie-like structure for DZ Interset features and their values.
The "Trie" class defines a trie-like data structure for DZ Interset features and their values. It is an auxiliary data structure that an outside user should not need to use directly. It is used to describe all feature-value combinations that are perm...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped.
DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Atom - Atomic driver for a surface feature.
Atom is a special case of a tagset driver. As the name suggests, the surface tags are considered atomic, i.e. indivisible. It provides environment for easy mapping between surface strings and Interset features. While Atom can be used to implement dri...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset - The root class for all physical tagsets covered by DZ Interset 2.0.
DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "Tagset" class is the inheritance root for all classes descr...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::FeatureStructure - Definition of morphosyntactic features and their values.
DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "FeatureStructure" class defines all morphosyntactic feature...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::Multext - Common code for drivers of tagsets of the Multext-EAST project.
Common code for drivers of tagsets of the Multext-EAST project. All the Multext-EAST tagsets use the same inventory of parts of speech and the same inventory of features (but not all features are used in all languages). Feature values are individual ...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::FI::Turku - Driver for the Finnish tagset from the Turku Dependency Treebank.
Interset driver for the Finnish tagset from the Turku Dependency Treebank. Tag is a sequence of features separated by vertical bars. There are just the feature values, not attribute-value pairs....
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::UR::Conll - Driver for the tagset of the Hyderabad Urdu Treebank, as used in the CoNLL data format.
Interset driver for the tagset of the Urdu treebank from Hyderabad, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. In the case of Urdu, t...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::HI::Conll - Driver for the Hindi tagset of the shared tasks at ICON 2009, ICON 2010 and COLING 2012, as used in the CoNLL data format.
Interset driver for the Hindi tagset of the shared tasks at ICON 2009, ICON 2010 and COLING 2012, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS an...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC