Search results for "distribution:Lingua-Interset Lingua::Features::Feature"
Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped.
DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Atom - Atomic driver for a surface feature.
Atom is a special case of a tagset driver. As the name suggests, the surface tags are considered atomic, i.e. indivisible. It provides environment for easy mapping between surface strings and Interset features. While Atom can be used to implement dri...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Trie - A trie-like structure for DZ Interset features and their values.
The "Trie" class defines a trie-like data structure for DZ Interset features and their values. It is an auxiliary data structure that an outside user should not need to use directly. It is used to describe all feature-value combinations that are perm...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset - The root class for all physical tagsets covered by DZ Interset 2.0.
DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "Tagset" class is the inheritance root for all classes descr...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::FeatureStructure - Definition of morphosyntactic features and their values.
DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "FeatureStructure" class defines all morphosyntactic feature...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::Multext - Common code for drivers of tagsets of the Multext-EAST project.
Common code for drivers of tagsets of the Multext-EAST project. All the Multext-EAST tagsets use the same inventory of parts of speech and the same inventory of features (but not all features are used in all languages). Feature values are individual ...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::OldTagsetDriver - A temporary envelope that provides access to the old (Interset 1.0) drivers from Interset 2.0.
Provides object envelope for an old, non-object-oriented driver from Interset 1.0. This makes the old drivers at least partially usable until they are fully ported to Interset 2.0. Note however that the old drivers use Interset features and/or values...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::FI::Turku - Driver for the Finnish tagset from the Turku Dependency Treebank.
Interset driver for the Finnish tagset from the Turku Dependency Treebank. Tag is a sequence of features separated by vertical bars. There are just the feature values, not attribute-value pairs....
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::EU::Conll - Driver for the tagset of the Basque Dependency Treebank in the CoNLL format.
Interset driver for the tagset of the Basque Dependency Treebank version 2011 in the CoNLL format. Note that this version of the tagset is slightly different from the Basque data of the CoNLL 2007 Shared Task. For instance, the features now contain f...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::BN::Conll - Driver for the Bengali tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format.
Interset driver for the Bengali tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. ICON shared...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::TE::Conll - Driver for the Telugu tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format.
Interset driver for the Telugu tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. ICON shared ...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::UR::Conll - Driver for the tagset of the Hyderabad Urdu Treebank, as used in the CoNLL data format.
Interset driver for the tagset of the Urdu treebank from Hyderabad, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. In the case of Urdu, t...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
Lingua::Interset::Tagset::HI::Conll - Driver for the Hindi tagset of the shared tasks at ICON 2009, ICON 2010 and COLING 2012, as used in the CoNLL data format.
Interset driver for the Hindi tagset of the shared tasks at ICON 2009, ICON 2010 and COLING 2012, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS an...
ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC