The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

Search results for "distribution:Lingua-Interset Lingua::Features::Tag"

Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped. River stage one • 1 direct dependent • 5 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Trie - A trie-like structure for DZ Interset features and their values. River stage one • 1 direct dependent • 5 total dependents

The "Trie" class defines a trie-like data structure for DZ Interset features and their values. It is an auxiliary data structure that an outside user should not need to use directly. It is used to describe all feature-value combinations that are perm...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Atom - Atomic driver for a surface feature. River stage one • 1 direct dependent • 5 total dependents

Atom is a special case of a tagset driver. As the name suggests, the surface tags are considered atomic, i.e. indivisible. It provides environment for easy mapping between surface strings and Interset features. While Atom can be used to implement dri...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset - The root class for all physical tagsets covered by DZ Interset 2.0. River stage one • 1 direct dependent • 5 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "Tagset" class is the inheritance root for all classes descr...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::FeatureStructure - Definition of morphosyntactic features and their values. River stage one • 1 direct dependent • 5 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "FeatureStructure" class defines all morphosyntactic feature...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::OldTagsetDriver - A temporary envelope that provides access to the old (Interset 1.0) drivers from Interset 2.0. River stage one • 1 direct dependent • 5 total dependents

Provides object envelope for an old, non-object-oriented driver from Interset 1.0. This makes the old drivers at least partially usable until they are fully ported to Interset 2.0. Note however that the old drivers use Interset features and/or values...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::Multext - Common code for drivers of tagsets of the Multext-EAST project. River stage one • 1 direct dependent • 5 total dependents

Common code for drivers of tagsets of the Multext-EAST project. All the Multext-EAST tagsets use the same inventory of parts of speech and the same inventory of features (but not all features are used in all languages). Feature values are individual ...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::FI::Turku - Driver for the Finnish tagset from the Turku Dependency Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Finnish tagset from the Turku Dependency Treebank. Tag is a sequence of features separated by vertical bars. There are just the feature values, not attribute-value pairs....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::BN::Conll - Driver for the Bengali tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Bengali tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. ICON shared...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::TE::Conll - Driver for the Telugu tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Telugu tagset of the ICON 2009 and 2010 Shared Tasks, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. ICON shared ...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::UR::Conll - Driver for the tagset of the Hyderabad Urdu Treebank, as used in the CoNLL data format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Urdu treebank from Hyderabad, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. In the case of Urdu, t...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::HI::Conll - Driver for the Hindi tagset of the shared tasks at ICON 2009, ICON 2010 and COLING 2012, as used in the CoNLL data format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Hindi tagset of the shared tasks at ICON 2009, ICON 2010 and COLING 2012, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS an...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::ET::Puudepank - Driver for the Estonian tagset from the Eesti keele puudepank (Estonian Language Treebank). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Estonian tagset from the Eesti keele puudepank (Estonian Language Treebank). Tag is the part of speech followed by a slash and the morphosyntactic features, separated by commas....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
13 results (0.046 seconds)