The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Lingua::Interset::Tagset::HE::Conll - Driver for the Hebrew tagset.

VERSION

version 3.012

SYNOPSIS

  use Lingua::Interset::Tagset::HE::Conll;
  my $driver = Lingua::Interset::Tagset::HE::Conll->new();
  my $fs = $driver->decode("NN\tNN\tM|S");

or

  use Lingua::Interset qw(decode);
  my $fs = decode('he::conll', "NN\tNN\tM|S");

DESCRIPTION

Interset driver for the Hebrew tagset in CoNLL format. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT.

Tagset described in Yoav Goldberg: Automatic Syntactic Processing of Modern Hebrew Automatic Syntactic Processing of Modern Hebrew (2011), p. 32, http://www.cs.bgu.ac.il/~nlpproj/yoav-phd.pdf

TODO: try to use the official (but not as easy to process) resource: BGU Computational Linguistics Group. Hebrew morphological tagging guidelines. Technical report, Ben Gurion University of the Negev, 2008. http://www.cs.bgu.ac.il/~adlerm/tagging-guideline.pdf

SEE ALSO

Lingua::Interset, Lingua::Interset::Tagset, Lingua::Interset::Tagset::Conll, Lingua::Interset::FeatureStructure

AUTHOR

Rudolf Rosa <rosa@ufal.mff.cuni.cz>

AUTHOR

Dan Zeman <zeman@ufal.mff.cuni.cz>

COPYRIGHT AND LICENSE

This software is copyright (c) 2017 by Univerzita Karlova (Charles University).

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.