The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version 0.4

  • nlp_port are now declared as global.
  • Correction in the TermTagging : language switch was well taken into account
  • Correction in the management of the ".proc_id" file
  • correction in the computing of the xml rendering time (the variable is set to zero ;-)
  • stderr when NLP tools are called, is redirected in a log file
  • addition of a variable DEBUG defining a debug mode (temporary files are not removed)
  • alvis-nlp-standalone can read a file given in argument or on the STDIN stream
  • Documentation of the modules and scripts are gathered at the end of each file
  • Addition of DTD and XSD files in the documentation (etc directory)
  • Additional functionality: Loading files in various formats (PDF, LaTeX, Word, etc.) before carrying out linguistic annotations.
  • Addition of the modules Alvis::NLPPlatform::Convert and Alvis::NLPPlatform::Document for converting files in various formats in ALVIS XML.
  • Definition of the ogmios-standalone, ogmios-nlp-server, ogmios-nlp-client: annotation scripts from various formats
  • Improvement in the sentence segmentation: taking into account sectioning (!)
  • Addition of a Build.PL file
  • Enable to load empty markups
  • best management of UTF8 (use of Encode module)
  • various fixes and optimization
  • Yatea wrapper: new variable to get an yatea XML output or not
  • Yatea warpper: addition of the output of yatea in the XML output for the platform.
  • bug fixes
  • Rewrite of the TreeTagger wraper always by using
  • hash_words_punct but less complexe
  • Modification in the Wrapper of bioLG : options are set in the XML form
  • integration of the cleanning of the output of bioLg in the code.
  • Corrections in the LGbio wrapper
  • Output Data can be stored in a descriptor or a scalar.
  • Addition of the constituents in the BioLG wrapper (UserNLPWrapper.pm)
  • Addition of examples
  • Best management of the options (if they are not set)
  • - Bad hack for the quick integration of the semantic tagging
  • (tool SemanticTypeTagger) ** COMMENTED CODE **

Documentation

Perl script for the client of the Alvis NLP Platform running in the distributed mode.
Perl script for the server of the Alvis NLP Platform.
Perl script for linguistically annotating a corpus contained in a file
Perl script for the client of the Alvis NLP Platform running in the distributed mode.
Perl script for the server of the Ogmios NLP Platform.
Perl script for linguistically annotating files given in argument and in various format (PDF, Word, etc.).

Modules

Perl extension for linguistically annotating XML documents in Alvis
Perl extension for managing XML annotation of documents in the Alvis format
Perl extension for cleaning XML annotation of the canonical part of documents given the Alvis format.
Perl extension for converting files in any format into the ALVIS XML.
Perl extension for handling (getting and adding) information into a ALVIS XML file or data.
Internal Perl extension for analysing XML documents in the Alvis format
Perl extension for the wrappers used for linguistically annotating XML documents in Alvis
User interface for customizing the NLP wrappers used for linguistically annotating of XML documents in Alvis
User interface for customizing the NLP wrappers used to linguistically annotating of XML documents in Alvis
Perl extension for managing characters which can not be used in a XML document