Search results for "Lingua::EN::Splitter"

Lingua::EN::Splitter - Split text into words, paragraphs, segments, and tiles
River stage one • 1 direct dependent • 1 total dependent

See synopsis. This module can be used in an object-oriented fashion or the routines can be exported....

SPLICE/Lingua-EN-Segmenter-0.1 - 03 Mar 2005 03:20:54 UTC

FL3 - A shortcut module for Lingua::FreeLing3.
River stage one • 1 direct dependent • 1 total dependent

Implements a set of utility functions to access "Lingua::FreeLing3" objects. Every time one of the accessors is used with just the language code or language data file (or with the default language), the cached processor is returned if it exists. If any ...

AMBS/Lingua-FreeLing3-0.09 - 12 Jan 2014 16:21:27 UTC

DBIx::FullTextSearch - Indexing documents with MySQL as storage
River stage one • 1 direct dependent • 1 total dependent

DBIx::FullTextSearch is a flexible solution for indexing the contents of documents. It uses the MySQL database to store information about words and documents, and provides a Perl interface for indexing new documents, making changes and searching for mat...

TJMATHER/DBIx-FullTextSearch-0.73 - 02 Mar 2003 22:46:49 UTC

Lingua::NameUtils - Identify given/family names and capitalize correctly
River stage zero • No dependents

This module is useful when receiving a person's name that might be all uppercase, or in the wrong case, or it might have the given names and the family name combined in a single string (e.g., a single spreadsheet column), and you need to split the fu...

RAFORG/Lingua-NameUtils-1.003 - 09 Jul 2023 13:23:37 UTC

Lingua::Sentence - Perl extension for breaking text paragraphs into sentences
River stage one • 5 direct dependents • 5 total dependents

This module allows splitting of text paragraphs into sentences. It is based on scripts developed by Philipp Koehn and Josh Schroeder for processing the Europarl corpus (<http://www.statmt.org/europarl/>). The module uses punctuation and capitalizatio...

CAPOEIRAB/Lingua-Sentence-1.100 - 26 Feb 2017 23:06:04 UTC

Uplug::PreProcess::SentDetect - Moses/Europarl sentence boundary detector
River stage two • 10 direct dependents • 10 total dependents

This module is basically a copy of Lingua::Sentence by Achim Ruopp, adapted to Uplug. It is based on tools developed for Moses and the Europarl corpus. All credits go to the original authors. This version includes some additional non-breaking prefix...

TIEDEMANN/uplug-main-0.3.8 - 16 Mar 2013 20:19:32 UTC

6 results (0.041 seconds)