The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Search results for "Lingua::Alignment"

Alignment.pm River stage zero No dependents

PLAMBERT/Lingua-AlignmentSet-1.1 - 17 Mar 2007 16:55:38 UTC - Search in distribution

lib/Lingua/Align/Features/Alignment.pm River stage zero No dependents

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC - Search in distribution

Lingua::TT::TextAlignment - TT Utils: alignment raw text lt-gt tokenized text River stage one • 1 direct dependent • 1 total dependent

The RTT "raw + text + tags" format is a "vertical" text format for combined storage of explicit token boundaries together with raw original (untokenized) text. It is a line-based formats with lines of the form: "%%$RTT:COMPACT=" *BOOL* RTT processing...

MOOCOW/Lingua-TT-0.13 - 16 Nov 2021 18:17:55 UTC - Search in distribution

opus2moses - convert aligned plain text files into OPUS XML River stage zero No dependents

"moses2opus" converts a sentence-aligned corpus in plain text format (Moses format or fast_align format) into the OPUS XML-based format. It also does sentence-splitting based on Lingua::Sentence...

TIEDEMANN/OPUS-Tools-0.2.2 - 26 Aug 2020 09:34:28 UTC - Search in distribution

Text::Fuzzy - Partial string matching using edit distances River stage one • 5 direct dependents • 7 total dependents

This module calculates edit distances between words, and searches arrays and files to find the nearest entry by edit distance. It handles both byte strings and character strings (strings containing Unicode), treating each Unicode character as a singl...

BKB/Text-Fuzzy-0.29 - 10 Dec 2020 05:39:33 UTC - Search in distribution

DBIx::Admin::TableInfo - A wrapper for all of table_info(), column_info(), *_key_info() River stage one • 2 direct dependents • 2 total dependents

"DBIx::Admin::TableInfo" is a pure Perl module. It is a convenient wrapper around all of these DBI methods: o table_info() o column_info() o primary_key_info() o foreign_key_info() o MySQL Warning: To get foreign key information in the output, the cr...

RSAVAGE/DBIx-Admin-TableInfo-3.04 - 04 Feb 2021 04:52:42 UTC - Search in distribution

sentences - Command line tool for text segmentation, tokenization and annotation River stage one • 5 direct dependents • 6 total dependents

"sentences" is a command line tool for text segmentation and annotation. It uses the "fsentences" function from "Lingua::PT::PLNbase". Its main behaviour is the detection of sentences and paragraphs, and their annotation with XML-like tags: <s> for s...

AMBS/Lingua-PT-PLNbase-0.29 - 20 Feb 2022 10:10:09 UTC - Search in distribution

Lingua::PTD - Module to handle PTD files in Dumper Format River stage one • 2 direct dependents • 2 total dependents

PTD files in Perl Dumper format are simple hashes references. But they use a specific structure, and this module provides a simple interface to manipulate it. "new" The "new" constructor returns a new Lingua::PTD object. This constructor receives a P...

AMBS/Lingua-PTD-1.17 - 01 Jul 2022 18:58:39 UTC - Search in distribution

Lingua::NATools - A framework for Parallel Corpora processing River stage zero No dependents

This is a collection of functions used on the NATools tools. Some of them can be used independently. Check documentation bellow. "init" Use this function to initialize a parallel corpora repository. You must supply a "directory" where the repository ...

AMBS/Lingua-NATools-v0.7.12 - 09 Nov 2020 21:40:02 UTC - Search in distribution

Lingua::POSAlign - Local alignment tool for POS tags River stage zero No dependents

This modules enable you to make pairwise alignments between any two POS tag sequences. use Lingua::POSAlign; my $n = new Lingua::POSAlign; $n->align([qw(DT MD VB VBN)], [qw(DT NNS VBP VBN)]); $a = $n->alignment; # Return the alignment result $n->tota...

XERN/Lingua-POSAlign-0.01 - 11 Jul 2005 10:56:20 UTC - Search in distribution

Lingua::EO::Orthography - A orthography/substitute converter for Esperanto characters River stage zero No dependents

6 letters in the Esperanto alphabet did not exist in ASCII. Their letters, which have supersigns (eo: supersignoj), are often spelled in substitute notations (eo: surogataj skribosistemoj) for the history, namely, for the ages of typography and typew...

MORIYA/Lingua-EO-Orthography-0.04 - 24 Dec 2013 17:11:43 UTC - Search in distribution

Lingua::FuzzyTrans::PT2GL - Translates Portuguese words to Galician using fuzzy replacements River stage zero No dependents

A simple rule-substitution-based fuzzy translator from Portuguese words to Galician words, as despicted on Dictionary Alignment based on Rewrite-based Entry Translation <http://drops.dagstuhl.de/opus/volltexte/2013/4041/pdf/16.pdf>. translate This is...

AMBS/Lingua-FuzzyTrans-PT2GL-0.001 - 16 Oct 2017 13:16:08 UTC - Search in distribution
12 results (0.033 seconds)