Search results for "Lingua::Alignment"
Alignment.pm
PLAMBERT/Lingua-AlignmentSet-1.1 - 17 Mar 2007 16:55:38 UTC
- Lingua::AlignmentSet - Tools library to manage Alignment Sets, i.e. sets of sentences aligned at the word (or phrase) level.
- AlSetLib.pm
- AlignmentSlice.pm
lib/Lingua/Align/Features/Alignment.pm
TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC
- coocfreq - count co-occurrence frequencies for arbitrary features of nodes in a parallel treebank
- sta2moses - convert from Stockholm Tree Aligner format to Moses/GIZA++ (plain text)
- treealigneval - a script for computing precision and recall scores for tree alignment
Lingua::TT::TextAlignment - TT Utils: alignment raw text <-> tokenized text
The RTT "raw + text + tags" format is a "vertical" text format for combined storage of explicit token boundaries together with raw original (untokenized) text. It is a line-based format with lines of the form: "%%$RTT:COMPACT=" *BOOL* RTT processing...
MOOCOW/Lingua-TT-0.13 - 16 Nov 2021 18:17:55 UTC
opus2moses - convert aligned plain text files into OPUS XML
"moses2opus" converts a sentence-aligned corpus in plain text format (Moses format or fast_align format) into the OPUS XML-based format. It also does sentence-splitting based on Lingua::Sentence...
TIEDEMANN/OPUS-Tools-0.2.2 - 26 Aug 2020 09:34:28 UTC
Text::Fuzzy - Partial string matching using edit distances
This module calculates edit distances between words, and searches arrays and files to find the nearest entry by edit distance. It handles both byte strings and character strings (strings containing Unicode), treating each Unicode character as a singl...
BKB/Text-Fuzzy-0.29 - 10 Dec 2020 05:39:33 UTC
DBIx::Admin::TableInfo - A wrapper for all of table_info(), column_info(), *_key_info()
"DBIx::Admin::TableInfo" is a pure Perl module. It is a convenient wrapper around all of these DBI methods: o table_info() o column_info() o primary_key_info() o foreign_key_info() o MySQL Warning: To get foreign key information in the output, the cr...
RSAVAGE/DBIx-Admin-TableInfo-3.04 - 04 Feb 2021 04:52:42 UTC
sentences - Command line tool for text segmentation, tokenization and annotation
"sentences" is a command line tool for text segmentation and annotation. It uses the "fsentences" function from "Lingua::PT::PLNbase". Its main behaviour is the detection of sentences and paragraphs, and their annotation with XML-like tags: <s> for s...
AMBS/Lingua-PT-PLNbase-0.29 - 20 Feb 2022 10:10:09 UTC
Lingua::PTD - Module to handle PTD files in Dumper Format
PTD files in Perl Dumper format are simple hash references. But they use a specific structure, and this module provides a simple interface to manipulate it. "new" The "new" constructor returns a new Lingua::PTD object. This constructor receives a P...
AMBS/Lingua-PTD-1.17 - 01 Jul 2022 18:58:39 UTC
Lingua::NATools - A framework for Parallel Corpora processing
This is a collection of functions used in the NATools tools. Some of them can be used independently. Check documentation below. "init" Use this function to initialize a parallel corpora repository. You must supply a "directory" where the repository ...
AMBS/Lingua-NATools-v0.7.12 - 09 Nov 2020 21:40:02 UTC
Lingua::POSAlign - Local alignment tool for POS tags
This module enables you to make pairwise alignments between any two POS tag sequences. use Lingua::POSAlign; my $n = new Lingua::POSAlign; $n->align([qw(DT MD VB VBN)], [qw(DT NNS VBP VBN)]); $a = $n->alignment; # Return the alignment result $n->tota...
XERN/Lingua-POSAlign-0.01 - 11 Jul 2005 10:56:20 UTC
Lingua::EO::Orthography - An orthography/substitute converter for Esperanto characters
Six letters of the Esperanto alphabet do not exist in ASCII. These letters, which have supersigns (eo: supersignoj), are often spelled in substitute notations (eo: surogataj skribosistemoj) for historical reasons, namely, for the ages of typography and typew...
MORIYA/Lingua-EO-Orthography-0.04 - 24 Dec 2013 17:11:43 UTC
Lingua::FuzzyTrans::PT2GL - Translates Portuguese words to Galician using fuzzy replacements
A simple rule-substitution-based fuzzy translator from Portuguese words to Galician words, as depicted in Dictionary Alignment based on Rewrite-based Entry Translation <http://drops.dagstuhl.de/opus/volltexte/2013/4041/pdf/16.pdf>. translate This is...
AMBS/Lingua-FuzzyTrans-PT2GL-0.001 - 16 Oct 2017 13:16:08 UTC