Search results for "TIEDEMANN"
uplug - the main startup script for the Uplug toolbox
The basic use of this startup script is to load a Uplug module, to parse its configuration and to run it using the command-line arguments give. Uplug modules may consist of complex processing pipelines and loops and Uplug tries to build system calls ...
TIEDEMANN/uplug-main-0.3.8 - 16 Mar 2013 20:19:32 UTC - Search in distribution- Uplug - a toolbox for processing (parallel) text corpora
- uplug-readalign - read sentence alignment in XCES align format
- xces2moses - based on XML::XCES
- 2 more results from uplug-main »
treealign - training tree alignment classifiers and aligning syntactic trees
This script allows you to train a tree alignment model and to apply them to parallel treebanks. Tree alignment is based on local binary classification and rich feature sets. Currently, training data has to be in Stockholm Tree Aligner format. The out...
TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 UTC - Search in distribution- coocfreq - count co-occurrence frequencies for arbitrary features of nodes in a parallel treebank
- treealigneval - a script for computing precision and recall scores for tree aligmnent
- doc::index
- 44 more results from Lingua-Align »
iso639 - a simple script to convert language codes
TIEDEMANN/ISO-639-3-0.03
-
26 Aug 2020 09:40:00 UTC
-
Search in distribution
- ISO::639::3 - Language codes and names from ISO::639
pdf2xml - extract text from PDF files and wraps it in XML
pdf2xml tries to combine the output of several conversion tools in order to improve the extraction of text from PDF documents. Currently, it uses pdftotext, Apache Tika and pdfxtk. In the default mode, it calls all tools to extract text and pdfxtk is...
TIEDEMANN/Text-PDF2XML-0.3.3 - 11 Feb 2019 14:54:41 UTC - Search in distribution- Text::PDF2XML - extract text from PDF files and wraps it in XML
opus-read - read sentence alignment in XCES align format
"opus-read" is a simple script to read sentence alignments stored in XCES align format and prints the aligned sentences to STDOUT. It requires monolingual alignments (ascending order, no crossing links) of sentences in linked XML files. Linked XML fi...
TIEDEMANN/OPUS-Tools-0.2.2 - 26 Aug 2020 09:34:28 UTC - Search in distribution- opus2moses
- tmx2opus - convert TMX into OPUS XML
- opus2tmx
- 12 more results from OPUS-Tools »
srt2xml - script for converting SRT-files (subtitles) to tokenized XML
This script detects sentence boundaries and tokenizes the text in given SRT movie subtitle files and creates XML output....
TIEDEMANN/Text-SRT-Align-0.2 - 15 Sep 2018 12:40:54 UTC - Search in distribution- srtalign - align movie subtitles based on time overlaps
- Text::SRT::Align - sentence alignment for movie subtitles based on time overlaps
langgroup - print language groups according to ISO639-5
TIEDEMANN/ISO-639-5-0.05
-
10 Feb 2021 13:49:00 UTC
-
Search in distribution
Uplug::CS - Uplug Language pack for Czech
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::CS package includes configuration files fo...
TIEDEMANN/uplug-cs-0.2 - 30 Jan 2013 12:53:19 UTC - Search in distribution
Uplug::DA - Uplug Language pack for Danish
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::DA package includes configuration files fo...
TIEDEMANN/uplug-da-0.2 - 30 Jan 2013 12:54:51 UTC - Search in distribution
Uplug::DE - Uplug Language pack for German
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::DE package includes configuration files fo...
TIEDEMANN/uplug-de-0.2 - 30 Jan 2013 12:55:17 UTC - Search in distribution
Uplug::EN - Uplug Language pack for English
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::EN package includes configuration files fo...
TIEDEMANN/uplug-en-0.2 - 17 Dec 2012 16:20:23 UTC - Search in distribution
Uplug::FR - Uplug Language pack for French
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::FR package includes configuration files fo...
TIEDEMANN/uplug-fr-0.2 - 30 Jan 2013 12:55:40 UTC - Search in distribution
Uplug::HU - Uplug Language pack for Hungarian
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::HU package includes configuration files fo...
TIEDEMANN/uplug-hu-0.2 - 30 Jan 2013 12:56:02 UTC - Search in distribution
Uplug::RU - Uplug Language pack for Russian
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::RU package includes configuration files fo...
TIEDEMANN/uplug-ru-0.2 - 30 Jan 2013 12:57:14 UTC - Search in distribution
Uplug::SL - Uplug Language pack for Slovene
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::SL package includes configuration files fo...
TIEDEMANN/uplug-sl-0.2 - 30 Jan 2013 12:57:34 UTC - Search in distribution
Uplug::SV - Uplug Language pack for Swedish
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::SV package includes configuration files fo...
TIEDEMANN/uplug-sv-0.2 - 30 Jan 2013 12:57:56 UTC - Search in distribution
Lingua::Identify::Blacklists - Language identification for related languages based on blacklists
This module adds a blacklist classifier to a general purpose language identification tool. Related languages can easily be confused with each other and standard language detection tools do not work very well for distinguishing them. With this module ...
TIEDEMANN/Lingua-Identify-Blacklists-0.04 - 09 Nov 2012 21:19:10 UTC - Search in distribution
Uplug::TreeTagger - Uplug add-on for using treetagger models for POS tagging
Note that you need to install the main components of Uplug first. Download the latest version of uplug-main from <https://bitbucket.org/tiedemann/uplug> or from CPAN and install it on your system. The Uplug::TreeTagger package includes configuration ...
TIEDEMANN/uplug-treetagger-0.3.2 - 08 Jan 2013 20:45:55 UTC - Search in distribution
Acme::CPANAuthors::CPAN::MostScripts - Authors with the most number of scripts on CPAN
This module lists 50 CPAN authors with the most number of scripts on CPAN. This list is produced by querying a local mini CPAN mirror using this command: % lcpan authors-by-script-count | head -n 50 Statistics of the CPAN mirror: +-------------------...
PERLANCAR/Acme-CPANAuthors-CPAN-MostScripts-0.005 - 09 Dec 2021 00:06:12 UTC - Search in distribution