The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Search results for "module:Text::Extract::Word"

Text::Extract::Word - Extract text from Word files River stage one • 4 direct dependents • 8 total dependents

This simple module allows the textual contents to be extracted from a Word file. The code was ported from Java code, originally part of the Apache POE project, but extensive code changes were made internally....

SNKWATT/Text-Extract-Word-0.04 - 08 Mar 2012 17:13:44 UTC

Text::TermExtract - Extract terms from text River stage one • 1 direct dependent • 1 total dependent

Text::TermExtract takes a simple approach at extracting the most interesting terms from documents of arbitrary length. There's more scientific methods to term extraction, like Yahoo's online term extraction API (but you can't have it locally) and the...

MSCHILLI/Text-TermExtract-0.02 - 10 Mar 2008 05:14:28 UTC

Text::PDF2XML - extract text from PDF files and wraps it in XML River stage zero No dependents

Extract text from PDF using external tools and some post-processing heuristics. Here is an example with and without post-processing: raw: <p>PRESENTATION ET R A P P E L DES PRINCIPAUX RESULTATS 9</p> clean: <p>PRESENTATION ET RAPPEL DES PRINCIPAUX RE...

TIEDEMANN/Text-PDF2XML-0.3.3 - 11 Feb 2019 14:54:41 UTC

Text::ExtractWords - Perl extension for extract words from strings River stage one • 1 direct dependent • 4 total dependents

The aim of this module is to extract the words from the texts or mails to identify spam. But it can be used for another purpose....

HDIAS/Text-ExtractWords-0.08 - 13 Oct 2003 18:28:27 UTC

Text::Reform - Manual text wrapping and reformatting River stage two • 4 direct dependents • 67 total dependents

The "form" sub The "form()" subroutine may be exported from the module. It takes a series of format (or "picture") strings followed by replacement values, interpolates those values into each picture string, and returns the result. The effect is simil...

CHORNY/Text-Reform-1.20 - 05 Sep 2009 22:26:56 UTC

Text::Affixes - Prefixes and suffixes analysis of text River stage one • 2 direct dependents • 8 total dependents

Provides methods for prefix and suffix analysis of text....

KAPPA/Text-Affixes-0.09 - 20 Oct 2015 10:05:58 UTC

Text::Distill - Quick texts compare, plagiarism and common parts detection River stage zero No dependents

GRIBUSER/Text-Distill-0.5 - 09 Oct 2020 11:13:44 UTC

Text::Ngramize - Computes lists of n-grams from text. River stage zero No dependents

"Text::Ngramize" is used to compute the list of n-grams derived from the bytes, characters, or words of the text provided. Methods are included that provide positional information about the n-grams computed within the text....

KUBINA/Text-Ngramize-1.03 - 07 Mar 2010 04:04:11 UTC

Text::MetaText - Perl extension implementing meta-language for processing "template" text files. River stage zero No dependents

MetaText is a text processing and markup meta-language which can be used for processing "template" files. This module is a Perl 5 extension implementing a MetaText object class which processes text files, interpreting and acting on the embedded MetaT...

ABW/Text-MetaText-0.22 - 01 Sep 1998 14:15:59 UTC

Text::ToSentences - Perl extension for converting pieces of text into individual sentences. River stage zero No dependents

Extract sentences from a given piece of text. It is aware of acronyms and parenthesis (including some mistakes as not closing or not opening ones) EXPORT convert...

AMONTERO/Text-ToSentences-0.91 - 30 Dec 2009 21:22:03 UTC

Data::Table::Text - Write data in tabular text format. River stage two • 35 direct dependents • 35 total dependents

Write data in tabular text format. Version 20240203. The following sections describe the methods in each functional area of this module. For an alphabetic listing of all methods by name see Index....

PRBRENAN/Data-Table-Text-20240408 - 07 Apr 2024 19:30:28 UTC

Text::Util::Chinese - A collection of subroutines for processing Chinese Text River stage zero No dependents

GUGOD/Text-Util-Chinese-0.08 - 27 Jun 2020 15:35:23 UTC

Text::TokenStream::Lexer - reusable lexer for token-stream scanning River stage zero No dependents

A lexer instance is constructed by specifying regexes that match individual parts of the input text. Each regex is associated with a token type that will be used to distinguish the tokens found. The regexes are tried in the order they're given in the...

ARC/Text-TokenStream-0.04 - 09 Mar 2021 09:31:57 UTC

Text::Context::EitherSide - Get n words either side of search keywords River stage one • 1 direct dependent • 2 total dependents

Suppose you have a large piece of text - typically, say, a web page or a mail message. And now suppose you've done some kind of full-text search on that text for a bunch of keywords, and you want to display the context in which you found the keywords...

TMTM/Text-Context-EitherSide-1.4 - 04 May 2009 10:27:42 UTC

Text::NLP::Stanford::EntityExtract - Talks to a stanford-ner socket server to get named entities back River stage zero No dependents

ZARQUON/Text-NLP-Stanford-EntityExtract-0.07 - 04 Aug 2018 08:24:40 UTC

Text::Extract::MaketextCallPhrases - Extract phrases from maketext–call–looking text River stage zero No dependents

Well designed systems use consistent calls for localization. If you're really smart you've also used Locale::Maketext!! You will probably have a collection of data that contains things like this: $locale->maketext( ... ); (perl) [% locale.maketext( ....

DMUEY/Text-Extract-MaketextCallPhrases-0.94 - 17 Feb 2017 04:51:47 UTC
16 results (0.049 seconds)