The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version 0.3.0 - 2012-01-15

  • New features:
    • [LUCY-173] - Bundle Lemon parser generator
    • [LUCY-189] - Bundle utf8proc
    • [LUCY-191] - Unicode normalization The new analyzer Lucy::Analysis::Normalizer provides Unicode normalization, case folding and stripping of accents.
    • [LUCY-196] - UAX #29 tokenizer The new analyzer Lucy::Analysis::StandardTokenizer tokenizes according to the UAX #29 word break rules.
    • [LUCY-203] - Implement EasyAnalyzer EasyAnalyzer is simple analysis chain using the StandardTokenizer, the Normalizer, and the SnowballStemmer.
  • Bugfixes:
    • [LUCY-175] - Missing stdarg.h breaks build
    • [LUCY-176] - More INCREF/DECREF symbol collisions under Windows
    • [LUCY-178] - Discriminate between stdio and POSIX in large file support probing.
    • [LUCY-180] - ORQuery, ANDQuery, RequiredOptionalQuery optimizations affect scoring
    • [LUCY-181] - Perl Documentation fix - Remove links to classes within docs, that are not exposed to Perl
    • [LUCY-182] - highlighter bug when searching for duplicate terms [wordX wordX]
    • [LUCY-183] - Eliminate spurious "extra" query normalization
    • [LUCY-185] - Improve error handling when required params not supplied
    • [LUCY-186] - Terminate connection from SearchClient properly
    • [LUCY-187] - Adapt to GC changes in Perl 5.15
    • [LUCY-188] - Highlighter should accept weighted Queries (Compilers)
    • [LUCY-193] - Occasional schema file name collision under 'truncate'
    • [LUCY-195] - Revisit pthreads linking on OpenBSD.
    • [LUCY-206] - Work around broken export in Perl 5.15.6
  • Improvements:
    • [LUCY-142] - Port Clownfish compiler to C
    • [LUCY-143] - Convert Clownfish::Parser to Lemon
    • [LUCY-179] - Tighten UTF-8 validity checks.
    • [LUCY-197] - Clean whitespace for 0.3.0
    • [LUCY-204] - Process ClusterSearcher RPCs in parallel
    • [LUCY-205] - Parallel processing for SearchServer This introduces an incompatible API change: The port argument has moved from the constructor to the serve method and the password argument has been removed.
  • Tasks:
    • [LUCY-133] - Eliminate JSON::XS dependency
    • [LUCY-134] - Eliminate Parse::RecDescent dependency
    • [LUCY-137] - Dependency licensing in grant code

Documentation

A small OO language that forms symbiotic relationships with "host" languages.
Generate word break table and tests
Auto-format Lucy code.
Lucy smoke test script
update Lucy version strings in source files
Apache Lucy search engine library.
Tokenize/modify/filter text.
Normalize case, facilitating case-insensitive search.
A simple analyzer chain
Unicode normalization, case folding and accent stripping
Multiple Analyzers in series.
Split a string into tokens.
Reduce related words to a shared root.
Suppress a "stoplist" of common words.
Split a string into tokens.
Apache Lucy recipes.
Sample subclass of Query.
Sample subclass of QueryParser.
Near real-time index updates.
Quick-start guide to hacking on Apache Lucy.
Characteristics of Apache Lucy document ids.
Overview of index file format.
Manage indexes on shared volumes.
Crash course in information retrieval.
Step-by-step introduction to Apache Lucy.
How to choose and use Analyzers.
A more flexible app structure.
Specify per-field properties and behaviors.
Augment search results with highlighted excerpts.
Use Query objects instead of query strings.
Bare-bones search app.
A document.
A document read from an index.
Create and highlight excerpts.
Consolidate index segments in the background.
Abstract base class for reading index data.
Write data to an index.
Abstract base class for marking documents as deleted.
Retrieve stored documents.
Policies governing index updating, locking, and file deletion.
Read from an inverted index.
Build inverted indexes.
Iterator for a field's terms.
Read Lexicon data.
Multi-segment implementation of IndexReader.
Term-Document pairings.
Read postings data.
Single-segment IndexReader.
Write one segment of an index.
Warehouse for information about one segment of an inverted index.
Judge how well a document matches a query.
Point-in-time index file list.
An array of bits.
Exception.
Base class for all Lucy objects.
Configure major components of an index.
Default behaviors for binary fields.
Define a field's behavior.
Full-text search field type.
User-created specification for an inverted index.
Non-tokenized text type.
Intersect multiple result sets.
Process hits.
Collector which records doc nums in a BitVector.
Query-to-Matcher compiler.
Access search results.
Execute searches against a single index.
Leaf node in a tree created by QueryParser.
Query which matches all documents.
Match a set of document ids.
Invert the result set of another Query.
Query which matches no documents.
Union multiple result sets.
Query matching an ordered list of terms.
Base class for composite Query objects.
Aggregate results from multiple Searchers.
A specification for a search query.
Transform a string into a Query object.
Match a range of values.
Join results for two Queries, one required, one optional.
Base class for searching collections of documents.
Element of a SortSpec.
Specify a custom sort order for search results.
An offset, a length, and a weight.
Query which matches individual terms.
File System implementation of Folder.
Abstract class representing a directory.
Abstract class representing an interprocess mutex lock.
Lock exception.
Create Locks.
In-memory Folder implementation.
Query matching an ordered list of terms.

Modules

Clownfish compiler.
Perl bindings for a Clownfish::CFC::Hierarchy.
Generate Perl binding code for a Clownfish::CFC::Class.
Binding for an object method.
Binding for an object method.
Abstract base binding for a Clownfish::CFC::Function.
Convert between Clownfish and Perl via XS.
Create compressed Unicode tables for C programs
Basic search engine.
Read a Doc as a fixed-width byte array.
Write a Doc as a fixed-width byte array.
Similarity optimized for long fields.
Compressed doc storage.
Compressed doc storage.
Search multiple remote indexes.
Connect to a remote SearchServer.
Make a Searcher remotely accessible.
Build a caching filter based on results of a Query.
Matcher with arbitrary docs and scores.
Simple query parser, with no boolean operators.

Provides

in lib/LucyX/Search/Filter.pm
in lib/LucyX/Search/Filter.pm

Examples