Text::Perfide::BookSync - Synchronize books in plain text format.
Text::Perfide::BookSync performs a section-level structural alignment of books in plain text format. The books must first be annotated with Text::Perfide::BookCleaner.
Generates an HTML file containing a matrix showing the matches between sections of two books.
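As a rough illustration of what such a match matrix could look like, the following Python sketch renders an HTML table with one row per section of the first book and one column per section of the second. It is not the module's Perl code: the function name, the input shapes and the "x" cell marker are all assumptions made for this example.

```python
from html import escape


def match_matrix_html(left_titles, right_titles, matches):
    """Render an HTML table marking which section pairs match.

    matches is a set of (left_index, right_index) pairs (hypothetical input).
    """
    header = "".join("<th>%s</th>" % escape(t) for t in right_titles)
    rows = ["<tr><th></th>%s</tr>" % header]
    for i, title in enumerate(left_titles):
        # One cell per section of the right-hand book; "x" marks a match.
        cells = "".join(
            "<td>%s</td>" % ("x" if (i, j) in matches else "")
            for j in range(len(right_titles))
        )
        rows.append("<tr><th>%s</th>%s</tr>" % (escape(title), cells))
    return "<table>\n" + "\n".join(rows) + "\n</table>"


html_table = match_matrix_html(["A"], ["B", "C"], {(0, 1)})
```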
Given two files FILEL and FILER, creates new versions of these files (FILEL.sync and FILER.sync) with synchronization tags <sync id="x"> marking the points where the texts synchronize.
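The tagging step can be pictured with a short Python sketch. It assumes the synchronization points are already known as character offsets and that ids are consecutive integers. Both are assumptions for illustration, and the function name is hypothetical rather than part of the module.

```python
def insert_sync_tags(text, offsets):
    """Return text with a <sync id="n"> tag inserted at each character offset."""
    pieces = []
    previous = 0
    # Ids are assigned in document order, starting at 1 (an assumption).
    for sync_id, offset in enumerate(sorted(offsets), start=1):
        pieces.append(text[previous:offset])
        pieces.append('<sync id="%d">' % sync_id)
        previous = offset
    pieces.append(text[previous:])
    return "".join(pieces)


left = "Chapter 1\nText A\nChapter 2\nText B\n"
tagged = insert_sync_tags(left, [0, 17])
```

Applying the same function to both files with the same number of offsets gives two outputs whose tags with equal ids mark corresponding positions.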
Given two files FILEL and FILER, splits them by their synchronization points, storing each chunk in a file, where each FILEL.cXX matches FILER.cXX.
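A minimal Python sketch of the splitting idea follows, assuming the input already contains `<sync id="x">` tags as described above. The helper name is hypothetical, and writing each chunk to its own file is left out on purpose.

```python
import re

SYNC_TAG = re.compile(r'<sync id="([^"]+)">')


def split_at_sync_points(text):
    """Return a list of (sync_id, chunk_text) pairs, one per sync tag.

    Any text before the first tag is discarded in this sketch.
    """
    parts = SYNC_TAG.split(text)
    # parts looks like [before_first_tag, id1, chunk1, id2, chunk2, ...]
    return list(zip(parts[1::2], parts[2::2]))


left_chunks = split_at_sync_points('<sync id="1">A one<sync id="2">A two')
right_chunks = split_at_sync_points('<sync id="1">B um<sync id="2">B dois')
# Chunks with the same position (and id) in both lists correspond to each other.
pairs = list(zip(left_chunks, right_chunks))
```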
Calculates the chunks for a given pair of files. A chunk is a group of consecutive sections that are merged so that it matches the corresponding chunk in the other file.
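One way to picture the grouping is sketched below in Python. It assumes a list of section pairs already judged to match (how the module finds them is not described here) and starts a new chunk at every such pair. The function name and input format are hypothetical.

```python
def build_chunks(n_left, n_right, matches):
    """Group consecutive sections into pairs of chunks.

    matches: (left_index, right_index) pairs of sections assumed to match,
    strictly increasing in both coordinates (an assumption of this sketch).
    Returns a list of (left_section_indexes, right_section_indexes).
    """
    # Each chunk starts at a matched pair; the first chunk always starts at
    # (0, 0) so that leading unmatched sections are not lost.
    starts = [(0, 0)] + [m for m in sorted(matches) if m[0] > 0 and m[1] > 0]
    bounds = starts + [(n_left, n_right)]
    chunks = []
    for (l0, r0), (l1, r1) in zip(bounds, bounds[1:]):
        chunks.append((list(range(l0, l1)), list(range(r0, r1))))
    return chunks


chunks = build_chunks(5, 6, [(0, 0), (2, 3), (4, 5)])
```

Here sections 0 and 1 of the first file are grouped against sections 0 to 2 of the second, because the next matching pair only occurs at (2, 3).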
For a file whose sections have been delimited with Text::Perfide::BookCleaner, creates and returns a list with information about each section: id, start offset and end offset.
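The shape of such a list can be sketched in Python as follows. The section marker used here (a line starting with `=== `) is a placeholder chosen for the example, not the annotation that Text::Perfide::BookCleaner actually writes, and offsets are taken to be character positions.

```python
import re


def list_sections(text, marker=r"^=== .*$"):
    """Return one dict per section with its id, start offset and end offset.

    marker is a hypothetical regex for a section delimiter line.
    """
    starts = [m.start() for m in re.finditer(marker, text, flags=re.MULTILINE)]
    # Each section ends where the next one starts; the last ends at EOF.
    ends = starts[1:] + [len(text)]
    return [
        {"id": i, "start": s, "end": e}
        for i, (s, e) in enumerate(zip(starts, ends), start=1)
    ]


book = "=== One\naaa\n=== Two\nbb\n"
sections = list_sections(book)
```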
Calculates metrics on each pair of sections (length in words, ...)
Calculates metrics on each pair of chunks (length in words, ...)
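A small Python sketch of a per-pair metric is given below. Length in words is the only metric named above; the length ratio is added here purely as an assumed example of what else might be measured, and the function names are hypothetical.

```python
def word_count(text):
    """Count whitespace-separated words."""
    return len(text.split())


def pair_metrics(left_text, right_text):
    """Return word counts for a pair of sections or chunks, plus their ratio."""
    left_words = word_count(left_text)
    right_words = word_count(right_text)
    # The ratio is an illustrative extra, undefined when the right side is empty.
    ratio = left_words / right_words if right_words else None
    return {"left_words": left_words, "right_words": right_words, "ratio": ratio}


metrics = pair_metrics("one two three", "um dois")
```

The same function works for sections and for chunks, since both are just spans of text.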
<andrefs at cpan.org>
Jose Joao Almeida, <jj at di.uminho.pt>
Please report any bugs or feature requests to bug-text-perfide-booksync at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Text-Perfide-BookSync. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.
Copyright 2011 Project Natura.
This program is free software; you can redistribute it and/or modify it under the terms of either: the GNU General Public License as published by the Free Software Foundation; or the Artistic License.
See http://dev.perl.org/licenses/ for more information.