The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

combineExport - export records in XML from Combine database

SYNOPSIS

combineExport --jobname <name> [--profile alvis|dc|combine --charset utf8|isolatin --number <n> --recordid <n> --md5 <MD5> --incremental --xsltscript ...]

OPTIONS AND ARGUMENTS

jobname is used to find the appropriate configuration (mandatory)

--profile

Three profiles: alvis, dc, and combine . alvis and combine are similar XML formats.

'alvis' profile format is defined by the Alvis enriched document format DTD. It uses charset UTF-8 per default.

'combine' is more compact with less redundancy.

'dc' is XML encoded Dublin Core data.

--charset

Selects a specific characterset from UTF-8, iso-latin-1 Overrides --profile settings.

Skip inlinks with duplicate anchor-texts (ie just one inlink per unique anchor-text).

Do not include any outlinks in the exported records.

--ZebraIndex

ZebraIndex sends XML records directly to the Zebra server defined in Combine configuration variable 'ZebraHost'. It uses the default Zebra configuration: profile=combine, nooutlinks, collapseinlinks and is compatible with the direct Zebra indexing done during harvesting when 'ZebraHost' is defined in the Combine configuration. Requires that the Zebra server is running.

--SolrIndex

SolrIndex sends XML records directly to the Solr server defined in Combine configuration variable 'SolrHost'. It uses the default Solr configuration: profile=combine, nooutlinks, collapseinlinks and is compatible with the direct Solr indexing done during harvesting when 'SolrHost' is defined in the Combine configuration. Requires that the Solr server is running.

--xsltscript

Generates records in Combine native format and converts them using this XSLT script before output. See example scripts in /etc/combine/*.xsl

--number

the max number of records to be exported

--recordid

Export just the one record with this recordid

--md5

Export just the one record with this MD5 checksum

--pipehost, --pipeport

Specifies the server-name and port to connect to and export data using the Alvis Pipeline. Exports incrementally, ie all changes since last call to combineExport with the same pipehost and pipeport.

--incremental

Exports incrementally, ie all changes since last call to combineExport using --incremental

DESCRIPTION

EXAMPLES

 Export all records in Alvis XML-format to the file recs.xml
   combineExport --jobname atest > recs.xml

 Export 10 records to STDOUT
   combineExport --jobname atest --number 10

 Export all records in UTF-8 using Combine native format
   combineExport --jobname atest --profile combine --charset utf8 > Zebrarecs.xml

 Incremental export of all changes from last call using localhost at port 6234 using the
 default profile (Alvis)
   combineExport --jobname atest --pipehost localhost --pipeport 6234

SEE ALSO

Combine configuration documentation in /usr/share/doc/combine/.

Alvis XML schema (--profile alvis) at http://project.alvis.info/alvis_docs/enriched-document.xsd

AUTHOR

Anders Ardö, <anders.ardo@it.lth.se>

COPYRIGHT AND LICENSE

Copyright (C) 2005 - 2006 Anders Ardö

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.4 or, at your option, any later version of Perl 5 you may have available.

 See the file LICENCE included in the distribution at
 L<http://combine.it.lth.se/>

1 POD Error

The following errors were encountered while parsing the POD:

Around line 366:

Non-ASCII character seen before =encoding in 'Ardö,'. Assuming CP1252