combineExport - export records in XML from Combine database
combineExport --jobname <name> [--profile alvis|dc|combine --charset utf8|isolatin --number <n> --recordid <n> --md5 <MD5> --incremental --xsltscript ...]
jobname is used to find the appropriate configuration (mandatory)
Three profiles: alvis, dc, and combine . alvis and combine are similar XML formats.
'alvis' profile format is defined by the Alvis enriched document format DTD. It uses charset UTF-8 per default.
'combine' is more compact with less redundancy.
'dc' is XML encoded Dublin Core data.
Selects a specific characterset from UTF-8, iso-latin-1 Overrides --profile settings.
Skip inlinks with duplicate anchor-texts (ie just one inlink per unique anchor-text).
Do not include any outlinks in the exported records.
ZebraIndex sends XML records directly to the Zebra server defined in Combine configuration variable 'ZebraHost'. It uses the default Zebra configuration: profile=combine, nooutlinks, collapseinlinks and is compatible with the direct Zebra indexing done during harvesting when 'ZebraHost' is defined in the Combine configuration. Requires that the Zebra server is running.
SolrIndex sends XML records directly to the Solr server defined in Combine configuration variable 'SolrHost'. It uses the default Solr configuration: profile=combine, nooutlinks, collapseinlinks and is compatible with the direct Solr indexing done during harvesting when 'SolrHost' is defined in the Combine configuration. Requires that the Solr server is running.
Generates records in Combine native format and converts them using this XSLT script before output. See example scripts in /etc/combine/*.xsl
the max number of records to be exported
Export just the one record with this recordid
Export just the one record with this MD5 checksum
Specifies the server-name and port to connect to and export data using the Alvis Pipeline. Exports incrementally, ie all changes since last call to combineExport with the same pipehost and pipeport.
Exports incrementally, ie all changes since last call to combineExport using --incremental
Export all records in Alvis XML-format to the file recs.xml combineExport --jobname atest > recs.xml Export 10 records to STDOUT combineExport --jobname atest --number 10 Export all records in UTF-8 using Combine native format combineExport --jobname atest --profile combine --charset utf8 > Zebrarecs.xml Incremental export of all changes from last call using localhost at port 6234 using the default profile (Alvis) combineExport --jobname atest --pipehost localhost --pipeport 6234
Combine configuration documentation in /usr/share/doc/combine/.
Alvis XML schema (--profile alvis) at http://project.alvis.info/alvis_docs/enriched-document.xsd
Anders Ardö, <anders.ardo@it.lth.se>
Copyright (C) 2005 - 2006 Anders Ardö
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.4 or, at your option, any later version of Perl 5 you may have available.
See the file LICENCE included in the distribution at L<http://combine.it.lth.se/>
1 POD Error
The following errors were encountered while parsing the POD:
Non-ASCII character seen before =encoding in 'Ardö,'. Assuming CP1252
To install Combine::UA, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Combine::UA
CPAN shell
perl -MCPAN -e shell install Combine::UA
For more information on module installation, please visit the detailed CPAN module installation guide.