The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.
$Id: README 258 2008-09-03 08:29:42Z anders $

 Combine is highly configurable web focused crawler completely written in Perl

Please see documentation (DocMain.pdf) in the doc
subdirectory or go directly to the Web-site
http://combine.it.lth.se/


The following command sequence will install Combine:
  (you might have run 'make test' and 'make install' as root)

perl Makefile.PL
make
make test
make install
mkdir /etc/combine
cp conf/* /etc/combine/
mkdir /var/run/combine


INSTALLATION TEST
Harvest 1 URL by doing:

sudo combineINIT --jobname aatest --topic /etc/combine/Topic_carnivor.txt 
combine --jobname aatest --harvest http://combine.it.lth.se/CombineTests/InstallationTest.html
combineExport --jobname aatest --profile dc

and verify that the output, except for dates and order, looks like

<?xml version="1.0" encoding="UTF-8"?>
<documentCollection xmlns="http://alvis.info/enriched/" version="1.1">
  <dc:identifier>http://combine.it.lth.se/CombineTests/InstallationTest.html</dc:identifier>
  <dc:date>2006-05-19 11:57:03</dc:date>
  <dc:type>text/html</dc:type>
  <dc:title>Installation test for Combine</dc:title>
  <dc:format>text/html</dc:format>
  <dc:format>text/html; charset=iso-8859-1</dc:format>
  <dc:subject>Carnivorous plants, Drosera, Nepenthes</dc:subject>

</documentCollection>