The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

wikicrawl - crawl Wikipedia to generate graph from the found article links

SYNOPSIS

Crawl wikipedia and create a Graph::Easy text describing the inter-article links that were found during the crawl.

At least one argument must be given to start:

        perl examples/wikicrawl.pl --lang=fr

ARGUMENTS

Here are the options:

--help

Print the full documentation, not just this short overview.

--version

Write version info and exit.

--language

Select the language of Wikipedia that we should crawl. Currently supported are 'de', 'en' and 'fr'. Default is 'en'.

--root

Set the root node where the crawl should start. Default is of course 'Xkcd'.

--maxdepth

The maximum depth the crawl should go. Please select small values under 10. Default is 4.

--maxspread

The maximum number of links we follow per article. Please select small values under 10. Default is 5.

--maxnodes

The maximum number of nodes we crawl. Set to -1 (default) to disable.

SEE ALSO

http://forums.xkcd.com/viewtopic.php?f=2&t=21300&p=672184 and Graph::Easy.

LICENSE

This library is free software; you can redistribute it and/or modify it under the terms of the GPL.

See the LICENSE file of Graph::Easy for a copy of the GPL.

AUTHOR

Copyright (C) 2008 by integral forum.xkcd.com Copyright (C) 2008 by Tels http://bloodgate.com