Anders Ardö > Combine-4.003 > combineRank

Download:
Combine-4.003.tar.gz

Annotate this POD

CPAN RT

Open  0
Report a bug
Source  

NAME ^

combineRank - calculates various Ranks for a Combine crawled database

SYNOPSIS ^

combineRank <action> --jobname <name> --verbose

where action can be one of PageRank, PageRankBL, NetLocRank, and exportLinkGraph. Results on STDOUT.

OPTIONS AND ARGUMENTS ^

jobname is used to find the appropriate configuration (mandatory)

verbose enables printing of ranks to STDOUT as SQL INSERT statements

Actions calculating variants of PageRank

PageRank

calculate standard PageRank

PageRankBL

calculate PageRanks with backlinks added for each link

NetLocRank

calculate SiteRank for each site and a local DocRank for documents within each site. Global ranks are then calulated as SiteRank * DocRank

Actions exporting link data

exportLinkGraph

export linkgraph from Combine database

DESCRIPTION ^

Implements calculation of different variants of PageRank.

Results are written to STDOUT and can be huge for large databases.

Linkgraph is exported in ASCII as a sparse matrix, one row per line. First integer is the ID (urlid) of a page with links. The rest of integers on the line are IDs for pages linked to. Ie 121 5624 23416 51423 267178 means that page 121 links to pages 5624 23416 51423 267178

EXAMPLES ^

combineRank --jobname aatest --verbose PageRankBL

calculate PageRank with backlinks, result on STDOUT

combineRank --jobname aatest --verbose exportLinkGraph

export the linkgraph to STDOUT

SEE ALSO ^

combine

Combine configuration documentation in /usr/share/doc/combine/.

AUTHOR ^

Anders Ardö, <anders.ardo@it.lth.se>

COPYRIGHT AND LICENSE ^

Copyright (C) 2006 Anders Ardö

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.4 or, at your option, any later version of Perl 5 you may have available.

See the file LICENCE included in the distribution at http://combine.it.lth.se/