Monday, November 29, 2010

PageRank Algorithm Using Mapreduce

Description  -  http://en.wikipedia.org/wiki/PageRank


Examples of pagerank calculation - http://pagerank.suchmaschinen-doktor.de/index/examples.html

Formula for pagerank calculation:

PR(A) = (1 - d) + d * SUM ((PR(I->A)/C(I))
Where:
  • PR(A) is the PageRank of your page A.
  • d is the damping factor, usually set to 0,85.
  • PR(I->A) is the PageRank of page I containing a link to page A.
  • C(I) is the number of links off page I.
  • PR(I->A)/C(I) is a PR-value page A receives from page I.
  • SUM (PR(I->A)/C(I)) is the sum of all PR-values page A receives from pages with links to page A..



 Inputs and Outputs of Mapper and Reducer - https://wiki.umiacs.umd.edu/ccc/images/e/ea/CLuE-Jagannathan.pdf 







Video lecture


1 comment:

  1. I can set up my new idea from this post. It gives in depth information. Thanks for this valuable information for all,..
    check pagerank

    ReplyDelete