pagerank

PageRank Model of internet: Users click random link on a page.


  1. Mathematics for Computer Science, MIT 6.042J/18.062J. Google Rankings. Albert R Meyer, May 13, 2015.
     PageRank (by Google founder Larry Page): Which webpages are “more important?” Model of internet: • Users click random link on a page. • Occasionally start over. A page is “more important” if viewed a large fraction of the time.
     Random Walk on the Web: View the entire web as a digraph: • vertices are webpages • edge (V,W) exists if there is a link from page V to page W • edges out of V are equally likely: Pr[(V,W)] = 1/outdeg(V).
     Random Walk on the Web: To model starting over: * add a “super-node” to the graph * an edge from super-node to each other node * edges from each other node back to super-node (may get customized probabilities).
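
The random-walk model above can be sketched in a few lines of Python. This is a minimal illustration, not code from the lecture: the function name `transition_matrix`, the node label `"SUPER"`, and the dictionary input format are all assumptions made for the example. It also assumes every link target appears as a key in `links`, and that the edge back to the super-node counts toward outdeg(V).

```python
from fractions import Fraction

def transition_matrix(links):
    """Build the walk's transition matrix for a web graph plus a super-node.

    links: dict mapping each page to the list of pages it links to
           (hypothetical input format; every target must also be a key).
    Returns (nodes, M) with M[i][j] = Pr[next node is nodes[j] | at nodes[i]].
    """
    pages = sorted(links)
    nodes = ["SUPER"] + pages              # index 0 is the added super-node
    index = {name: i for i, name in enumerate(nodes)}
    n = len(nodes)
    M = [[Fraction(0)] * n for _ in range(n)]

    # An edge from the super-node to each other node, all equally likely.
    for page in pages:
        M[0][index[page]] = Fraction(1, len(pages))

    # Each page keeps its own links and gains an edge back to the super-node;
    # edges out of V are equally likely: Pr[(V, W)] = 1/outdeg(V).
    for page in pages:
        targets = set(links[page]) | {"SUPER"}
        for target in targets:
            M[index[page]][index[target]] = Fraction(1, len(targets))
    return nodes, M

# Toy web: A links to B and C, B links to C, C links to A.
nodes, M = transition_matrix({"A": ["B", "C"], "B": ["C"], "C": ["A"]})
for name, row in zip(nodes, M):
    print(name, [str(x) for x in row])
```

Exact fractions are used so that each row can be checked to sum to exactly 1. The slide notes that real systems may use customized probabilities instead of uniform ones; that variation is not modeled here.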

  2. Super-node: [figure: state diagram with coin-flip states (H, T, HH, HT, TH, TT), edge probabilities ½, and win/lose outcomes]
     PageRank: Compute the stationary distribution s. PageRank(V) ::= s_V. Rank V above W when s_V > s_W.
     Importance of Super-node: ensures * a unique stable distribution s * every initial distribution p converges to s: lim_{t→∞} p ⋅ M^t = s * convergence is rapid: t is small, so easy to compute.
     Resistance to scamming: * Creating fake nodes pointing to self * Adding links to important nodes: won’t improve PageRank.
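
As a rough illustration of the limit p ⋅ M^t → s, the sketch below repeatedly multiplies a distribution by a transition matrix until it stops changing. The helper names `step` and `stationary`, the tolerance, and the four-node toy graph (super-node plus pages A, B, C with links A→B, A→C, B→C, C→A) are assumptions chosen for the example, not material from the slides.

```python
def step(p, M):
    """One step of the walk: return the row vector p . M."""
    n = len(M)
    return [sum(p[i] * M[i][j] for i in range(n)) for j in range(n)]

def stationary(M, p, tol=1e-12, max_steps=10_000):
    """Iterate p <- p . M until successive distributions differ by < tol.

    Returns (distribution, number_of_steps_taken).
    """
    for t in range(1, max_steps + 1):
        q = step(p, M)
        if max(abs(a - b) for a, b in zip(p, q)) < tol:
            return q, t
        p = q
    raise RuntimeError("did not converge within max_steps")

# Rows and columns in the order SUPER, A, B, C (toy graph, see lead-in).
M = [
    [0,   1/3, 1/3, 1/3],   # SUPER -> every page
    [1/3, 0,   1/3, 1/3],   # A -> B, C, SUPER
    [1/2, 0,   0,   1/2],   # B -> C, SUPER
    [1/2, 1/2, 0,   0],     # C -> A, SUPER
]

# Two very different starting distributions reach the same s.
s1, t1 = stationary(M, [1, 0, 0, 0])
s2, t2 = stationary(M, [0, 0, 0, 1])

# PageRank(V) ::= s_V; rank V above W when s_V > s_W.
pages = ["A", "B", "C"]
ranking = sorted(pages, key=lambda v: s1[pages.index(v) + 1], reverse=True)
print(ranking, t1, t2)
```

For this toy graph the stationary equations s = s ⋅ M can also be solved by hand, giving s = (27, 21, 16, 24)/88 for (SUPER, A, B, C), so C ranks above A, which ranks above B.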

  3. Actual Google Rank: Google rank rules are a closely held trade secret, using text, location, payment, and other criteria that have evolved for 15 years. But PageRank continues to play a significant role.

  4. MIT OpenCourseWare https://ocw.mit.edu 6.042J / 18.062J Mathematics for Computer Science, Spring 2015. For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.
