A Random Walk Around The Block
Johan Ugander, Stanford University
Joint work with: Isabel Kloumann (Facebook) & Jon Kleinberg (Cornell)
Google Mountain View, August 17, 2016
Seed set expansion
• Given a graph G = (V, E), the goal is to accurately identify a target set T ⊂ V from a smaller seed set S ⊂ T.
[Figure: example graph highlighting the target set T, the seed set S inside it, and the remaining nodes scored by Personalized PageRank]
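A minimal sketch of this pipeline, assuming networkx; the graph, parameter values, and the `expand_seed_set` helper are illustrative, not the method from the talk:

```python
# Seed set expansion by Personalized PageRank: rank non-seed nodes by the
# PPR mass they receive when the walk restarts at the seed set S.
import networkx as nx

def expand_seed_set(G, seeds, alpha=0.85, budget=50):
    # Restart distribution: uniform over the seed set, zero elsewhere.
    personalization = {v: (1.0 / len(seeds) if v in seeds else 0.0) for v in G}
    ppr = nx.pagerank(G, alpha=alpha, personalization=personalization)
    candidates = [v for v in G if v not in seeds]
    # The top `budget` candidates are the proposed expansion of S toward T.
    return sorted(candidates, key=ppr.get, reverse=True)[:budget]

# Toy usage on a planted two-community graph (parameters are illustrative).
G = nx.planted_partition_graph(2, 50, 0.2, 0.02, seed=1)
seeds = list(range(5))          # a few nodes from the first block
print(expand_seed_set(G, seeds)[:10])
```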
Seed set expansion
• Given a graph G = (V, E), the goal is to accurately identify a target set T ⊂ V from a smaller seed set S ⊂ T.
• Applications: broadly, ranking on graphs and recommendation systems
  • Spam filtering (Wu & Chellapilla '07)
  • Community detection (Weber et al. '13)
  • Missing data inference (Mislove et al. '14)
• Common methods:
  • Semi-supervised learning (Zhu et al. '03)
  • Diffusion-based classification (Jeh & Widom '03; Kloster & Gleich '14)
  • Outwardness, modularity, and more (Bagrow '08; Kloumann & Kleinberg '14)
Recall curves for seed set expansion (Kloumann & Kleinberg '14)
• Recall curve: true positive rate as a function of the number of items returned, based on a small, uniformly random seed set.
• Kloumann & Kleinberg '14 tested many different methods on data and broadly found Personalized PageRank to be best.
• Truncated PPR (first K steps) is comparable to full PPR from K = 4 onward.
• The Heat Kernel was later found to be comparable to PPR.
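A recall curve can be computed directly from a ranking of candidate nodes. The sketch below (plain Python, illustrative names) measures the fraction of the unseen target set T \ S recovered among the top-k returned items:

```python
def recall_curve(ranking, target, seeds):
    """ranking: candidate nodes in decreasing score order (seeds excluded);
    target: the true target set T; seeds: the seed set S, a subset of T."""
    unseen = set(target) - set(seeds)      # only non-seed targets count as recoverable
    hits, curve = 0, []
    for v in ranking:
        hits += (v in unseen)
        curve.append(hits / len(unseen))   # recall after returning the top-k items
    return curve
```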
Diffusion-based node classification
• Classification based on random walk landing probabilities.
• r_k^v: probability that a random walk starting in S is at v after k steps.
• (r_1^v, r_2^v, ..., r_K^v): truncated vector of landing probabilities.
• Personalized PageRank and Heat Kernel ranking:
  PPR(v) \propto \sum_{k=1}^{\infty} \alpha^k \, r_k^v, \qquad HK(v) \propto \sum_{k=1}^{\infty} \frac{t^k}{k!} \, r_k^v
• General diffusion score function:
  score(v) = \sum_{k=1}^{\infty} w_k \, r_k^v
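A minimal sketch of how the landing probabilities r_k and the general diffusion score could be computed by iterating the random-walk transition matrix; it assumes numpy, a dense adjacency matrix A with no isolated nodes, and uses illustrative function names:

```python
import numpy as np

def landing_probabilities(A, seeds, K):
    """Return [r_1, ..., r_K], where r_k[v] = Pr[walk started in S is at v after k steps]."""
    n = A.shape[0]
    d = A.sum(axis=1)                  # node degrees (assumed nonzero)
    P = A / d[:, None]                 # row-stochastic random-walk transition matrix
    r = np.zeros(n)
    r[seeds] = 1.0 / len(seeds)        # walk starts uniformly over the seed set S
    out = []
    for _ in range(K):
        r = r @ P                      # one step of the walk
        out.append(r.copy())
    return out

def diffusion_score(A, seeds, weights):
    """score(v) = sum_k w_k * r_k[v], truncated at K = len(weights) steps."""
    R = landing_probabilities(A, seeds, K=len(weights))
    return sum(w * r for w, r in zip(weights, R))
```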
Diffusion-based node classification
• Personalized PageRank and Heat Kernel = two parametric families of linear weights in score(v) = \sum_{k=1}^{K} w_k \, r_k^v:
  • PPR: w_k = \alpha^k
  • Heat Kernel: w_k = t^k / k!
[Figure: weight w_k versus walk length k (log scale), for PPR with \alpha = 0.85 and 0.99 and for the Heat Kernel with t = 1, 5, 15 (Kloster & Gleich '14)]
• Question in this work: What weights are "optimal" for diffusion-based classification?
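The two weight families, truncated at K terms, can be written down directly and plugged into the diffusion_score sketch above (illustrative code, assuming numpy):

```python
import numpy as np
from math import factorial

def ppr_weights(alpha, K):
    return np.array([alpha ** k for k in range(1, K + 1)])             # w_k = alpha^k

def heat_kernel_weights(t, K):
    return np.array([t ** k / factorial(k) for k in range(1, K + 1)])  # w_k = t^k / k!

# e.g. scores = diffusion_score(A, seeds, ppr_weights(0.85, K=10))
```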
The stochastic block model
• C blocks; focus on C = 2 blocks: 1 = "Target", 2 = "Other"
• n_1, n_2 nodes in the two blocks
• Independent edge probabilities:
  • Edge probability within a block = p_in
  • Edge probability across blocks = p_out
• (Results for C > 2 as well; see paper)
[Figure: two-block graph with dense within-block edges (p_in) and sparser cross-block edges (p_out)]
• A model with many names:
  • Stochastic Block Model (Holland et al. '83)
  • Affiliation Model (Frank & Harary '82)
  • Planted Partition Model (Dyer & Frieze '89)
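A small sketch of sampling such a two-block graph with independent edge probabilities p_in and p_out; it assumes numpy, and all parameter values are illustrative:

```python
import numpy as np

def sample_sbm(n1, n2, p_in, p_out, seed=0):
    """Sample a symmetric adjacency matrix for a two-block stochastic block model."""
    rng = np.random.default_rng(seed)
    n = n1 + n2
    block = np.array([0] * n1 + [1] * n2)     # block labels: 0 = "Target", 1 = "Other"
    # Pairwise edge probabilities: p_in within a block, p_out across blocks.
    probs = np.where(block[:, None] == block[None, :], p_in, p_out)
    upper = np.triu(rng.random((n, n)) < probs, k=1)   # sample each unordered pair once
    A = (upper | upper.T).astype(float)       # symmetric, no self-loops
    return A, block

A, block = sample_sbm(n1=50, n2=50, p_in=0.2, p_out=0.02)
```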