no node2vec: Scalable Feature Learning for Networks Aditya Grover - PowerPoint PPT Presentation

no node2vec: Scalable Feature Learning for Networks Aditya Grover and Jure Leskovec. KDD 2016. Presented by Haoxiang Wang. Feb 26, 2020.

Node Embeddings Ou Outpu put A B In Input ´ Intuition: Find embeddings of nodes in a d- dimensional space so that “similar” nodes in the graph have embeddings that are close together.

Setup ´ Assume we have a graph G : ´ V is the vertex set (i.e., node set). ´ A is the adjacency matrix (assume binary).

Embedding Nodes ´ Goal: to encode nodes so that similarity in the embedding space (e.g., dot product) approximates similarity in the original network. similarity( u, v ) ≈ z > v z u

Random Walk Embeddings: Basic Idea probability that u and v co-occur on a z > u z v ≈ random walk over the network 1. Estimate probability of visiting node v on a random walk starting from node u using some random walk strategy R . 2. Optimize embeddings to encode these random walk statistics.

Algorithm/Optimization of Random Walk Embeddings 1. Run short random walks starting from each node on the graph using some strategy R . For each node u collect N R ( u ) , the multiset * of nodes 2. visited on random walks starting from u. （ * N R ( u ) can have repeat elements since nodes can be visited multiple times on random walks. ） 3. Optimize embeddings to according to: X X L = − log( P ( v | z u )) u ∈ V v ∈ N R ( u ) exp( z > u z v ) P ( v | z u ) = n 2 V exp( z > P u z n ) In practice, random sampling based on some distribution over nodes

Node2vec: Biased Random Walks ´ Idea: use flexible, biased random walks that can trade off between local and global views of the network (Grover and Leskovec, 2016). ´ BFS (Breath-First Search)and DFS (Depth-First Search): Two classic strategies to define a neighborhood 𝑂 , 𝑣 of a given node 𝑣 : 𝑂 ./0 𝑣 = { 𝑡 4 , 𝑡 6 , 𝑡 7 } s 1 s 2 s 8 Local microscopic view s 7 BFS u s 6 DFS 𝑂 9/0 𝑣 = { 𝑡 : , 𝑡 ; , 𝑡 < } s 9 s 4 s 5 s 3 Global macroscopic view

Combine BFS + DFS by a Ratio Biased random walk 𝑆 that Unnormalized given a node 𝑣 generates Walker is at 𝑥 . transition prob. neighborhood 𝑂 , 𝑣 Where to go next? ´ Two parameters: 1 s 2 s 3 s 1 1/𝑞 ´ Return parameter 𝑞 : w s 2 1 w → 1/𝑟 Return back to the s 1 s 3 1/𝑟 u 1/𝑞 previous node BFS-like walk: Low value of 𝑞 ´ Walk-away parameter DFS-like walk: Low value of 𝑟 𝑟 : Moving outwards (DFS) vs. inwards (BFS)

Benchmarks: Node Classification & Link Prediction ? ? Node ? Classification ? Machine Learning ? ? Link Prediction ? x ? Machine Learning

Link Prediction Empirical Results Node Classification

Advantages of Node2Vec ´ node2vec performs better on node classification compared with other node embedding methods. ´ Random walk approaches are generally more efficient (i.e., O(|E|) vs. O(|V| 2 ) ) ´ (Note: In general , one must choose definition of node similarity that matches application. )

Other random walk node embedding works ´ Different kinds of biased random walks: ´ Based on node attributes (Dong et al., 2017). ´ Based on a learned weights (Abu-El-Haija et al., 2017) ´ Alternative optimization schemes: ´ Directly optimize based on 1-hop and 2-hop random walk probabilities (as in LINE from Tang et al. 2015). ´ Network preprocessing techniques: ´ Run random walks on modified versions of the original network (e.g., Ribeiro et al. 2017’s struct2vec, Chen et al. 2016’s HARP).

no node2vec: Scalable Feature Learning for Networks Aditya Grover - PowerPoint PPT Presentation

no node2vec: Scalable Feature Learning for Networks Aditya Grover and Jure Leskovec. KDD 2016. Presented by Haoxiang Wang. Feb 26, 2020. Node Embeddings Ou Outpu put A B In Input Intuition: Find embeddings of nodes in a d-

node2vec: Scalable Feature Learning for Networks Aditya Grover, Jure Leskovec Farzaneh Heidari

Churn Prediction using Dynamic RFM-Augmented node2vec Sandra Mitrovi , Jochen de Weerdt, Bart

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec Jiezhong

node2vec: Scalable Feature Learning for Networks Presenter: Tom Novek, Faculty of

Introduction CSCE CSCE 496/896 496/896 Lecture 9: Lecture 9: word2vec and word2vec and To

node2vec: Scalable F Feature Learning f for Networks A paper by Aditya Grover and Jure

Annuity Adjustments Caleb CJJohnson Presenter 1 Effective Rates and Annuity Adjustments

Classes and Objects Object Oriented Programming Genome 559: Introduction to Statistical and

Java EE6 and JBoss AS6 What's coming? Jason T. Greene JBoss, a Division of Red Hat 1 Monday,

EIO: ERROR CHECKING IS OCCASIONALLY CORRECT HARYADI S. GUNAWI, CINDY RUBIO-GONZLEZ, ANDREA C.

Quickest Quickest for maintaining sorted sets. Sorting Sorting British codebreakers used

Speaking the same language as Developers and DBAs Michael Coburn Percona Michael Coburn

Knowledge bases domainindependent algorithms Inference engine Knowledge base

The Manhattan Project - Personalities and Problems Fromm Institute Fall 2020

CS5412: OTHER DATA CENTER SERVICES Lecture IX Ken Birman Tier two and Inner Tiers 2 If

Atoms Convention We usually use p , q , p 1 , etc, instead of sentences like The sun is

Logic in Software, Dynamical and Biological Systems Ashish Tiwari SRI International Menlo Park,

Development By The Numbers We Are Going To Measure Complexity Why Should We Care About

Dijkstra Variants: A* and Potentials Eric Price UT Austin CS 331, Spring 2020 Coronavirus

Information Effects for Understanding Type Systems Or: how someone else found the maths to

Graphs Part I: Basic algorithms Laura Toma Algorithms (csci2200), Bowdoin College Part I: Basic

CPSC 490 DP Part 2: Max-IS on Arrays, LCS, Recovery, and Binary Exponentiation Lucca Siaudzionis

The Role of Information and Communication Technologies in the Development of Inclusive Society for

Support vector machines Lecture 4 David Sontag New York University Slides adapted from Luke