Kernel methods and Graph kernels Social and Technological Networks Rik Sarkar University of Edinburgh, 2018.
Kernels • Kernels are a type of similarity measure • An important technique in machine learning • Used to increase the power of many techniques • Can be defined on graphs • Used to compare, classify and cluster many small graphs – E.g. molecules, neighborhoods of different people in social networks, etc.
The main ML question • For classes that can be separated by a line – ML is easy – E.g. Linear SVM, single neuron • But what if the structure is more complex? – The classes cannot be separated linearly
Lifting to higher dimensions • Suppose we lift every (x, y) point to three dimensions: (x, y) → (x, y, x² + y²) • Now there is a linear separator!
Exercise • Suppose we have the following data: • How would you lift and classify? • Assuming there is a mechanism to find linear separators if they exist
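A minimal sketch of the lifting idea, assuming synthetic "ring" data of the kind shown in the earlier figures (not the actual exercise data), and using scikit-learn's linear SVM as the "mechanism to find linear separators":

```python
# Lift 2D ring data to 3D with the extra coordinate x^2 + y^2,
# then fit a plain linear classifier in the lifted space.
import numpy as np
from sklearn.datasets import make_circles
from sklearn.svm import LinearSVC

X, y = make_circles(n_samples=200, factor=0.4, noise=0.05, random_state=0)

# Lift each (x, y) point to (x, y, x^2 + y^2): the inner and outer rings
# now differ in the third coordinate, so a plane can separate them.
X_lifted = np.column_stack([X, (X ** 2).sum(axis=1)])

clf = LinearSVC(C=1.0).fit(X_lifted, y)
print("training accuracy after lifting:", clf.score(X_lifted, y))
```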
Kernels • A similarity measure K: X × X → ℝ is a kernel if: • There is an embedding φ (usually to a higher dimension), – Such that: K(v, w) = ⟨φ(v), φ(w)⟩ – Where ⟨·,·⟩ denotes the inner product – These are positive definite kernels
Example kernel • For the examples we saw earlier, the following kernel helps: • K(v, w) = (v · w)² – This corresponds to the lifting map φ(v) = (v₁², √2 v₁v₂, v₂²) – Try it out!
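A quick numerical check of the "try it out" claim, with two arbitrary example vectors: the kernel value (v · w)² equals the inner product after applying the lifting map φ.

```python
# Verify that phi(v) = (v1^2, sqrt(2)*v1*v2, v2^2) realises K(v, w) = (v . w)^2.
import numpy as np

def phi(v):
    return np.array([v[0] ** 2, np.sqrt(2) * v[0] * v[1], v[1] ** 2])

v = np.array([1.0, 2.0])
w = np.array([3.0, -1.0])

print(np.dot(v, w) ** 2)         # kernel computed directly
print(np.dot(phi(v), phi(w)))    # inner product after lifting: same value
```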
More examples • Polynomial kernel • K(v, w) = (1 + v · w)^d • Gaussian kernel • K(v, w) = exp(−‖v − w‖² / 2σ²) – Sometimes called the Radial Basis Function (RBF) kernel
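A short sketch of both kernels on plain vectors; the degree d and width σ below are illustrative parameter choices, not values from the slides.

```python
# Polynomial and Gaussian (RBF) kernels on two example vectors.
import numpy as np

def polynomial_kernel(v, w, d=3):
    return (1 + np.dot(v, w)) ** d

def gaussian_kernel(v, w, sigma=1.0):
    return np.exp(-np.linalg.norm(v - w) ** 2 / (2 * sigma ** 2))

v, w = np.array([1.0, 2.0]), np.array([0.5, -1.0])
print(polynomial_kernel(v, w), gaussian_kernel(v, w))
```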
Graph kernels • To compute similarity between two attributed graphs – Nodes can carry labels – E.g. elements (C, N, H, etc.) in complex molecules • Idea: It is not obvious how to compare two graphs directly – Instead, compute walks, cycles, etc. on each graph, and compare those
Walk counting • Count the number of walks of length k from i to j • Idea: i and j should be considered close if – They are not far apart in shortest path distance – And there are many short walks between them (so they are highly connected) • That is, there should be many walks of length ≤ ℓ for small ℓ
Walk counting • Can be computed by taking the k-th power of the adjacency matrix A • If A^k[i, j] = c, that means there are c walks of length k between i and j • Note: computing A^k is expensive, but manageable for small graphs
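A minimal sketch of walk counting by matrix powers, on a small made-up example graph: the (i, j) entry of A^k is the number of walks of length k from i to j.

```python
# Count walks of length k with the k-th power of the adjacency matrix.
import numpy as np

A = np.array([[0, 1, 1, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]])

k = 3
Ak = np.linalg.matrix_power(A, k)
print(f"walks of length {k} from node 0 to node 3:", Ak[0, 3])
```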
Common walk kernel • Count how many walks are common between the two graphs • That is, take all possible walks of length k on both graphs – Count the number that are exactly the same – Two walks are the same if they follow the same sequence of labels • (Note that other than labels, there is no obvious correspondence between nodes of the two graphs)
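A sketch of this counting idea: enumerate all walks of length k in each labelled graph, reduce every walk to its label sequence, and count how many pairs of walks (one from each graph) carry the same sequence. The adjacency lists and node labels below are made-up toy examples.

```python
# Common walk kernel by brute-force enumeration of label sequences.
from collections import Counter

def walks(adj, k):
    """All walks with k edges, as tuples of nodes."""
    current = [(v,) for v in adj]
    for _ in range(k):
        current = [w + (u,) for w in current for u in adj[w[-1]]]
    return current

def common_walk_kernel(adj1, labels1, adj2, labels2, k):
    seqs1 = Counter(tuple(labels1[v] for v in w) for w in walks(adj1, k))
    seqs2 = Counter(tuple(labels2[v] for v in w) for w in walks(adj2, k))
    # each pair of walks with identical label sequences contributes 1
    return sum(c * seqs2[s] for s, c in seqs1.items())

adj1 = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
labels1 = {0: "C", 1: "C", 2: "O"}
adj2 = {0: [1], 1: [0, 2], 2: [1]}
labels2 = {0: "C", 1: "O", 2: "C"}

print(common_walk_kernel(adj1, labels1, adj2, labels2, k=2))
```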
Random walk kernel • Perform multiple random walks of length k on both graphs • Count the number of walks common to both graphs
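A sampled variant of the same idea: rather than enumerating every walk, run many random walks of length k on each graph and count the label sequences seen in both. The graphs and labels are the same toy examples as in the previous sketch; the number of walks sampled is an arbitrary illustrative choice.

```python
# Random walk kernel sketch: sample walks, compare label sequences.
import random

adj1, labels1 = {0: [1, 2], 1: [0, 2], 2: [0, 1]}, {0: "C", 1: "C", 2: "O"}
adj2, labels2 = {0: [1], 1: [0, 2], 2: [1]}, {0: "C", 1: "O", 2: "C"}

def sample_walk_labels(adj, labels, k, n_walks, rng):
    seqs = set()
    for _ in range(n_walks):
        v = rng.choice(list(adj))
        walk = [v]
        for _ in range(k):
            v = rng.choice(adj[v])
            walk.append(v)
        seqs.add(tuple(labels[u] for u in walk))
    return seqs

rng = random.Random(0)
common = (sample_walk_labels(adj1, labels1, 2, 200, rng)
          & sample_walk_labels(adj2, labels2, 2, 200, rng))
print("label sequences seen in both graphs:", len(common))
```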
Tottering • Walks can move back and forth between adjacent vertices – Small structural similarities can then produce a large score • Usual technique: for a walk w₁, w₂, …, prohibit an immediate return along an edge, i.e. require wᵢ ≠ wᵢ₊₂
Subtree kernel • From each node, compute a neighborhood up to distance h • For every pair of nodes, one from each graph, compare the neighborhoods – And count the number of matches
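A much-simplified sketch of the neighborhood-comparison idea: for every node, collect the multiset of labels within distance h, then count pairs of nodes (one per graph) whose multisets match. Real subtree kernels compare richer tree-structured patterns; this only shows the counting structure, on the same toy graphs as above.

```python
# Simplified subtree-style kernel: compare h-hop neighborhood label multisets.
from collections import Counter, deque

def h_neighbourhood_labels(adj, labels, root, h):
    dist = {root: 0}
    queue = deque([root])
    while queue:                      # BFS out to distance h
        v = queue.popleft()
        if dist[v] == h:
            continue
        for u in adj[v]:
            if u not in dist:
                dist[u] = dist[v] + 1
                queue.append(u)
    return frozenset(Counter(labels[v] for v in dist).items())

def subtree_like_kernel(adj1, labels1, adj2, labels2, h):
    n1 = [h_neighbourhood_labels(adj1, labels1, v, h) for v in adj1]
    n2 = [h_neighbourhood_labels(adj2, labels2, v, h) for v in adj2]
    return sum(1 for a in n1 for b in n2 if a == b)

adj1, labels1 = {0: [1, 2], 1: [0, 2], 2: [0, 1]}, {0: "C", 1: "C", 2: "O"}
adj2, labels2 = {0: [1], 1: [0, 2], 2: [1]}, {0: "C", 1: "O", 2: "C"}
print(subtree_like_kernel(adj1, labels1, adj2, labels2, h=1))
```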
Shortest path kernel • Compute all-pairs shortest paths in the two graphs • Count the number of common sequences • The tottering problem does not appear • Problem: there can be many (exponentially many) shortest paths between two nodes – Computational problems – Can bias the similarity
Shortest distance kernel • Instead use the shortest distance between nodes • Always unique • Method: – Compute all shortest distances SD(G1) and SD(G2) in graphs G1 and G2 – Define a kernel (e.g. a Gaussian kernel) over pairs of distances: k(s1, s2), where s1 ∈ SD(G1), s2 ∈ SD(G2) – Define the shortest path (SP) kernel between graphs as the sum of kernel values over all pairs of distances from the two graphs • K_SP(G1, G2) = Σ_{s1 ∈ SD(G1)} Σ_{s2 ∈ SD(G2)} k(s1, s2)
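A sketch of this construction, assuming small unweighted graphs: BFS gives all-pairs shortest distances, and a Gaussian kernel is summed over all pairs of distances, one from each graph. The value of σ and the example graphs are illustrative choices.

```python
# Shortest distance kernel: Gaussian kernel summed over pairs of distances.
from collections import deque
import math

def all_shortest_distances(adj):
    dists = []
    for s in adj:                    # BFS from every source node
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            for u in adj[v]:
                if u not in dist:
                    dist[u] = dist[v] + 1
                    queue.append(u)
        dists.extend(d for v, d in dist.items() if v != s)
    return dists

def sd_kernel(adj1, adj2, sigma=1.0):
    sd1, sd2 = all_shortest_distances(adj1), all_shortest_distances(adj2)
    return sum(math.exp(-(a - b) ** 2 / (2 * sigma ** 2))
               for a in sd1 for b in sd2)

adj1 = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
adj2 = {0: [1], 1: [0, 2], 2: [1]}
print(sd_kernel(adj1, adj2))
```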