Logical Expressiveness of Graph Neural Networks
DIG seminar, Mikaël Monet, March 12th, 2020
Millennium Institute for Foundational Research on Data, Chile
Graph Neural Networks (GNNs)
• With: Pablo Barceló, Egor Kostylev, Jorge Pérez, Juan Reutter, Juan Pablo Silva
• Graph Neural Networks (GNNs) [Merkwirth and Lengauer, 2005, Scarselli et al., 2009]: a class of NN architectures that has recently become popular for dealing with structured data
→ Goal: understand what they are and what their theoretical properties are
Neural Networks (NNs)
[Figure: a fully connected neural network N, mapping an input vector x through L layers of neurons to an output vector y = N(x).]
• Weight w_{n'→n} between two consecutive neurons
• Compute left to right: λ(n) := f( Σ_{n'} w_{n'→n} × λ(n') ), where the sum ranges over the neurons n' of the previous layer
• Goal: find the weights that "solve" your problem (classification, clustering, regression, etc.)
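As a concrete illustration, here is a minimal sketch of this left-to-right computation in Python/numpy. The slides leave the activation f abstract; the ReLU below, the layer sizes, and the random weights are illustrative assumptions.

```python
import numpy as np

def relu(z):                          # an assumed choice for the activation f
    return np.maximum(0.0, z)

def forward(x, weights):
    # Left-to-right computation: lambda(n) = f( sum_{n'} w_{n'->n} * lambda(n') ).
    a = x
    for W in weights:                 # W has shape (n_out, n_in): one weight per pair of neurons
        a = relu(W @ a)               # each neuron aggregates the whole previous layer
    return a

# Example: 3 inputs -> 4 hidden neurons -> 2 outputs, with random weights.
rng = np.random.default_rng(0)
weights = [rng.normal(size=(4, 3)), rng.normal(size=(2, 4))]
y = forward(np.array([1.0, -0.5, 2.0]), weights)   # y = N(x)
```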
Finding the weights
• Goal: find the weights that "solve" your problem
→ minimize Dist(N(x), g(x)), where g is the function you want to learn
→ use backpropagation algorithms
• Problem: for fully connected NNs, when a layer has many neurons there are a lot of weights...
→ example: the input is a 250 × 250 pixel image, and we want to build a fully connected NN with 500 neurons per layer
→ between the first two layers we already have 250 × 250 × 500 = 31,250,000 weights
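A quick sanity check of that count (bias terms, which would add a further 500 parameters, are ignored here as on the slide):

```python
# Weights between a flattened 250x250 input and a 500-neuron layer.
inputs = 250 * 250        # 62,500 input values
hidden = 500
print(inputs * hidden)    # 31,250,000
```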
Convolutional Neural Networks
[Figure: a convolutional neural network taking an image as input; each neuron is connected only to a small local patch of the previous layer.]
• Idea: use the structure of the data (here, a grid)
→ fewer weights to learn (e.g., 500 × 9 = 4,500 for the first layer)
→ other advantage: recognize patterns that are local
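A minimal sketch of the underlying operation, assuming a 3 × 3 kernel (which matches the 500 × 9 count above); real convolutional layers add strides, padding, channels, and biases.

```python
import numpy as np

def conv2d(image, kernel):
    # Each output value depends only on a local kh x kw patch of the input,
    # and the same kernel weights are reused at every position.
    kh, kw = kernel.shape
    h, w = image.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# 500 filters of size 3x3 need only 500 * 9 = 4,500 weights, against the
# 31,250,000 of the fully connected layer on the same 250x250 image.
feature_map = conv2d(np.random.rand(250, 250), np.random.rand(3, 3))
```

Weight sharing is what makes the parameter count independent of the image size, and it is also why the patterns a filter detects are local.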
Graph Neural Networks (GNNs)
[Figure: a (convolutional) graph neural network whose input is a molecule and whose output answers "is it poisonous?" (e.g., [Duvenaud et al., 2015]).]
• Idea: use the structure of the data
→ GNNs generalize this idea to allow any graph as input
Question: what can we do with graph neural networks? (from a theoretical perspective)
GNNs: formalisation
• Simple, undirected, node-labeled graph G = (V, E, λ), where λ : V → R^d
• Run of a GNN with L layers on G: iteratively compute x_u^(i) ∈ R^d for 0 ≤ i ≤ L as follows:
→ x_u^(0) := λ(u)
→ x_u^(i+1) := COMB^(i+1)( x_u^(i), AGG^(i+1)( {{ x_v^(i) | v ∈ N_G(u) }} ) )
• where the AGG^(i) are called aggregation functions and the COMB^(i) combination functions
• Let us call such a GNN an aggregate-combine GNN (AC-GNN)
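A minimal sketch of one AC-GNN layer. The definition leaves AGG and COMB abstract; the multiset sum for AGG and the single linear layer with ReLU for COMB below are common instantiations, assumed here for concreteness.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def ac_gnn_layer(X, adj, W_self, W_agg, b):
    # One aggregate-combine step:
    #   x_u^(i+1) := COMB( x_u^(i), AGG({{ x_v^(i) | v in N_G(u) }}) )
    # with AGG = sum over the neighbor multiset (order-invariant) and
    # COMB(x, a) = ReLU(W_self x + W_agg a + b), an assumed instantiation.
    n, d = X.shape
    out = np.zeros_like(X)
    for u in range(n):
        agg = np.zeros(d)
        for v in adj[u]:
            agg += X[v]
        out[u] = relu(W_self @ X[u] + W_agg @ agg + b)
    return out

# Example: a triangle, with x_u^(0) := lambda(u) as 2-dimensional labels.
adj = [[1, 2], [0, 2], [0, 1]]
X0 = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
rng = np.random.default_rng(0)
X1 = ac_gnn_layer(X0, adj, rng.normal(size=(2, 2)), rng.normal(size=(2, 2)), np.zeros(2))
```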
Link with Weisfeiler-Lehman
• Recently, [Morris et al., 2019, Xu et al., 2019] established a link with the Weisfeiler-Lehman (WL) isomorphism test
→ a heuristic to determine whether two graphs are isomorphic (also called color refinement)
1. Start from two graphs, with all nodes having the same color
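A minimal sketch of the refinement step, assuming graphs given as adjacency lists: a node's new color is determined by its current color together with the multiset of its neighbors' colors, and the test compares the stable colorings reached on the two graphs.

```python
def wl_round(colors, adj):
    # New color of u = (old color of u, sorted multiset of neighbor colors),
    # relabeled with small integers so colors stay compact.
    signatures = [(colors[u], tuple(sorted(colors[v] for v in adj[u])))
                  for u in range(len(adj))]
    palette = {}
    return [palette.setdefault(sig, len(palette)) for sig in signatures]

# Step 1 on the slide: all nodes start with the same color; then refine
# until the coloring stabilizes.
adj = [[1, 2], [0, 2], [0, 1], []]    # a triangle plus an isolated node
colors = [0] * len(adj)
while True:
    refined = wl_round(colors, adj)
    if refined == colors:
        break
    colors = refined
```

If the stable color multisets of the two graphs differ, the graphs are certainly non-isomorphic; if they agree, the test is inconclusive.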