Learning Representations of Relational Data Sebastijan Dumani - PowerPoint PPT Presentation

Learning Representations of Relational Data Sebastijan Dumančić DTAI, CS Department, KU Leuven September 6, ILP 2017

1 – Outline 2/30 1 Introduction 2 Where are we now? 3 What can we do better? 4 Similarity of relational objects 5 Experiments and results 6 Auto-encoding logic programs Learning relational latent features – Dumančić, Blockeel

1 – Representation matters 3/30 Learning relational latent features – Dumančić, Blockeel

1 – Finding good features 4/30 Deep learning - finding good features autonomously by gradually building complexity Learning relational latent features – Dumančić, Blockeel

1 – Focus on sensory data 5/30 Learning relational latent features – Dumančić, Blockeel

1 – Relational deep learning? 6/30 What about relational data? Learning relational latent features – Dumančić, Blockeel

1 – Relational deep learning? 7/30 Learning relational latent features – Dumančić, Blockeel

2 – Vector spaces in knowledge graphs 9/30 Learning relational latent features – Dumančić, Blockeel

2 – Vector spaces in knowledge graphs 10/30 Learning representation = learning vectors Learning relational latent features – Dumančić, Blockeel

2 – Vector spaces in knowledge graphs 11/30 wasBornIn(barack,honolulu). � � [ honolulu ] T ≈ 1 [ barack ] wasBornIn wasBornIn(barack,nairobi). � � [ nairobi ] T ≈ 0 [ barack ] wasBornIn Learning relational latent features – Dumančić, Blockeel

2 – Vector spaces in knowledge graphs 12/30 efficient uninterpretable latent spaces good performance on KB huge amounts of data completion tasks problems with unseen entities does not integrate in (statistical) relational learning Learning relational latent features – Dumančić, Blockeel

3 – Desirable features 14/30 Learning relational latent features – Dumančić, Blockeel

3 – Learning features with k-means 15/30 [Coates, Lee and NG, AISTATS 2011] Learning relational latent features – Dumančić, Blockeel

3 – Lifting the pipeline 16/30 Questions: What to cluster? How to cluster? Architecture? Learning relational latent features – Dumančić, Blockeel

3 – Lifting the pipeline 17/30 What to cluster? cluster vertices and relationships! For each type/domain of vertices in data Learning relational latent features – Dumančić, Blockeel

3 – Lifting the pipeline 18/30 How to cluster them? Unsupervised approach - which similarity is useful? (features, proximity, struc,...) Cluster with a diverse set of similarities Learning relational latent features – Dumančić, Blockeel

3 – Lifting the pipeline 19/30 How to choose the architecture ? a predicate for each latent feature Rely on clustering selection to choose a good clustering Learning relational latent features – Dumančić, Blockeel

4 – Relational similarity 21/30 How similar are ProfA and ProfB ? Relational clustering over neighbourhood trees [Dumančić & Blockeel, MLJ 2017] Learning relational latent features – Dumančić, Blockeel

4 – Relational similarity – Neighbourhood trees 22/30 Neighbourhood trees summarize the neighbourhood of an instance/example data neighbourhood tree Learning relational latent features – Dumančić, Blockeel

4 – Relational similarity – Neighbourhood trees 22/30 Neighbourhood trees summarize the neighbourhood of an instance/example data neighbourhood tree similarity of instances = similarity of their neighbourhood trees Learning relational latent features – Dumančić, Blockeel

4 – Relational similarity – similarity interpretation 23/30 Decompose neighbourhood trees into semantic parts Learning relational latent features – Dumančić, Blockeel

4 – Relational similarity – similarity interpretation 23/30 Decompose neighbourhood trees into semantic parts similarity = linear combination of similarities of individual semantic parts Learning relational latent features – Dumančić, Blockeel

4 – Relational similarity – comparing semantic parts 24/30 Decompose NT is multisets of: attribute edge labels vertex identities per level and vertex type Multiset of edge labels (level 1): { (Advised,2), (Advised,2), (TaughtBy,2) } Compare two multisets, A and B with χ 2 distance ( f A ( x ) − f B ( x )) 2 χ 2 ( A, B ) = � f A ( x ) + f B ( x ) x ∈ A ∪ B Learning relational latent features – Dumančić, Blockeel

4 – Relational similarity – hyperedge similarity 25/30 (Hyper)edge similarity – reduction to similarities of vertices Merging 1 Combination 2 Learning relational latent features – Dumančić, Blockeel

5 – Experiments and results 27/30 Datasets: Setup: IMDB 5-fold cross validation UWCSE learning features of train data Mutagenesis mapping test data to the obtained clusters Hepatitis learn TILDE models on Terrorist attacks latent/original representations WebKB Question: Does learning in relational latent spaces benefits leaning compared to learning in the original space? lower model complexity increased performance How does it compared to MRC [Kok & Domingos, ICML 07] Learning relational latent features – Dumančić, Blockeel

5 – Experiments and results 28/30 Models learned on latent representations are substantially simpler Models learned on latent representations often perform better exception: relationship info not useful Learning relational latent features – Dumančić, Blockeel

6 – Auto-encoding logic programs 30/30 Logic programs a computational framework for encoder and decoder Input: mother(anna,dirk). female(anna). father(tom,dirk). male(tom). Encoder: latent1(X,Y) :- mother(X,Y);father(X,Y). latent2(X) :- female(X). Latent rep.: latent1(anna,dirk). latent1(tom,dirk). latent2(anna). Decoder: mother(X,Y) :- latent1(X,Y),latent2(X). female(X) :- latent2(X). father(X,Y) :- latent1(X,Y),not(latent2(X)). male(X) :- not(latent2(X)). Output: mother(anna,dirk). female(anna). father(tom,dirk). male(tom). Learning relational latent features – Dumančić, Blockeel

Learning Representations of Relational Data Sebastijan Dumani - PowerPoint PPT Presentation

Learning Representations of Relational Data Sebastijan Dumani DTAI, CS Department, KU Leuven September 6, ILP 2017 1 Outline 2/30 1 Introduction 2 Where are we now? 3 What can we do better? 4 Similarity of relational objects 5

Chapter 2: Relational Model Chapter 2: Relational Model Structure of Relational Databases

Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple

Relational Algebra Relational Query Languages Recall: Query = Retrieval Program Language

Relational Algebra 1 / 39 Relational Algebra Relational model specifies stuctures and

Relational Query Languages (2) SQL and QBE Walid G. Aref Query Languages For The Relational

Relational Data Model Hacettepe University Computer Engineering Department Outline 1. Relational

This Lecture The Relational Model Relational data structures Relations and Relational

The Relational Data Model Lecture 6 1 Outline Relational Data Model Functional

Chapter 8 Evaluation of Relational Operators Implementing the Relational Algebra Relational

Relational Calculus More declarative than relational algebra Foundation for query

RELATIONAL ALGEBRA CHAPTER 6 1 CHAPTER 6 OUTLINE Unary Relational Operations: SELECT and

Relational Algebra Murali Mani What is Relational Algebra? Defines operations (data

CSE 154 LECTURE 13:RELATIONAL DATABASES AND SQL Relational databases relational database : A

CSC 337 LECTURE 20: RELATIONAL DATABASES AND SQL Relational databases relational database : A

CSE 154 LECTURE 22:RELATIONAL DATABASES AND SQL Relational databases relational database : A

Relational Non-Relational Rational Agile Predictable Flexible Traditional

Ramsey regularity, MAD families, and their relatives David Schrittesser (KGRC) Joint work with

ARNOLD STABILITY of TIME-OSCILLATING FLOWS Legacy of Vladimir Arnold Fields Institute, November,

Logistic Regression, Gradient Descent, and Newton Method Matthieu R. Bloch 1 Maximum Likelihood

Logical reduction of metarules Andrew Cropper & Sophie Tourret ILP Examples Learner

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS

Random matrix ensembles for quantum spins and decoherence Franois David IPhT Saclay & CNRS

The Total Least Squares Problem with Multiple Right-Hand Sides A X B Martin Ple singer in

Promoting Education under Distortionary Taxation: A Comparison between Equality of Opportunity

Sambuz

Useful Links

Newsletter

Mail Us

Learning Representations of Relational Data Sebastijan Dumani - PowerPoint PPT Presentation

Learning Representations of Relational Data Sebastijan Dumani DTAI, CS Department, KU Leuven September 6, ILP 2017 1 Outline 2/30 1 Introduction 2 Where are we now? 3 What can we do better? 4 Similarity of relational objects 5

Chapter 2: Relational Model Chapter 2: Relational Model Structure of Relational Databases

Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple

Relational Algebra Relational Query Languages Recall: Query = Retrieval Program Language

Relational Algebra 1 / 39 Relational Algebra Relational model specifies stuctures and

Relational Query Languages (2) SQL and QBE Walid G. Aref Query Languages For The Relational

Relational Data Model Hacettepe University Computer Engineering Department Outline 1. Relational

This Lecture The Relational Model Relational data structures Relations and Relational

The Relational Data Model Lecture 6 1 Outline Relational Data Model Functional

Chapter 8 Evaluation of Relational Operators Implementing the Relational Algebra Relational

Relational Calculus More declarative than relational algebra Foundation for query

RELATIONAL ALGEBRA CHAPTER 6 1 CHAPTER 6 OUTLINE Unary Relational Operations: SELECT and

Relational Algebra Murali Mani What is Relational Algebra? Defines operations (data

CSE 154 LECTURE 13:RELATIONAL DATABASES AND SQL Relational databases relational database : A

CSC 337 LECTURE 20: RELATIONAL DATABASES AND SQL Relational databases relational database : A

CSE 154 LECTURE 22:RELATIONAL DATABASES AND SQL Relational databases relational database : A

Relational Non-Relational Rational Agile Predictable Flexible Traditional

Ramsey regularity, MAD families, and their relatives David Schrittesser (KGRC) Joint work with

ARNOLD STABILITY of TIME-OSCILLATING FLOWS Legacy of Vladimir Arnold Fields Institute, November,

Logistic Regression, Gradient Descent, and Newton Method Matthieu R. Bloch 1 Maximum Likelihood

Logical reduction of metarules Andrew Cropper &amp; Sophie Tourret ILP Examples Learner

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS

Random matrix ensembles for quantum spins and decoherence Franois David IPhT Saclay &amp; CNRS

The Total Least Squares Problem with Multiple Right-Hand Sides A X B Martin Ple singer in

Promoting Education under Distortionary Taxation: A Comparison between Equality of Opportunity

Sambuz

Useful Links

Newsletter

Mail Us

Logical reduction of metarules Andrew Cropper & Sophie Tourret ILP Examples Learner

Random matrix ensembles for quantum spins and decoherence Franois David IPhT Saclay & CNRS