Schema Independent Relational Learning

Jose Picado, Arash Termehchy, Alan Fern, Parisa Ataei
Information and Data Management and Analytics (IDEA) Lab

Design a drug to treat HIV
What is the structure of compounds that have anti-HIV activity?
A compound has anti-HIV activity if it has the following substructure:
[Figure: oracle; substructure with atoms N, O, N]

Relational learning
• Leverages the structure of the relational database
• Learns a Datalog definition

compound
compId  atomId
c1      a1
c2      a10

atom
atomId  element
a1      N
a2      O

bond
atomId1  atomId2  type
a1       a2       single
a2       a3       single

Training data:
anti-HIV    no-anti-HIV
compId      compId
c1          c2
c3          c4

Output of the relational learning algorithm:
anti-HIV(x) :- compound(x,u), atom(u,N),
               compound(x,v), atom(v,O),
               compound(x,w), atom(w,N),
               bond(u,v,single), bond(v,w,single).
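As a rough illustration of what the learned Datalog definition means, the rule body can be read as a self-join over the compound, atom, and bond tables. The sketch below is not from the slides. The rows for c1/a2, c1/a3, a3, and a10's element are hypothetical additions, made only so that the toy database contains one complete N, O, N chain, and the function name anti_hiv is an assumption as well.

```python
# Toy database. Rows marked "hypothetical" are NOT on the slide.
compound = {
    ("c1", "a1"),
    ("c1", "a2"),   # hypothetical
    ("c1", "a3"),   # hypothetical
    ("c2", "a10"),
}
atom = {
    ("a1", "N"),
    ("a2", "O"),
    ("a3", "N"),    # hypothetical
    ("a10", "C"),   # hypothetical
}
bond = {
    ("a1", "a2", "single"),
    ("a2", "a3", "single"),
}

# atomId is assumed to identify one element, so a dict lookup is enough.
element = dict(atom)


def anti_hiv(x: str) -> bool:
    """Check the rule body for compound x by trying every (u, v, w) of its atoms."""
    atoms_of_x = {a for (c, a) in compound if c == x}
    return any(
        element.get(u) == "N"
        and element.get(v) == "O"
        and element.get(w) == "N"
        and (u, v, "single") in bond
        and (v, w, "single") in bond
        for u in atoms_of_x
        for v in atoms_of_x
        for w in atoms_of_x
    )


result_c1 = anti_hiv("c1")
result_c2 = anti_hiv("c2")
```

Each variable in the rule body (u, v, w) becomes one loop over the atoms of the compound, and each literal becomes one membership test.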

Relational learning has many applications in data analytics & management
• Model entities and relationships between entities
  Marketing: How will new customers respond to an offer?
    Concept: interestedInOffer(customer)
  Drug design: What is the structure of compounds to fight a disease?
    Concept: active(compound)
• Various applications in data management
  • E.g., information extraction, usable query interfaces, data integration/exchange.

Benefits of relational learning
✓ Leverage the structure of data and learn over complex schemas with multiple tables
✓ Automatic feature extraction and selection
✓ Results are interpretable (Datalog)

[Example database: the compound, atom, and bond tables from the previous slide]

Existing algorithms: FOIL, Progol, …
Castor (new algorithm)

anti-HIV(x) :- compound(x,u), atom(u,N),
               compound(x,v), atom(v,O),
               compound(x,w), atom(w,N),
               bond(u,v,single), bond(v,w,single).

Schema 1
Which authors are collaborators?

paperAuthor
paperId  authorId
p1       mad
p1       bai
p2       soc
p2       man
p3       mad

author
id   name
mad  Madden
sto  Stonebraker
soc  Socher
man  Manning
bai  Bailis

authorAffiliation
id   affiliation
mad  MIT
sto  MIT
soc  Stanford
man  Stanford
bai  Stanford

paper
id  title
p1  MacroBase: Priori…
p2  GloVe: Global Vect…

paperYear
id  year
p1  2017
p2  2014

paperConf
id  conf
p1  SIGMOD
p2  EMNLP

collaborators
person1  person2
Madden   Bailis
Socher   Manning
Madden   Stonebraker

non-collaborators
person1  person2
Madden   Socher
Manning  Bailis

[Figure: the database and the examples are given to the FOIL learning algorithm; its output is shown as "?"]

FOIL: relational learning algorithm (Schema 1)

Relations in Schema 1:
  author(id, name)
  authorAffiliation(id, affiliation)
  paperAuthor(paperId, authorId)
  paperConf(id, conf)
  paper(id, title)
  paperYear(id, year)

Scoring function f: P - N
P: positive examples covered
N: negative examples covered

Start:
  collaborators(x,y) :- true.

Step 1. Candidate literals include author(z,x) and author(z,y); candidate scores shown: f=0, f=0, f=-1.
  Selected: author(z,x)
  collaborators(x,y) :- author(z,x).

Step 2. Candidate literals include author(v,y); candidate scores shown: f=0, f=1, f=0.
  Selected: author(v,y)
  collaborators(x,y) :- author(z,x), author(v,y).

Step 3. Candidate literals include paperAuthor(w,z); candidate scores shown: f=0, f=2, f=-1.
  Selected: paperAuthor(w,z)
  collaborators(x,y) :- author(z,x), author(v,y), paperAuthor(w,z).

Step 4. Candidate literals include paperAuthor(w,v); candidate scores shown: f=1, f=3, f=1.
  Selected: paperAuthor(w,v)
  collaborators(x,y) :- author(z,x), author(v,y), paperAuthor(w,z), paperAuthor(w,v).

Step 5. Candidate scores shown: f=2, f=1, f=1. No improvement.
  collaborators(x,y) :- author(z,x), author(v,y), paperAuthor(w,z), paperAuthor(w,v).
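The scoring function f = P - N can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The names covers and score are assumptions, and the evaluator is a naive backtracking search over the rows printed for Schema 1. With only the rows visible on the slide, the final clause covers two of the three positive pairs. The slides report f=3, which the sketch reproduces only after adding a hypothetical row (p3, sto) to paperAuthor. That row is an assumption and does not appear in the source.

```python
from typing import Dict, List, Set, Tuple

Atom = Tuple[str, Tuple[str, ...]]          # (relation name, variable names)
Database = Dict[str, Set[Tuple[str, ...]]]  # relation name -> set of rows


def covers(body: List[Atom], binding: Dict[str, str], db: Database) -> bool:
    """True if some extension of `binding` satisfies every literal in `body`."""
    if not body:
        return True
    (relation, args), rest = body[0], body[1:]
    for row in db[relation]:
        extended = dict(binding)
        # A row matches if it agrees with every variable that is already bound.
        if all(extended.setdefault(var, val) == val for var, val in zip(args, row)):
            if covers(rest, extended, db):
                return True
    return False


def score(body, head_vars, positives, negatives, db) -> int:
    """f = P - N: covered positive examples minus covered negative examples."""
    p = sum(covers(body, dict(zip(head_vars, ex)), db) for ex in positives)
    n = sum(covers(body, dict(zip(head_vars, ex)), db) for ex in negatives)
    return p - n


# Rows exactly as printed on the Schema 1 slide.
db: Database = {
    "author": {("mad", "Madden"), ("sto", "Stonebraker"), ("soc", "Socher"),
               ("man", "Manning"), ("bai", "Bailis")},
    "paperAuthor": {("p1", "mad"), ("p1", "bai"), ("p2", "soc"),
                    ("p2", "man"), ("p3", "mad")},
}
positives = [("Madden", "Bailis"), ("Socher", "Manning"), ("Madden", "Stonebraker")]
negatives = [("Madden", "Socher"), ("Manning", "Bailis")]

# collaborators(x,y) :- author(z,x), author(v,y), paperAuthor(w,z), paperAuthor(w,v).
clause: List[Atom] = [
    ("author", ("z", "x")),
    ("author", ("v", "y")),
    ("paperAuthor", ("w", "z")),
    ("paperAuthor", ("w", "v")),
]

f_visible = score(clause, ("x", "y"), positives, negatives, db)

# ASSUMPTION: a row (p3, sto) that is not shown on the slide.
db_assumed = dict(db)
db_assumed["paperAuthor"] = db["paperAuthor"] | {("p3", "sto")}
f_assumed = score(clause, ("x", "y"), positives, negatives, db_assumed)

print(f_visible, f_assumed)
```

A full FOIL loop would call score once per candidate literal and keep the best one. The sketch evaluates only the final clause, because the intermediate scores on the slides cannot be matched to individual candidates from the extracted text.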

Schema 1
Which authors are collaborators?

[Tables and training examples as on the earlier Schema 1 slide]

Output of the FOIL learning algorithm (f=3):
  collaborators(x,y) :- author(z,x), author(v,y),
                        paperAuthor(w,z), paperAuthor(w,v).

Two people are collaborators if they are co-authors.

People represent the same data using different schemas

Separate tables:

author
id   name
mad  Madden
sto  Stonebraker
soc  Socher
man  Manning
bai  Bailis

authorAffiliation
id   affiliation
mad  MIT
sto  MIT
soc  Stanford
man  Stanford
bai  Stanford

paper
id  title
p1  MacroBase: Priori…
p2  GloVe: Global Vect…

paperYear
id  year
p1  2017
p2  2014

paperConf
id  conf
p1  SIGMOD
p2  EMNLP

Composed tables:

author
id   name         affiliation
mad  Madden       MIT
sto  Stonebraker  MIT
soc  Socher       Stanford
man  Manning      Stanford
bai  Bailis       Stanford

paper
id  title                year  conference
p1  MacroBase: Priori…   2017  SIGMOD
p2  GloVe: Global Vect…  2014  EMNLP

[Figure: a DBA applies composition (denormalization) for better performance]
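The composition shown on this slide amounts to joining the separate tables on their shared id column, and projecting the composed table back recovers the originals. The following sketch illustrates this for the author tables only. The helper names compose and decompose are assumptions made for this example, and the round trip is lossless here only because every id appears in both input tables.

```python
from typing import Dict, List, Set, Tuple

Row = Tuple[str, ...]


def compose(left: Set[Row], right: Set[Row]) -> Set[Row]:
    """Natural join on the first column (the shared id)."""
    right_by_id: Dict[str, List[Row]] = {}
    for row in right:
        right_by_id.setdefault(row[0], []).append(row[1:])
    return {l + rest for l in left for rest in right_by_id.get(l[0], [])}


def decompose(rows: Set[Row], keep: Tuple[int, ...]) -> Set[Row]:
    """Project each row onto the column positions listed in `keep`."""
    return {tuple(row[i] for i in keep) for row in rows}


author = {("mad", "Madden"), ("sto", "Stonebraker"), ("soc", "Socher"),
          ("man", "Manning"), ("bai", "Bailis")}
author_affiliation = {("mad", "MIT"), ("sto", "MIT"), ("soc", "Stanford"),
                      ("man", "Stanford"), ("bai", "Stanford")}

# Separate tables -> composed table author(id, name, affiliation).
author_composed = compose(author, author_affiliation)

# Composed table -> separate tables again.
round_trip_ok = (
    decompose(author_composed, (0, 1)) == author
    and decompose(author_composed, (0, 2)) == author_affiliation
)
print(sorted(author_composed))
print(round_trip_ok)
```

Because both directions are simple queries, the two schemas carry the same information, which is what makes it reasonable to expect a learner to behave the same way over either one.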

Schema 2
Which authors are collaborators?

paperAuthor
paperId  authorId
p1       mad
p1       bai
p2       soc
p2       man
p3       mad

author
id   name         affiliation
mad  Madden       MIT
sto  Stonebraker  MIT
soc  Socher       Stanford
man  Manning      Stanford
bai  Bailis       Stanford

paper
id  title                year  conference
p1  MacroBase: Priori…   2017  SIGMOD
p2  GloVe: Global Vect…  2014  EMNLP

collaborators
person1  person2
Madden   Bailis
Socher   Manning
Madden   Stonebraker

non-collaborators
person1  person2
Madden   Socher
Manning  Bailis

[Figure: the database and the examples are given to the FOIL learning algorithm; its output is shown as "?"]

FOIL: relational learning algorithm (Schema 2)

Relations in Schema 2:
  author(id, name, affiliation)
  paperAuthor(paperId, authorId)
  paper(id, title, year, conference)

Scoring function f: P - N
P: positive examples covered
N: negative examples covered

Start:
  collaborators(x,y) :- true.

Step 1. Candidate literals include author(z,x,v) and author(z,y,v); candidate scores shown: f=0, f=0, f=-1.
  collaborators(x,y) :- true.
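One way to see why the search space depends on the schema is to enumerate the candidate literals a FOIL-style learner could add at the first step. The sketch below is a simplification made for illustration, not FOIL's exact procedure: it requires every literal to reuse at least one existing variable, forbids repeated variables within a literal, and introduces new variables in a fixed order to avoid counting renamings twice. Constants, negated literals, and equality literals are ignored. The function name candidate_literals is an assumption.

```python
from itertools import product
from typing import Dict, List, Tuple

Literal = Tuple[str, Tuple[str, ...]]


def candidate_literals(schema: Dict[str, int],
                       old_vars: List[str],
                       new_vars: List[str]) -> List[Literal]:
    """Enumerate literals over `schema` that reuse at least one existing variable."""
    candidates = []
    for relation, arity in schema.items():
        for args in product(old_vars + new_vars, repeat=arity):
            if not any(a in old_vars for a in args):
                continue  # must be connected to the clause built so far
            if len(set(args)) < arity:
                continue  # simplification: no repeated variables
            used_new = [a for a in args if a in new_vars]
            if used_new != new_vars[:len(used_new)]:
                continue  # new variables appear in a fixed order
            candidates.append((relation, args))
    return candidates


# Relation name -> number of columns, read off the two schema slides.
schema1 = {"author": 2, "authorAffiliation": 2, "paperAuthor": 2,
           "paper": 2, "paperYear": 2, "paperConf": 2}
schema2 = {"author": 3, "paperAuthor": 2, "paper": 4}

# Head variables x, y already exist; z and v are available as new variables.
cands1 = candidate_literals(schema1, ["x", "y"], ["z", "v"])
cands2 = candidate_literals(schema2, ["x", "y"], ["z", "v"])

author1 = [c for c in cands1 if c[0] == "author"]
author2 = [c for c in cands2 if c[0] == "author"]
print(len(author1), len(author2))
```

Under these simplifying assumptions the two-column author relation yields 6 candidates and the three-column one yields 12, including the author(z,x) and author(z,x,v) literals that appear on the slides.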
