Statistical Relational Learning and Knowledge Graph Reasoning CSCI 699 Jay Pujara
Reminder: Basic problems
• Who are the entities (nodes) in the graph?
• What are their attributes and types (labels)?
• How are they related (edges)?
(Figure: a small graph with entities E1–E3, attributes A1–A2, and relations R1–R3.)
Motivating Problem: New Opportunities
Internet: massive source of publicly available information
Extraction: cutting-edge IE methods
Knowledge Graph (KG): structured representation of entities, their labels, and the relationships between them
Motivating Problem: Real Challenges
Internet
Extraction: difficult! Noisy!
Knowledge Graph: contains many errors and inconsistencies
Graph Construction Issues
Extracted knowledge is:
• ambiguous:
◦ Ex: Beetles, beetles, Beatles
◦ Ex: citizenOf, livedIn, bornIn
Graph Construction Issues
Extracted knowledge is:
• ambiguous
• incomplete
◦ Ex: missing relationships
◦ Ex: missing labels
◦ Ex: missing entities
(Figure: graph fragment with author and coworker edges.)
Graph Construction Issues
Extracted knowledge is:
• ambiguous
• incomplete
• inconsistent
◦ Ex: Cynthia Lennon, Yoko Ono
◦ Ex: exclusive labels (alive, dead)
◦ Ex: domain-range constraints
(Figure: two conflicting spouse edges.)
Graph Construction Issues
Extracted knowledge is:
• ambiguous
• incomplete
• inconsistent
NELL: The Never-Ending Language Learner
• Large-scale IE project (Carlson et al., 2010)
• Lifelong learning: aims to “read the web”
• Ontology of known labels and relations
• Knowledge base contains millions of facts
Examples of NELL errors
Entity co-reference errors Kyrgyzstan has many variants: • Kyrgystan • Kyrgistan • Kyrghyzstan • Kyrgzstan • Kyrgyz Republic
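One way such spelling variants could be detected automatically is with a simple string-similarity heuristic. The sketch below is purely illustrative (it is not NELL's entity-resolution method); the 0.8 threshold and the use of Python's difflib are assumptions:

```python
# Minimal sketch: flagging likely spelling variants of "Kyrgyzstan" by string similarity.
# NOT NELL's actual co-reference method; the 0.8 threshold is an arbitrary choice.
from difflib import SequenceMatcher

names = ["Kyrgystan", "Kyrgistan", "Kyrghyzstan", "Kyrgzstan",
         "Kyrgyz Republic", "Kazakhstan"]

def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

canonical = "Kyrgyzstan"
for name in names:
    score = similarity(canonical, name)
    flag = "-> likely co-referent" if score > 0.8 else ""
    print(f"{name:18s} similarity to {canonical}: {score:.2f} {flag}")
```

Note that a purely string-based heuristic misses “Kyrgyz Republic”, which is one reason the sameAs reasoning below has to consider additional evidence.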
Missing and spurious labels Kyrgyzstan is labeled a bird and a country
Missing and spurious relations Kyrgyzstan’s location is ambiguous: Kazakhstan, Russia, and the US are all included as possible locations
Violations of ontological knowledge
• Equivalence of co-referent entities (sameAs): SameEntity(Kyrgyzstan, Kyrgyz Republic)
• Mutual exclusion (disjointWith) of labels: MUT(bird, country)
• Selectional preferences (domain/range) of relations: RNG(countryLocation, continent)
Enforcing these constraints requires jointly considering multiple extractions
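As a rough illustration of what checking these constraints against raw extractions could look like, the sketch below uses the constraint names from the slide but entirely hypothetical data structures and facts:

```python
# Hypothetical sketch: flagging extractions that violate ontological constraints.
# The facts below are illustrative, not actual NELL output.
mutually_exclusive = {("bird", "country")}          # MUT(bird, country)
relation_range = {"countryLocation": "continent"}   # RNG(countryLocation, continent)

labels = {"Kyrgyzstan": {"bird", "country"}}
relations = [("Kyrgyzstan", "countryLocation", "Russia")]
entity_types = {"Russia": {"country"}, "Asia": {"continent"}}

# Mutual exclusion: an entity should not carry two disjoint labels.
for entity, lbls in labels.items():
    for a, b in mutually_exclusive:
        if a in lbls and b in lbls:
            print(f"MUT violation: {entity} is labeled both {a} and {b}")

# Range constraint: the object of countryLocation should be a continent.
for subj, rel, obj in relations:
    required = relation_range.get(rel)
    if required and required not in entity_types.get(obj, set()):
        print(f"RNG violation: {obj} is not a {required} in ({subj}, {rel}, {obj})")
```

Deciding which of the conflicting extractions to keep when a violation fires is exactly the joint-inference problem the probabilistic models below address.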
Graph Construction approach
• Graph construction cleans and completes the extraction graph
• Incorporate ontological constraints and relational patterns
• Discover statistical relationships within the knowledge graph
Graph Construction Probabilistic Models
TOPICS: Overview, Graphical Models, Random Walk Methods
Voter Party Classification ?
Voter Party Classification Multiple Sources of Information Statuses & Tweets
Voter Party Classification
Multiple Sources of Information: Statuses & Tweets, Donations
Voter Party Classification
Multiple Sources of Information: Statuses & Tweets, Donations, Friends & Followers
Voter Party Classification
Multiple Sources of Information: Statuses & Tweets, Donations, Friends & Followers, Family
Voter Party Classification
Voter Party Classification $ CarlyFiorinaforVicePresident.com
Voter Party Classification
Multiple Sources of Information: Statuses & Tweets, Donations, Friends & Followers, Family
Standard Classification CarlyFiorinaforVicePresident.com Bag-of-words features
Standard Classification CarlyFiorinaforVicePresident.com Bag-of-words features Pr( Y )
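As a concrete, hypothetical version of this local classifier, one could fit a bag-of-words logistic regression over a user's statuses; the training snippets, labels, and use of scikit-learn here are assumptions for illustration, not the lecture's actual setup:

```python
# Sketch of the per-user "local" classifier: bag-of-words features -> Pr(party).
# Training texts and labels are toy examples, not real data.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

statuses = [
    "we need affordable health care for everyone",
    "tax cuts will grow the economy",
    "protect the affordable care act",
    "lower taxes and smaller government",
]
parties = ["Democrat", "Republican", "Democrat", "Republican"]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(statuses)              # bag-of-words features
clf = LogisticRegression().fit(X, parties)

new_user = vectorizer.transform(["tax cuts now, CarlyFiorinaforVicePresident.com"])
print(dict(zip(clf.classes_, clf.predict_proba(new_user)[0])))  # Pr(Y) per party
```

The point of the slides that follow is that this per-user prediction ignores the relational evidence (spouses, friends, followers) listed above.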
Voter Party Classification
Multiple Sources of Information: Status Updates, Donations, Friends, Family
Collective Classification Follows
Collective Classification Follows Pr( Y )
Collective Classification Follows My label is likely to match that of my follower
Collective Classification
Follows
Follows(U1, U2) & Votes(U1, P) -> Votes(U2, P)
Collective Classification follower spouse
Collective Classification
follower spouse
Spouse(U1, U2) & Votes(U1, P) -> Votes(U2, P)
Follows(U1, U2) & Votes(U1, P) -> Votes(U2, P)
Collective Classification
Collective Classification Pr( Y )
Collective Classification follower spouse follower
Collective Classification
follower spouse follower
5.0: Spouse(U1, U2) & Votes(U1, P) -> Votes(U2, P)
2.0: Follows(U1, U2) & Votes(U1, P) -> Votes(U2, P)
Collective Classification
(Figure: a social network with friend, spouse, and colleague edges; several users' party labels are unknown.)
Collective Classification with PSL
/* Local rules */
5.0: Donates(A, P) -> Votes(A, P)
0.3: Mentions(A, “Affordable Health”) -> Votes(A, “Democrat”)
0.3: Mentions(A, “Tax Cuts”) -> Votes(A, “Republican”)
/* Relational rules */
1.0: Votes(A,P) & Spouse(B,A) -> Votes(B,P)
0.3: Votes(A,P) & Friend(B,A) -> Votes(B,P)
0.1: Votes(A,P) & Colleague(B,A) -> Votes(B,P)
/* Range constraint */
Votes(A, “Republican”) + Votes(A, “Democrat”) = 1.0 .
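To make the scoring concrete, here is a minimal, hand-grounded sketch of how such a program could evaluate one candidate assignment: each grounded rule contributes a weighted hinge penalty max(0, body - head), the soft-logic semantics introduced later in this lecture. The voter, truth values, and groundings are invented for illustration; this is not the PSL inference engine itself:

```python
# Hypothetical hand-grounding of the PSL-style rules above for one voter, "Ann".
# Soft truth values lie in [0, 1]; the penalty of a grounded rule Body -> Head
# is its distance to satisfaction, max(0, body - head).

def luk_and(a: float, b: float) -> float:
    """Lukasiewicz conjunction of two soft truth values."""
    return max(0.0, a + b - 1.0)

def penalty(body: float, head: float) -> float:
    """Distance to satisfaction of Body -> Head."""
    return max(0.0, body - head)

# Observed evidence (invented values).
donates_ann_rep = 1.0    # Donates(Ann, "Republican")
mentions_tax    = 1.0    # Mentions(Ann, "Tax Cuts")
spouse_ann_bob  = 1.0    # Spouse(Ann, Bob)
votes_bob_dem   = 0.9    # Votes(Bob, "Democrat"), treated as observed here

# Candidate assignment for Ann that we want to score.
votes_ann_rep = 0.7
votes_ann_dem = 0.3      # range constraint holds: 0.7 + 0.3 = 1.0

objective = (
    5.0 * penalty(donates_ann_rep, votes_ann_rep)                          # Donates -> Votes
  + 0.3 * penalty(mentions_tax, votes_ann_rep)                             # "Tax Cuts" -> Republican
  + 1.0 * penalty(luk_and(votes_bob_dem, spouse_ann_bob), votes_ann_dem)   # spouse rule
)
print(f"weighted distance to satisfaction: {objective:.2f}")  # 2.19
# MAP inference searches over assignments like this one to minimize the weighted sum.
```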
Beyond Pure Reasoning
• Classical AI approach to knowledge: reasoning
Lbl(Socrates, Man) & Sub(Man, Mortal) -> Lbl(Socrates, Mortal)
• Reasoning is difficult when extracted knowledge has errors
• Solution: probabilistic models
P(Lbl(Socrates, Mortal) | Lbl(Socrates, Man) = 0.9)
Logic Refresher: Satisfaction
/* Model Snippet */
Mentions(A, “Affordable Health”) -> Votes(A, “Democrat”)

Affordable Health | Democrat | Logical Satisfaction
TRUE              | TRUE     | satisfied
TRUE              | FALSE    | violated
FALSE             | TRUE     | satisfied
FALSE             | FALSE    | satisfied
Logic and Noisy Data
/* Model Snippet */
[1] Mentions(A, “Affordable Health”) -> Votes(A, “Democrat”)
[2] Mentions(A, “Tax Cuts”) -> !Votes(A, “Democrat”)

Affordable Health | Tax Cuts | Democrat | [1] Logical Satisfaction | [2] Logical Satisfaction
TRUE              | TRUE     | TRUE     | satisfied                | violated
TRUE              | TRUE     | FALSE    | violated                 | satisfied

In logic, much as in politics, it is hard to satisfy everyone
Soft Logic to the Rescue!
/* Model Snippet */
[1] Mentions(A, “Affordable Health”) -> Votes(A, “Democrat”)
[2] Mentions(A, “Tax Cuts”) -> !Votes(A, “Democrat”)

Affordable Health | Tax Cuts | Democrat | [1] Logical Satisfaction | [2] Logical Satisfaction
TRUE              | TRUE     | 0.5      | !                        | !
What does 0.5 MEAN?
What does 0.5 mean?
Rounding probability:
• Flip a coin with bias 0.5
• Heads = TRUE
• Tails = FALSE
• Rounding this way gives a ¾-optimal solution to the NP-hard weighted MAX SAT problem [Goemans & Williamson, 1994]
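A minimal sketch of this rounding idea, with toy clauses standing in for the model's grounded rules (the clauses, weights, and soft values are assumptions, and taking the best of several draws is a heuristic convenience rather than the analyzed algorithm):

```python
# Sketch of randomized rounding: soft truth values -> a Boolean assignment.
# Toy clauses in weighted MAX SAT form; each implication P -> Q becomes (!P or Q).
import random

soft = {"Votes_Dem": 0.5, "Mentions_Health": 1.0, "Mentions_Tax": 1.0}

# (weight, disjunction of (variable, wanted_value) literals)
clauses = [
    (1.0, [("Mentions_Health", False), ("Votes_Dem", True)]),   # rule [1]
    (1.0, [("Mentions_Tax", False), ("Votes_Dem", False)]),     # rule [2]
]

def rounded_assignment():
    """Flip a coin with bias equal to each soft value; heads = TRUE."""
    return {var: random.random() < p for var, p in soft.items()}

def satisfied_weight(assign):
    """Total weight of clauses satisfied by a Boolean assignment."""
    return sum(w for w, lits in clauses
               if any(assign[var] == wanted for var, wanted in lits))

best = max((rounded_assignment() for _ in range(100)), key=satisfied_weight)
print(best, satisfied_weight(best))
```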
What does ! MEAN?
What does ! mean?
P -> Q
/* Soft Logic Penalty */
if P < Q: return 0
else: return P - Q
!: Closed Form
P -> Q
max(0, P - Q)
!: Closed Form
P -> Q
max(0, P - Q)
(Plot: soft loss max(0, P - Q) as a function of Q, for P = 1, 0.6, and 0.2.)
What does ! mean? /* Model Snippet */ [1] Mentions(A, “Affordable Health”) -> Votes(A, “Democrat”) [2] Mentions(A, “Tax Cuts”) -> !Votes(A, “Democrat”) /* Soft Logic Penalty */ if Mentions(A, “Tax Cuts”) < !Votes(A, “Democrat”): return 0 else: return Mentions(A, “Tax Cuts”) - !Votes(A, “Democrat”)
Computing !
/* Model Snippet */
[1] Mentions(A, “Affordable Health”) -> Votes(A, “Democrat”)
[2] Mentions(A, “Tax Cuts”) -> !Votes(A, “Democrat”)

Affordable Health | Tax Cuts | Democrat | [1] Penalty | [2] Penalty
1                 | 1        | 0.7      |             |
1                 | 1        | 0.2      |             |

!Q = 1 - Q
P -> Q = max(0, P - Q)
Computing !
/* Model Snippet */
[1] Mentions(A, “Affordable Health”) -> Votes(A, “Democrat”)
[2] Mentions(A, “Tax Cuts”) -> !Votes(A, “Democrat”)

Affordable Health | Tax Cuts | Democrat | [1] Penalty | [2] Penalty
1                 | 1        | 0.7      | 0.3         | 0.7
1                 | 1        | 0.2      | 0.8         | 0.2

!Q = 1 - Q
P -> Q = max(0, P - Q)
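The penalty values in the table can be reproduced directly from these two formulas; a short check using the truth values from the slide:

```python
# Reproducing the penalties in the table above.
def neg(q: float) -> float:                 # !Q = 1 - Q
    return 1.0 - q

def penalty(p: float, q: float) -> float:   # P -> Q has penalty max(0, P - Q)
    return max(0.0, p - q)

rows = [
    # (Mentions "Affordable Health", Mentions "Tax Cuts", Votes "Democrat")
    (1.0, 1.0, 0.7),
    (1.0, 1.0, 0.2),
]
for health, tax, dem in rows:
    p1 = penalty(health, dem)      # [1] Mentions(Health) -> Votes(Democrat)
    p2 = penalty(tax, neg(dem))    # [2] Mentions(Tax)    -> !Votes(Democrat)
    print(f"Democrat={dem}: [1] penalty={p1:.1f}, [2] penalty={p2:.1f}")
# Prints 0.3 / 0.7 and 0.8 / 0.2, matching the table.
```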