Mind Change Optimal Learning Of Bayes Net Structure Oliver Schulte - PowerPoint PPT Presentation

Mind Change Optimal Learning Of Bayes Net Structure Oliver Schulte School of Computing Science Simon Fraser University Vancouver, Canada oschulte@cs.sfu.ca ` with Wei Luo (SFU, wluoa@cs.sfu.ca) and Russ Greiner (U of Alberta, greiner@cs.ualberta.ca)

Outline 1. Brief Intro to Bayes Nets (BNs). 2. Language Learning Model for BN Structure Learning. 3. Mind Change Complexity of BN Learning. 4. Mind Change, Convergence Time Optimality. 5. NP-hardness of Optimal Learner. Mind Change Optimal Learning of Bayes Net Structure 2/19

Bayes Nets: Overview Very widely used graphical formalism for probabilistic reasoning and KR in AI and machine learning. Bayes Net Structure = Directed Acyclic Graph. Nodes = Variables of Interest. Arcs = direct “influence”, “association”. Structure represents probabilistic conditional dependencies (correlations). Mind Change Optimal Learning of Bayes Net Structure 3/19

Example of Bayes Net Structure Season Sprinkler Rain 1. Season depends on Slippery. Wet 2. Sprinkler depends on Rain. 3. Sprinkler does not depend on Rain Slippery given Season. 4. Sprinkler depends on Rain given Season, Wet. Mind Change Optimal Learning of Bayes Net Structure 4/19

Graphs entail Dependencies A C Dep(A,B),Dep(A,B|C), Dep(B,C),Dep(B,C|A), B Dep(A,C|B) A C Dep(A,B),Dep(A,B|C) B A C B Mind Change Optimal Learning of Bayes Net Structure 5/19

Pattern = DAG Equivalence Class  Write Dep(G) for the dependencies defined by DAG G.  Natural Equivalence relation: G ≈ G’ ⇔ Dep(G) = Dep(G’).  A partially directed graph, called a pattern , represents the equivalence class for a given DAG G. G pattern G’ A B A B A B C C C Mind Change Optimal Learning of Bayes Net Structure 6/19

Constraint-Based BN Learning as Language Learning Constraint-Based Approach: Learn BN from (in)dependency information . Spirtes, Glymour, Shines (2000); Pearl and Verma (2000); Margaritis and Thrun (1999); Cheng and Greiner (2001). Bayes Net Gold paradigm conditional dependence: string Dep(X,Y| Z ) Z = set of variables dependency relation language pattern index A BN learner maps a sequence of dependencies (repetitions allowed) to a pattern or to ?. Mind Change Optimal Learning of Bayes Net Structure 7/19

Identification with Bounded Mind Changes  Learner Ψ changes its mind on text T at stage k+1 , Ψ (T[k]) ≠ Ψ (T[k+1]) or Ψ (T[k]) ≠ ? and Ψ (T[k+1]) = ?.  Learner Ψ identifies language collection L with k mind changes , Ψ identifies L and changes its mind at most k times on any text for a language in L .  L is identifiable with k mind changes , there is a learner Ψ that identifies L with k mind changes. … Text Dep(A,B) Dep(B,C) Dep(A,C|B) … A C A C Conjectures ? B B

Inclusion Depth and Mind Change Bounds Proposition (Luo and Schulte 2006) Suppose that L has finite thickness. Then the best mind change bound for L is given by the length of the longest inclusion chain L 1 ⊂ L 2 ⊂ … L k formed by languages in L . 2,3,4 1,2 2,3 longest inclusion chain has length 4 1 2 Mind Change Optimal Learning of Bayes Net Structure 9/19

Mind Change Complexity of BN Learning Let L V be the collection of dependency relations definable by Bayes nets with variables V. Theorem The longest inclusion chain in L V � � | V | is of length = � � 2 � � the number of edges in a complete graph. Mind Change Optimal Learning of Bayes Net Structure 10/19

Maximal Length Inclusion Chain A C all dependencies B A C Dep(A,B),Dep(A,B|C), Dep(B,C),Dep(B,C|A), B Dep(A,C|B) A C Dep(A,B),Dep(A,B|C) B A C B

Mind Change Optimal Learning  Learner Ψ is MC-optimal for language collection L , if given any data sequence σ , the learner Ψ identifies L with the best possible mind change bound for the language collection {L: L is in L and consistent with σ }.  Proposition A BN learner identifying L is MC- optimal , for all dependency sequences σ , if there is no unique edge-minimal pattern consistent with σ , then Ψ ( σ ) = ?. Proof follows from general characterization of MC-optimality in Luo and Schulte (2005,2006). Mind Change Optimal Learning of Bayes Net Structure 12/19

Example of Mind Change Optimal Learner … Text Dep(A,B) Dep(B,C) Dep(A,C|B) … A C A C Conjectures ? B B Alternatives: A C B A C B Mind Change Optimal Learning of Bayes Net Structure 13/19

Convergence Time Convergence Time = number of observed dependencies - important to  minimize Def (Gold) Learner Ψ is uniformly faster than learner Φ ,  Ψ converges at least as fast as Φ on every text T, and 1. Ψ converges strictly faster on some text T. 2. Define � fast( � ) = G � if G is the unique edge - minimal pattern consistent with � � ? otherwise � Proposition The learner Ψ fast is uniformly faster than any other MC-optimal BN learner. Mind Change Optimal Learning of Bayes Net Structure 14/19

Complexity Analysis A list of dependencies is compactly represented by a dependency oracle . dependency list Dep(A,B) Dep(B,C) Dep(A,C|B) Q:“Dep(B,C)” Oracle yes Q:“Dep(A,C)” Oracle ? Unique k O-cover Given a dependency oracle O, and a bound k , is there a DAG G covering the dependencies in O with ≤ k edges s.t. all other DAGs G’ covering the dependencies in O have more edges than G?

NP-hardness result Theorem Unique X3set-Cover reduces to Unique k O- Cover . So if P = RP, then UMOC is NP-hard. Basic Idea: Construct a dependency oracle that forces a tree.  Universe: X 1 ,…,X m . R  Sets: C 1 ,..,C p . … C 1 C 2 C p C p - 1  All elements must be dependent on R. … X 1 X 2 X 3 X m - X m 1 Mind Change Optimal Learning of Bayes Net Structure 16/19

Conclusion  Constraint-based approach to BN learning analyzed as language learning problem. � � n  Mind Change Complexity = , where n is the � � 2 � � number of variables.  Number of edges: new intuitive notion of simplicity for a BN, based on learning theory.  Unique fastest mind-change optimal method is NP-hard. Mind Change Optimal Learning of Bayes Net Structure 17/19

Future Work  Heuristic Implementation of MC-optimal Learner (GES search).  Leads to a new BN learning algorithm with good performance. Mind Change Optimal Learning of Bayes Net Structure 18/19

References W. Luo and O. Schulte. Mind change efficient learning. In COLT 2005, pages 398-412. W. Luo and O. Schulte. Mind change efficient learning . Information and Computation 204:989-1011, 2006. THE END Mind Change Optimal Learning of Bayes Net Structure 19/19

Mind Change Optimal Learning Of Bayes Net Structure Oliver Schulte - PowerPoint PPT Presentation

Mind Change Optimal Learning Of Bayes Net Structure Oliver Schulte School of Computing Science Simon Fraser University Vancouver, Canada oschulte@cs.sfu.ca ` with Wei Luo (SFU, wluoa@cs.sfu.ca) and Russ Greiner (U of Alberta,

www.Every-Mind.org www.Every-Mind.org www.Every-Mind.org www.Every-Mind.org

Naive Bayes and Gaussian Bayes Classifier Ladislav Rampasek slides by Mengye Ren and others

The Nave Bayes Classifier Machine Learning 1 Todays lecture The nave Bayes Classifier

Bayes Theorem Thomas Bayes (1701-1761) Simple form of Bayes Theorem, for

??? It s It s Make Your Mind Up Make Your Mind Up Time Time ??? Make

DATA MINING: NAVE BAYES 1 Nave Bayes Classifier Thomas Bayes 1702 - 1761 We will start off

Cognitive Modeling Unseen Examples 2 Bayes Classifiers Lecture 14: Naive Bayes Classifiers

STAT 339 Naive Bayes Classification 8-10 March 2017 Colin Reimer Dawson Outline Naive Bayes

Bayes Classifiers Nave Bayes Classification Patrick Mair Bayes Classifiers Weather data

I ntroduction to Mobile Robotics Bayes Filter Kalm an Filter Wolfram Burgard 1 Bayes

Introduction to Machine Learning Classification: Naive Bayes Learning goals 15 Understand the

Out line Wrap up d-separ at ion I nf erence in Bayes Net s Bayes Net s (cont )

Know the mind. Shape the mind. Free the mind. 1 The Neurology of Awakening: Using the New Brain

CS 440/ECE448 Lecture 19: Bayes Net Inference Mark Hasegawa-Johnson, 3/2019 Including slides by

Learning in Bayes Nets Bayes Nets: 1. Parameter Learning/Estimation: infer from data, given G

Bayesian Learning Bayes Theorem MAP, ML hypotheses MAP learners Minimum description

Re-imagining Professional Learning with Habits of Mind Bena Kallick and Art Costa share their new

FAME Master University of Bordeaux Advanced Hybrid Materials and Ceramics by Design

EASM 2014 understand the importance of the stakeholders and how they these are influencing the

context: job evaluation, market pay and reward Peter Reilly, Principal Associate, IES For

1 2 Monday, October 25, 2010 3 4 Monday, October 25, 2010 5 6 Monday, October 25, 2010 7

Things to keep in mind while scanning - Get excellent data to work with Richard Steffen Martin

Personalized Learning with Habits of Mind Bena Kallick @benakallick Purposeful Interactions

Mind the Gap Understanding why we dont understand? by Barrie Bramley by Barrie Bramley