look ma no latent variables accurate cutset networks via
play

Look Ma, No Latent Variables: Accurate Cutset Networks via - PowerPoint PPT Presentation

Look Ma, No Latent Variables: Accurate Cutset Networks via Compilation Tahrima Rahman, Shasha Jin, Vibhav Gogate The University of Texas at Dallas ICML 2019 What is the paper about? Probabilistic Graphical Models ( PGMs ) vs Latent Tractable


  1. Look Ma, No Latent Variables: Accurate Cutset Networks via Compilation Tahrima Rahman, Shasha Jin, Vibhav Gogate The University of Texas at Dallas ICML 2019

  2. What is the paper about? Probabilistic Graphical Models ( PGMs ) vs Latent Tractable Probabilistic Models ( LTPMs ): Criteria Who wins? Why? Test set PGMs Expressive model log-likelihood Marginal (MAR) Reliable Exact LTPMs estimates Inference Both use unreliable Maximum-a- approximate posteriori (MAP) No Clear Winner inference estimates approaches Tractability helps achieve better marginal predictions even though the model fit is inferior. Can we do the same for MAP inference?

  3. What is the paper about? Probabilistic Graphical Models ( PGMs ) vs Latent Tractable Probabilistic Models ( LTPMs ): Criteria Who wins? Why? Test set PGMs Expressive model log-likelihood Marginal (MAR) Reliable Exact LTPMs estimates Inference Both use unreliable Maximum-a- approximate posteriori (MAP) No Clear Winner inference estimates approaches Tractability helps achieve better marginal predictions even though the model fit is inferior. Can we do the same for MAP inference? ◮ YES: If we use MAP-tractable cutset networks and im- prove their fit .

  4. Cutset Networks X 1 0.3 0.7 X 2 X 4 0.8 0.1 0.2 0.9 X 2 X 6 X 3 X 3 X 5 X 4 X 5 X 6 X 3 X 5 0.4 0.6 0.25 0.75 T 3 T 4 X 4 X 4 X 2 X 6 X 5 X 6 X 5 X 6 X 3 X 6 X 2 X 3 T 1 T 2 T 5 T 6 ◮ OR tree (probabilistic decision trees) with tree Bayesian net- works at leaves ◮ MAP tractable and Interpretable unlike sum-product networks

  5. Improving Accuracy of Cutset Networks Issue: When learned just from data the test set LL score of cutset networks is much smaller than latent variable models Our approach to improve accuracy: Compile cutset networks from a more accurate latent variable model. 1. First learn a latent variable model M that admits tractable posterior marginal inference 2. Compute the sufficient statistics used by the classic structure learning algorithm for cutset networks by performing marginal inference over M

  6. Experiments Hypothesis: If we improve the fit (test set LL) of cutset networks, because of tractability, they will yield better MAP estimates as compared with latent variable models Experiments verify our hypothesis Dataset Before Compilation After Compilation DNA -78.33 -74.25 Movie -41.38 -38.14 BBC -160.27 -158.26 Table: Conditional Log-Likelihood of the MAP Estimates More details at the poster...

Recommend


More recommend