Divide and Couple: Using Monte Carlo Variational Objectives for Posterior Approximation
Justin Domke and Daniel Sheldon, University of Massachusetts Amherst

Overview: Variational inference gives both a lower bound on the log-likelihood and an approximate posterior. It is easy to get other lower bounds. Do they also give approximate posteriors? This work: a general theory connecting likelihood bounds to posterior approximations.
Setup: Take p(z, x) with x fixed.

Observation: If E R = p(x), then E log R ≤ log p(x) (by Jensen's inequality).

Example: Take R = p(z, x) / q(z) for z ∼ q Gaussian, and optimize q.

[Figure: densities of p(z, x) and q(z) over z, naive; log R = 0.237]

Decomposition: KL(q(z) ‖ p(z | x)) = log p(x) − E log R.
Likelihood bound: ✓   Posterior approximation: ✓
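To make this concrete, here is a minimal sketch of the naive estimator, assuming a toy 1-D model of our own choosing (the names log_joint, mu, and sigma are ours) where log p(x) is tractable, so the bound E log R ≤ log p(x) can be checked numerically:

```python
import numpy as np
from scipy.stats import norm

# Toy model (our assumption): z ~ N(0, 1), x | z ~ N(z, 1), observed
# x = 1.5. Marginally x ~ N(0, 2), so log p(x) is known exactly.
x = 1.5
log_p_x = norm.logpdf(x, loc=0.0, scale=np.sqrt(2.0))

def log_joint(z):
    """log p(z, x) = log p(z) + log p(x | z)."""
    return norm.logpdf(z, 0.0, 1.0) + norm.logpdf(x, z, 1.0)

# Variational family q(z) = N(mu, sigma^2), deliberately mismatched.
mu, sigma = 0.3, 1.2
rng = np.random.default_rng(0)
z = rng.normal(mu, sigma, size=100_000)

# Naive estimator R = p(z, x) / q(z); Jensen gives E log R <= log p(x),
# with equality iff q(z) = p(z | x).
log_R = log_joint(z) - norm.logpdf(z, mu, sigma)
print(f"E log R ~ {log_R.mean():.4f} <= log p(x) = {log_p_x:.4f}")
```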
Recent work: better Monte Carlo estimators R.

Antithetic sampling: Let T(z) "flip" z around the mean of q, and take
R = (p(z, x) + p(T(z), x)) / (2 q(z)).

[Figure: densities of p(z, x) and q(z) over z, antithetic; log R′ = 0.060]

Likelihood bound: ✓   Posterior approximation: ✗
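A sketch of the antithetic estimator on the same toy model (reusing log_joint, mu, sigma, and rng from the block above); R′ stays unbiased because T is an involution with unit Jacobian:

```python
# Antithetic estimator (sketch; continues the block above).
z = rng.normal(mu, sigma, size=100_000)
Tz = 2.0 * mu - z  # T(z): reflect z through the mean of q

# R' = (p(z,x) + p(T(z),x)) / (2 q(z)). Still unbiased for p(x): T is
# an involution with |dT/dz| = 1, so the T(z) term also integrates to p(x).
log_q = norm.logpdf(z, mu, sigma)
log_R_anti = np.logaddexp(log_joint(z), log_joint(Tz)) - np.log(2.0) - log_q
print(f"E log R' ~ {log_R_anti.mean():.4f} (typically a tighter bound)")
```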
This paper: Is some other distribution close to p?
Contribution of this paper: Given an estimator with E R = p(x), we show how to construct Q(z) such that
KL(Q(z) ‖ p(z | x)) ≤ log p(x) − E log R.

[Figures: for each estimator, densities of p(z, x) vs. q(z) and vs. the constructed Q(z) over z; antithetic: log R′ = 0.060; stratified: log R′ = 0.063; antithetic within strata: log R′ = 0.021]
How?
Unbiased estimator: E_ω R(ω) = p(x). But where is z?

We suggest: you need a coupling a(z | ω), i.e. a conditional distribution with
E_ω [R(ω) a(z | ω)] = p(z, x).

Then there exist augmented distributions such that
KL(Q(z, ω) ‖ p(z, ω | x)) = log p(x) − E log R.
Since marginalizing out ω can only decrease KL, this gives KL(Q(z) ‖ p(z | x)) ≤ log p(x) − E log R.
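For intuition, here is a sketch of one natural coupling for the antithetic estimator on the toy model above: with ω = z0 ∼ q, let a(z | z0) pick z0 or T(z0) with probability proportional to its joint density, which makes E_ω[R(ω) a(z | ω)] = p(z, x). This is our illustration of the recipe (weighting each candidate point by its term in R), not necessarily the paper's exact construction:

```python
from scipy.special import expit

# Coupling sketch for the antithetic estimator: omega = z0 ~ q, and
# a(z | z0) puts mass on z0 and T(z0) proportional to p(., x).
z0 = rng.normal(mu, sigma, size=100_000)
Tz0 = 2.0 * mu - z0
lj, ljT = log_joint(z0), log_joint(Tz0)

# P(pick z0) = p(z0,x) / (p(z0,x) + p(T(z0),x)), computed stably in logs.
prob_keep = expit(lj - ljT)
keep = rng.random(z0.shape) < prob_keep
z_Q = np.where(keep, z0, Tz0)  # draws from the constructed Q(z)
```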
Summary: Tightening a bound log p(x) − E log R is equivalent to VI in an augmented state space (ω, z). To sample from Q(z), draw ω, then z ∼ a(z | ω).

The paper gives couplings for:
◮ Antithetic sampling
◮ Stratified sampling
◮ Quasi-Monte Carlo
◮ Latin hypercube sampling
◮ Arbitrary recursive combinations of the above
Implementation: Different sampling methods with Gaussian q.
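As one illustration with Gaussian q, a stratified-sampling sketch (our own, not the paper's code): split [0, 1] into K equiprobable strata, draw one uniform per stratum, and push through the inverse CDF of q; the averaged ratio remains unbiased for p(x):

```python
from scipy.special import logsumexp

# Stratified sampling with Gaussian q (sketch; reuses log_joint, mu,
# sigma, rng above). One realization of log R'; average over repeated
# runs to estimate E log R'.
K = 16
u = (np.arange(K) + rng.random(K)) / K      # one uniform per stratum of [0,1]
z_k = norm.ppf(u, loc=mu, scale=sigma)      # stratified draws from q
log_w = log_joint(z_k) - norm.logpdf(z_k, mu, sigma)
log_R_strat = logsumexp(log_w) - np.log(K)  # log((1/K) sum_k p(z_k,x)/q(z_k))
```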
Experiments confirm: better likelihood bounds ⇔ better posteriors.

Poster: Tue Dec 10, 5:30-7:30pm @ East Exhibition Hall B + C, #166