CS 285 Instructor: Sergey Levine UC Berkeley Todays Lecture 1. - PowerPoint PPT Presentation

Variational Inference and Generative Models CS 285 Instructor: Sergey Levine UC Berkeley

Today’s Lecture 1. Probabilistic latent variable models 2. Variational inference 3. Amortized variational inference 4. Generative models: variational autoencoders • Goals • Understand latent variable models in deep learning • Understand how to use (amortized) variational inference

Probabilistic models

Latent variable models mixture element

Latent variable models in general “easy” distribution “easy” distribution (e.g., conditional Gaussian) (e.g., Gaussian) “easy” distribution (e.g., Gaussian)

Latent variable models in RL conditional latent variable latent variable models for models for multi-modal policies model-based RL

Other places we’ll see latent variable models Using RL/control + variational inference to model human behavior Muybridge (c. 1870) Mombaur et al. ‘09 Li & Todorov ‘06 Ziebart ‘08 Using generative models and variational inference for exploration

How do we train latent variable models?

Estimating the log-likelihood

Variational Inference

The variational approximation

The variational approximation Jensen’s inequality

A brief aside… Entropy: high Intuition 1: how random is the random variable? Intuition 2: how large is the log probability in expectation under itself low this maximizes the first part this also maximizes the second part (makes it as wide as possible)

A brief aside… KL-Divergence: Intuition 1: how different are two distributions? Intuition 2: how small is the expected log probability of one distribution under another, minus entropy? why entropy? this maximizes the first part this also maximizes the second part (makes it as wide as possible)

The variational approximation

How do we use this? how?

What’s the problem?

Amortized Variational Inference

What’s the problem?

Amortized variational inference how do we calculate this?

Amortized variational inference look up formula for entropy of a Gaussian can just use policy gradient! What’s wrong with this gradient?

The reparameterization trick Is there a better way? most autodiff software (e.g., TensorFlow) will compute this for you!

Another way to look at it… this often has a convenient analytical form (e.g., KL-divergence for Gaussians)

Reparameterization trick vs. policy gradient • Policy gradient • Can handle both discrete and continuous latent variables • High variance, requires multiple samples & small learning rates • Reparameterization trick • Only continuous latent variables • Very simple to implement • Low variance

Example Models

The variational autoencoder

Using the variational autoencoder

Conditional models

Examples

1. collect data 2. learn embedding of image & dynamics model ( jointly ) 3. run iLQG to learn to reach image of goal a type of variational autoencoder with temporally decomposed latent state!

Local models with images

Local models with images variational autoencoder with stochastic dynamics

We’ll see more of this for… Using RL/control + variational inference to model human behavior Muybridge (c. 1870) Mombaur et al. ‘09 Li & Todorov ‘06 Ziebart ‘08 Using generative models and variational inference for exploration

CS 285 Instructor: Sergey Levine UC Berkeley Todays Lecture 1. - PowerPoint PPT Presentation

Variational Inference and Generative Models CS 285 Instructor: Sergey Levine UC Berkeley Todays Lecture 1. Probabilistic latent variable models 2. Variational inference 3. Amortized variational inference 4. Generative models: variational

Performa 285 Performa 285 High Alloy Zinc Nickel High Alloy Zinc Nickel Alloy Zinc Automotive

Ichthys LNG Project Ichthys Project Location Abadi WA 285 P Ichthys Field WA 285

I-285 Top End Express Lanes I-285 Westside Express Lanes 1 Unprecedented Growth in Metro

Ichthys LNG Project Ichthys NG roject Ichthys Project Location Abadi WA 285 P Ichthys

BLU-285: A potent and highly selective inhibitor designed to target malignancies driven by KIT and

GIST: imatinib and beyond Clinical activity of BLU-285 in advanced gastrointestinal stromal tumor

Particulate Air Quality Around Wisconsin Frac Sand Mines #285 B A Presentation by Dr. Crispin

Quality Candles ...in a modern design www.diana-candles.com 285 employees Aprox .

the public sector with Lorraine Forrest-Turner governmentevents.co.uk | 0330 0584 285 |

Clinical activity in a Phase 1 study of BLU-285, a potent, highly-selective inhibitor of KIT D816V

Visual disability Low vision 2015 Estimated blind people 2020 Visually impaired 285 M Blind

Southern Companys Demonstration of a 285 MW Coal-Based Transport Gasifier Project Project

Georgia DOT Updates: MMIP and Transform 285/400 January 23, 2018 Tim Matthews, P.E. MMIP

Lanes and I-285 Top End Express Lanes Fulton County Schools Briefing Tim Matthews, P.E.

COST OR PRICE COST OR PRICE REASONABLENESS REASONABLENESS (CPR) (CPR) UH APM A8.285 RCUH

Introduction to Intelligent Transportation Systems (ITS): I-285 Variable Speed Limits Andrew

Probabilistic Graphical Models 10-708 Learning Partially Observed Learning Partially Observed

Stacks and Queues 25 Stack In/Out LIFO: Last-in First-out Push Pop Undo/Redo Back/Forward

ss 3 Cl Class CSC 495/583 Topics of Software Security X86 Assembly & Stack & Stack

Supporting Functions (procedures) What is needed? Functions: Analogy of a spy secret

Applied Machine Learning Expectation Maximization for Mixture of Gaussians Siamak Ravanbakhsh

Deep Hybrid Models: Bridging Discriminative and Generative Approaches Volodymyr Kuleshov and

Lecture 22 & 23: Variational Autoencoders April 2020 Lecturer: Steven Wu Scribe: Steven Wu

A characterization of combinatorial demand C. Chambers F. Echenique UC San Diego Caltech

CS 285 Instructor: Sergey Levine UC Berkeley Todays Lecture 1. - PowerPoint PPT Presentation

Variational Inference and Generative Models CS 285 Instructor: Sergey Levine UC Berkeley Todays Lecture 1. Probabilistic latent variable models 2. Variational inference 3. Amortized variational inference 4. Generative models: variational

Performa 285 Performa 285 High Alloy Zinc Nickel High Alloy Zinc Nickel Alloy Zinc Automotive

Ichthys LNG Project Ichthys Project Location Abadi WA 285 P Ichthys Field WA 285

I-285 Top End Express Lanes I-285 Westside Express Lanes 1 Unprecedented Growth in Metro

Ichthys LNG Project Ichthys NG roject Ichthys Project Location Abadi WA 285 P Ichthys

BLU-285: A potent and highly selective inhibitor designed to target malignancies driven by KIT and

GIST: imatinib and beyond Clinical activity of BLU-285 in advanced gastrointestinal stromal tumor

Particulate Air Quality Around Wisconsin Frac Sand Mines #285 B A Presentation by Dr. Crispin

Quality Candles ...in a modern design www.diana-candles.com 285 employees Aprox .

the public sector with Lorraine Forrest-Turner governmentevents.co.uk | 0330 0584 285 |

Clinical activity in a Phase 1 study of BLU-285, a potent, highly-selective inhibitor of KIT D816V

Visual disability Low vision 2015 Estimated blind people 2020 Visually impaired 285 M Blind

Southern Companys Demonstration of a 285 MW Coal-Based Transport Gasifier Project Project

Georgia DOT Updates: MMIP and Transform 285/400 January 23, 2018 Tim Matthews, P.E. MMIP

Lanes and I-285 Top End Express Lanes Fulton County Schools Briefing Tim Matthews, P.E.

COST OR PRICE COST OR PRICE REASONABLENESS REASONABLENESS (CPR) (CPR) UH APM A8.285 RCUH

Introduction to Intelligent Transportation Systems (ITS): I-285 Variable Speed Limits Andrew

Probabilistic Graphical Models 10-708 Learning Partially Observed Learning Partially Observed

Stacks and Queues 25 Stack In/Out LIFO: Last-in First-out Push Pop Undo/Redo Back/Forward

ss 3 Cl Class CSC 495/583 Topics of Software Security X86 Assembly &amp; Stack &amp; Stack

Supporting Functions (procedures) What is needed? Functions: Analogy of a spy secret

Applied Machine Learning Expectation Maximization for Mixture of Gaussians Siamak Ravanbakhsh

Deep Hybrid Models: Bridging Discriminative and Generative Approaches Volodymyr Kuleshov and

Lecture 22 &amp; 23: Variational Autoencoders April 2020 Lecturer: Steven Wu Scribe: Steven Wu

A characterization of combinatorial demand C. Chambers F. Echenique UC San Diego Caltech

ss 3 Cl Class CSC 495/583 Topics of Software Security X86 Assembly & Stack & Stack

Lecture 22 & 23: Variational Autoencoders April 2020 Lecturer: Steven Wu Scribe: Steven Wu