deepmdp
play

DeepMDP Learning Latent Space Continuous Models for Representation - PowerPoint PPT Presentation

DeepMDP Learning Latent Space Continuous Models for Representation Learning Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare Simple Representations for RL 2 12 DeepMDP Latent Space Model: Neural networks MDP:


  1. DeepMDP Learning Latent Space Continuous Models for Representation Learning Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare

  2. Simple Representations for RL 2 12

  3. DeepMDP Latent Space Model: Neural networks MDP: & trained via the following two losses:

  4. Reward Loss

  5. Transition Loss

  6. Tractable Losses

  7. Deep Policies

  8. Representation Quality

  9. Only Discards: Ferns, N., Panangaden, P., and Precup, D. Metrics for Finite Markov Decision Processes. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, UAI ’04, pp. 162–169, 2004.

  10. Phi as a Representation

  11. Donut World

  12. DeepMDP on Donut World 2D latent space + DeepMDP losses

  13. DeepMDP on Donut World Visualization of latent distance

  14. DeepMDP Auxiliary Task Base C51 agent + DeepMDP losses

  15. DeepMDP Auxiliary Task Base C51 agent + DeepMDP losses

  16. ● DeepMDPs as Models of the Environment ● Norm-MMD Metrics and their Associated Smoothness

  17. Thanks For Listening Poster #108

Recommend


More recommend