collaborative evolutionary reinforcement learning
play

Collaborative Evolutionary Reinforcement Learning Shauharda Khadka, - PowerPoint PPT Presentation

Collaborative Evolutionary Reinforcement Learning Shauharda Khadka, Somdeb Majumdar, Tarek Nassar, Zach Dwiel, Evren Tumer, Santiago Miret, Yinyin Liu, Kagan Tumer* Artificial Intelligence Products Group, Intel Corporation Oregon State


  1. Collaborative Evolutionary Reinforcement Learning Shauharda Khadka, Somdeb Majumdar, Tarek Nassar, Zach Dwiel, Evren Tumer, Santiago Miret, Yinyin Liu, Kagan Tumer* Artificial Intelligence Products Group, Intel Corporation Oregon State University*

  2. A simple actor-critic policy gradient setup

  3. Learner

  4. What do we optimize exactly?

  5. Learner

  6. Portfolio of Learners (varying discount rates)

  7. Why varying discount rates?

  8. Why varying discount rates?

  9. Back to Portfolio of Learners

  10. Adding a Resource Manager

  11. Adding Neuroevolution

  12. Experiment: Humanoid 12

  13. Experiment: Humanoid ● Solves Humanoid under 1 million samples ● TD3 learners fail entirely ● Neuroevolution ~62.5 million samples 13

Recommend


More recommend