A Quick Look at the Reinforcement Learning course A. LAZARIC ( - PowerPoint PPT Presentation

A Quick Look at the “Reinforcement Learning” course A. LAZARIC ( SequeL Team @INRIA-Lille ) Ecole Centrale - Option DAD SequeL – INRIA Lille EC-RL Course

Why A. LAZARIC – Introduction to Reinforcement Learning 2/18

Why: Important Problems A. LAZARIC – Introduction to Reinforcement Learning 3/18

Why: Important Problems ◮ Autonomous robotics A. LAZARIC – Introduction to Reinforcement Learning 4/18

Why: Important Problems ◮ Autonomous robotics ◮ Elder care A. LAZARIC – Introduction to Reinforcement Learning 4/18

Why: Important Problems ◮ Autonomous robotics ◮ Elder care ◮ Exploration of unknown/dangerous environments A. LAZARIC – Introduction to Reinforcement Learning 4/18

Why: Important Problems ◮ Autonomous robotics ◮ Elder care ◮ Exploration of unknown/dangerous environments ◮ Robotics for entertainment A. LAZARIC – Introduction to Reinforcement Learning 4/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications A. LAZARIC – Introduction to Reinforcement Learning 5/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Trading execution algorithms A. LAZARIC – Introduction to Reinforcement Learning 5/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Trading execution algorithms ◮ Portfolio management A. LAZARIC – Introduction to Reinforcement Learning 5/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Trading execution algorithms ◮ Portfolio management ◮ Option pricing A. LAZARIC – Introduction to Reinforcement Learning 5/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management A. LAZARIC – Introduction to Reinforcement Learning 6/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Energy grid integration A. LAZARIC – Introduction to Reinforcement Learning 6/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Energy grid integration ◮ Maintenance scheduling A. LAZARIC – Introduction to Reinforcement Learning 6/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Energy grid integration ◮ Maintenance scheduling ◮ Energy market regulation A. LAZARIC – Introduction to Reinforcement Learning 6/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Energy grid integration ◮ Maintenance scheduling ◮ Energy market regulation ◮ Energy production management A. LAZARIC – Introduction to Reinforcement Learning 6/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems A. LAZARIC – Introduction to Reinforcement Learning 7/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Web advertising A. LAZARIC – Introduction to Reinforcement Learning 7/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Web advertising ◮ Product recommendation A. LAZARIC – Introduction to Reinforcement Learning 7/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Web advertising ◮ Product recommendation ◮ Date matching A. LAZARIC – Introduction to Reinforcement Learning 7/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Social applications A. LAZARIC – Introduction to Reinforcement Learning 8/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Bike sharing optimization ◮ Social applications A. LAZARIC – Introduction to Reinforcement Learning 8/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Bike sharing optimization ◮ Social applications ◮ Election campaign A. LAZARIC – Introduction to Reinforcement Learning 8/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Bike sharing optimization ◮ Social applications ◮ Election campaign ◮ ER service optimization A. LAZARIC – Introduction to Reinforcement Learning 8/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Bike sharing optimization ◮ Social applications ◮ Election campaign ◮ ER service optimization ◮ Resource distribution optimization A. LAZARIC – Introduction to Reinforcement Learning 8/18

Why: Important Problems ◮ Autonomous robotics ◮ Financial applications ◮ Energy management ◮ Recommender systems ◮ Social applications ◮ And many more... A. LAZARIC – Introduction to Reinforcement Learning 9/18

What A. LAZARIC – Introduction to Reinforcement Learning 10/18

What: Decision-Making under Uncertainty Environment action / state / actuation perception Agent A. LAZARIC – Introduction to Reinforcement Learning 11/18

How: Reinforcement Learning Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them ( trial–and–error ). In the most interesting and challenging cases, actions may affect not only the immediate reward but also the next situation and, through that, all subsequent rewards ( delayed reward ). “An introduction to reinforcement learning”, Sutton and Barto (1998). A. LAZARIC – Introduction to Reinforcement Learning 12/18

How: the Course A. LAZARIC – Introduction to Reinforcement Learning 13/18

How: the Course Environment action / state / actuation perception Agent A. LAZARIC – Introduction to Reinforcement Learning 13/18

How: the Course Environment action / state / actuation perception Agent Formal and rigorous approach to the RL’s way to decision-making under uncertainty A. LAZARIC – Introduction to Reinforcement Learning 13/18

What: the Highlights of the Course How do we formalize the agent-environment interaction? A. LAZARIC – Introduction to Reinforcement Learning 14/18

What: the Highlights of the Course How do we formalize the agent-environment interaction? How do we solve an MDP? A. LAZARIC – Introduction to Reinforcement Learning 14/18

What: the Highlights of the Course How do we formalize the agent-environment interaction? How do we solve an MDP? How do we solve an MDP “online”? A. LAZARIC – Introduction to Reinforcement Learning 14/18

What: the Highlights of the Course How do we formalize the agent-environment interaction? How do we solve an MDP? How do we solve an MDP “online”? How do we effectively trade-off exploration and exploitation? A. LAZARIC – Introduction to Reinforcement Learning 14/18

What: the Highlights of the Course How do we formalize the agent-environment interaction? How do we solve an MDP? How do we solve an MDP “online”? How do we effectively trade-off exploration and exploitation? How do we solve a “huge” MDP? A. LAZARIC – Introduction to Reinforcement Learning 14/18

Who Lectures and Practical Sessions Alessandro LAZARIC SequeL Team INRIA-Lille Nord Europe alessandro.lazaric@inria.fr researchers.lille.inria.fr/˜lazaric/ A. LAZARIC – Introduction to Reinforcement Learning 15/18

When/What/Where See planning on the website. A. LAZARIC – Introduction to Reinforcement Learning 16/18

Evaluation ◮ Three homework (dynamic programming, multi-armed bandit, approximate dynamic programming): 2.5 points each. ◮ Review of literature with oral presentation: 12.5 points. A. LAZARIC – Introduction to Reinforcement Learning 17/18

Reinforcement Learning Alessandro Lazaric alessandro.lazaric@inria.fr sequel.lille.inria.fr

A Quick Look at the Reinforcement Learning course A. LAZARIC ( - PowerPoint PPT Presentation

A Quick Look at the Reinforcement Learning course A. LAZARIC ( SequeL Team @INRIA-Lille ) Ecole Centrale - Option DAD SequeL INRIA Lille EC-RL Course Why A. LAZARIC Introduction to Reinforcement Learning 2/18 Why: Important

Reinforcement Learning A (almost) quick (and very incomplete) introduction Slides from David

Machine Learning, Reinforcement Learning Machine Learning: A quick retrospective AI Class 25

A Quick Look at the Reinforcement Learning course A. LAZARIC ( SequeL Team @INRIA-Lille )

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Reinforcement Learning You can think of supervised learning as the teacher providing answers

Examples of Reinforcement Learning Robocup Soccer Teams Stone & Veloso, Reidmiller et al.

CS885 Reinforcement Learning Module 2: June 6, 2020 Maximum Entropy Reinforcement Learning

Reinforcement Learning Reinforcement Learning Now that you know a little about Optimal Control

Reinforcement Learning CS 4100: Artificial Intelligence Reinforcement Learning II Still

R i f R i f Reinforcement Learning III Reinforcement Learning III t L t L i i III III Dec

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

7. Motor Control and Reinforcement Learning Outline A. Action Selection and Reinforcement B.

Deep Reinforcement Learning [Mastering the Game of Go with Deep Reinforcement Learning and Tree

Reinforcement Learning CS 4100: Artificial Intelligence Reinforcement Learning Ja Jan-Wi

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Module 11 Introduction to Reinforcement Learning CS 886 Sequential Decision Making and

A REINFORCEMENT LEARNING PERSPECTIVE ON AGI Itamar Arel, Machine Intelligence Lab

1 Pure Reinforcement Learning vs. Reinforcement Learning Monte-Carlo Planning No knowledge

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Reinforcement Learning Framework Reinforcement Learning Rewards, Returns Lectures 4 and 5

Introduction to Reinforcement Learning Bayesian Methods in Reinforcement Learning ICML 2007

Neural Combinatorial Optimization With Reinforcement Learning CS885 Reinforcement Learning Paper

Todays Outline Reinforcement Learning Dan Weld Reinforcement Learning Q-value

A Quick Look at the Reinforcement Learning course A. LAZARIC ( - PowerPoint PPT Presentation

A Quick Look at the Reinforcement Learning course A. LAZARIC ( SequeL Team @INRIA-Lille ) Ecole Centrale - Option DAD SequeL INRIA Lille EC-RL Course Why A. LAZARIC Introduction to Reinforcement Learning 2/18 Why: Important

Reinforcement Learning A (almost) quick (and very incomplete) introduction Slides from David

Machine Learning, Reinforcement Learning Machine Learning: A quick retrospective AI Class 25

A Quick Look at the Reinforcement Learning course A. LAZARIC ( SequeL Team @INRIA-Lille )

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Reinforcement Learning You can think of supervised learning as the teacher providing answers

Examples of Reinforcement Learning Robocup Soccer Teams Stone &amp; Veloso, Reidmiller et al.

CS885 Reinforcement Learning Module 2: June 6, 2020 Maximum Entropy Reinforcement Learning

Reinforcement Learning Reinforcement Learning Now that you know a little about Optimal Control

Reinforcement Learning CS 4100: Artificial Intelligence Reinforcement Learning II Still

R i f R i f Reinforcement Learning III Reinforcement Learning III t L t L i i III III Dec

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

7. Motor Control and Reinforcement Learning Outline A. Action Selection and Reinforcement B.

Deep Reinforcement Learning [Mastering the Game of Go with Deep Reinforcement Learning and Tree

Reinforcement Learning CS 4100: Artificial Intelligence Reinforcement Learning Ja Jan-Wi

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

Module 11 Introduction to Reinforcement Learning CS 886 Sequential Decision Making and

A REINFORCEMENT LEARNING PERSPECTIVE ON AGI Itamar Arel, Machine Intelligence Lab

1 Pure Reinforcement Learning vs. Reinforcement Learning Monte-Carlo Planning No knowledge

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Reinforcement Learning Framework Reinforcement Learning Rewards, Returns Lectures 4 and 5

Introduction to Reinforcement Learning Bayesian Methods in Reinforcement Learning ICML 2007

Neural Combinatorial Optimization With Reinforcement Learning CS885 Reinforcement Learning Paper

Todays Outline Reinforcement Learning Dan Weld Reinforcement Learning Q-value

Examples of Reinforcement Learning Robocup Soccer Teams Stone & Veloso, Reidmiller et al.