Reinforcement Learning in Psychology and Neuroscience
With thanks to Elliot Ludvig
Princeton University
Psychology has identified two primitive kinds of learning
• Classical Conditioning
• Operant Conditioning (a.k.a. Instrumental learning)
• Computational theory:
❖ Classical = Prediction: What is going to happen?
❖ Operant = Control: What to do to maximize reward?
Classical Conditioning
Pavlov
• Russian physiologist
• Interested in how learning happened in the brain
• Conditional and Unconditional Stimuli
Rescorla-Wagner Model (1972)
• Computational model of conditioning
❖ Widely cited and used
• Learning as violation of expectations
• TD learning as an extension of RW
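The slide does not spell out the update rule, but the standard Rescorla-Wagner equation captures "learning as violation of expectations"; the notation below is the conventional textbook one rather than anything defined on the slide.

```latex
% Standard Rescorla-Wagner update for a cue A present on a trial.
% alpha_A: salience of cue A; beta: learning rate set by the US;
% lambda: asymptotic associative strength the US supports;
% the sum runs over all cues present on the trial.
\Delta V_A = \alpha_A \,\beta \Bigl( \lambda - \sum_{X \in \text{present cues}} V_X \Bigr)
```

The term in parentheses is the expectation violation; TD learning generalizes it by replacing the fixed asymptote λ with the reward plus the discounted value of the next state.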
Operant Learning
• Operant conditioning is all about choice, in 3 main ways:
❖ Deciding which response to make
❖ Deciding how much to respond
❖ Deciding when to respond
Thorndike’s Puzzle Box
Operant Chambers
Complex Cognition
Marr’s 3 Levels of Analysis
• Computational
❖ What function is being fulfilled?
• Algorithmic
❖ How is it accomplished?
• Implementational
❖ What physical substrate is involved?
The Basic TD Model
• Learn to predict the discounted sum of upcoming reward through TD with linear function approximation:
  $V_t = w_t^\top x_t = \sum_{i=1}^{n} w_t(i)\, x_t(i)$
• The TD error is calculated as:
  $\delta_t = r_{t+1} + \gamma V_{t+1} - V_t$
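A minimal sketch of how these two equations combine into a TD(0) weight update, assuming linear function approximation as on the slide; the feature vectors, step size, and discount factor below are illustrative choices, not values from the slides.

```python
import numpy as np

# TD(0) prediction with linear function approximation, following the slide's
# definitions: V_t = w^T x_t and delta_t = r_{t+1} + gamma * V_{t+1} - V_t.
def td0_update(w, x_t, x_next, r_next, alpha=0.1, gamma=0.9):
    """One TD(0) step: compute the TD error and adjust the weights."""
    v_t = w @ x_t        # current value estimate, V_t = w^T x_t
    v_next = w @ x_next  # next value estimate, V_{t+1}
    delta = r_next + gamma * v_next - v_t  # TD error
    w = w + alpha * delta * x_t            # move weights along the active features
    return w, delta

# Example with two binary features standing in for "cue present" / "reward port".
w = np.zeros(2)
w, delta = td0_update(w, x_t=np.array([1.0, 0.0]),
                      x_next=np.array([0.0, 1.0]), r_next=1.0)
print(delta)  # positive on the first trial: the reward was not yet predicted
```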
TD(λ) algorithm/model/neuron
[Diagram: state or action features x_i, each with an eligibility trace e_i and a weight w_i; the weighted sum Σ w_i · x_i gives the value of the state or action, which combines with reward to produce the TD error δ; each weight is updated in proportion to δ · e_i.]
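The diagram's update rule, each w_i changing in proportion to δ · e_i, can be sketched as below; the accumulating-trace form and the parameter values are assumptions for illustration, not details given on the slide.

```python
import numpy as np

# One step of TD(lambda) with linear function approximation and accumulating
# eligibility traces: every weight moves in proportion to the shared TD error,
# w_i += alpha * delta * e_i, as in the diagram.
def td_lambda_step(w, e, x_t, x_next, r_next,
                   alpha=0.1, gamma=0.9, lam=0.8):
    delta = r_next + gamma * (w @ x_next) - (w @ x_t)  # TD error
    e = gamma * lam * e + x_t   # decay old traces, mark currently active features
    w = w + alpha * delta * e   # credit all recently active features
    return w, e, delta
```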
Brain reward systems
• What signal does this neuron carry?
[Figure: honeybee brain, VUM neuron (Hammer & Menzel)]
Dopamine
• Small-molecule neurotransmitter
❖ Diffuse projections from the midbrain throughout the brain (figure from Pinel (2000), p. 364)
• Key idea: phasic change in baseline dopamine responding = reward prediction error
Dopamine neurons signal the TD error
❖ the error/change in the prediction of reward (Wolfram Schultz et al.)
[Figure: value representation and TD error traces for three cases (Reward Unexpected; Reward Expected, with a predictive cue; Reward Absent), illustrating that the TD error is independent of how the predictions are represented. $\delta_t = r_{t+1} + \gamma V_{t+1} - V_t$]
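A small tabular simulation reproduces the qualitative pattern in the figure; the trial structure, learning rate, and number of training trials are assumptions chosen only to illustrate the three cases, not values from the slides.

```python
import numpy as np

# Within-trial states: 0 = cue onset, 1 = delay, 2 = reward time; V[3] = 0 (end).
gamma, alpha = 1.0, 0.1
V = np.zeros(4)

def trial(V, rewarded, learn=True):
    """Run one trial; report the TD error at cue onset and at reward time."""
    deltas = []
    for s in range(3):
        r = 1.0 if (rewarded and s == 2) else 0.0
        delta = r + gamma * V[s + 1] - V[s]
        if learn:
            V[s] += alpha * delta
        deltas.append(delta)
    # The cue itself arrives unpredictably, so the TD error at cue onset is
    # measured against a pre-cue prediction of ~0: delta_cue = V[0] - 0.
    return {"cue": V[0], "reward_time": deltas[2]}

print("unexpected reward:", trial(V.copy(), rewarded=True, learn=False))
for _ in range(500):  # train until the cue predicts the reward
    trial(V, rewarded=True)
print("reward expected:  ", trial(V.copy(), rewarded=True, learn=False))
print("reward absent:    ", trial(V.copy(), rewarded=False, learn=False))
```

Before training the TD error appears at the reward; after training it shifts to the cue; when a predicted reward is omitted the error goes negative, matching the dopamine firing pattern described on the preceding slides.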
The theory that Dopamine = TD error is one of the most important interactions ever between artificial intelligence and neuroscience