Predictive Hebbian Learning Computational Models of Neural Systems - PowerPoint PPT Presentation

Predictive Hebbian Learning Computational Models of Neural Systems Lecture 5.2 David S. Touretzky Based on slides by Mirella Lapata November, 2019

Outline ● The bee brain ● Classical conditioning in honeybees – identification of VUMmx1 (ventral unpaired median neuron maxillare 1) – properties of VUMmx1 ● Bee foraging in uncertain environments – model of bee foraging – theory of predictive Hebbian learning ● Dopamine neurons in the macaque monkey – activity of dopamine neurons – generalized theory of predictive Hebbian learning – modeling predictions 11/03/19 Computational Models of Neural Systems 2

The Bee Brain ● Honeybees have about one million neurons in about 1 mm 3 . – Fruit flies have only about 100,000 neurons – Ants have about 250,000 neurons. ● The mushroom bodies are thought to be involved in learning and memory. 11/03/19 Computational Models of Neural Systems 3

http://web.neurobio.arizona.edu/gronenberg/nrsc581 11/03/19 Computational Models of Neural Systems 4

Anatomy of the Bee Brain ● MB: Mushroom body ● AL: Antenna lobe ● KC: Kenyon cells ● oSN: Olfactory sensory neurons ● MN17: motor neuron involved in PER 11/03/19 Computational Models of Neural Systems 5

Questions ● What are the cellular mechanisms responsible for classical conditioning? ● How is information about the unconditioned stimulus (US) represented at the neuronal level? ● What are the properties of neurons mediating the US? – Response to US – Convergence with the conditioned stimulus (CS) pathway – Reinforcement in conditioning ● How to identify such neurons? 11/03/19 Computational Models of Neural Systems 6

Experiments on Honeybees ● Bees fixed by waxing dorsal thorax to small metal table. ● Odors were presented in a gentle air stream. ● Sucrose solution applied briefly to antenna and proboscis. ● Proboscis extension was seen after a single pairing of the odor (CS) with sucrose (US). 11/03/19 Computational Models of Neural Systems 7

Measuring Responses ● Proboscis extension reflex (PER) was recorded as an electromyogram from the M17 muscle involved in the reflex. ● Neurons were tested for responsiveness to the US. 11/03/19 Computational Models of Neural Systems 8

VUMmx1 Responds to US ● Unique morphology: arborizes in the suboesophageal ganglion (SOG) and projects widely in regions involved in odor (CS) processing ● Responds to sucrose with a long burst of action potentials which outlasts the sucrose US. ● Neurotransmitter is octopamine: related to dopamine. OE = Oesophagus 11/03/19 Computational Models of Neural Systems 9

VUMmx1 11/03/19 Computational Models of Neural Systems 10

Stimulating VUMmx1 Simulates a US ● Introduce CS then inject depolarizing current into VUMmx1 in lieu of applying sucrose. ● Try both forward and backward conditioning paradigms. Schematic diagram. Not real data! 11/03/19 Computational Models of Neural Systems 11

Open bars: sucrose US Shaded bars: VUMmx1 stimulation 11/03/19 Computational Models of Neural Systems 12

Learning Effects of VUMmx1 Stimulation ● After learning, the odor alone stimulates VUMmx1 activity. ● Temporal contiguity effect: forward pairing causes a larger increase in spiking than backward pairing. ● Differential conditioning effect: – Differentially conditioned bees respond strongly to an odor (CS+) specifically paired with the US, and significantly less to an unpaired odor (CS–). 11/03/19 Computational Models of Neural Systems 13

Differential Conditioning of Two Odors (carnation and orange blossom) spontaneous PER 11/03/19 Computational Models of Neural Systems 14

Discussion ● Main claims: – VUMmx1 mediates the US in associative learning – A learned CS also activates VUMmx1. – Physiology is compatible with structures involved in complex forms of learning. ● Questions: – Is VUMmx1 the only neuron mediating the US? ● Serial homologue of VUMmx1 has almost identical branching pattern. ● Response to electrical stimulation is less than response to sucrose, so perhaps other neurons also contribute to the US signal. – Can VUMmx1 mediate other conditioning phenomena, e.g., blocking, overshadowing, extinction? – It's know that honeybees can exhibit second order conditioning and negative patterning (configural learning). Is VUMmx1 involved? – Do different CS or US stimuli induce similar responses? 11/03/19 Computational Models of Neural Systems 15

Bee Foraging ● Real's (1991) experiment: – Bumblebees foraged on artificial blue and yellow flowers. – Blue flowers contained 2 m l of nectar. – Yellow flowers contained 6 m l in one third of the flowers and no nectar in the remaining two thirds. – Blue and yellow flowers contained the same average amount of nectar. ● Results: – Bees favored the constant blue over the variable yellow flowers even though the mean reward was the same. – Bees forage equally from both flower types if the mean reward from yellow is made sufficiently large. 11/03/19 Computational Models of Neural Systems 16

Montague, Dayan, and Sejnowski (1995) ● Model of bee foraging behavior based on VUMmx1. ● Bee decides at each time step whether to randomly reorient. 11/03/19 Computational Models of Neural Systems 17

Neural Network Model S: sucrose sensitive neuron; R: reward neuron; P: reward predicting neuron; d : prediction error signal 11/03/19 Computational Models of Neural Systems 18

TD Equations d( t ) = r ( t ) + γ V ( t ) − V ( t − 1 ) Let γ = 1: no discounting d( t ) r ( t ) + V ( t ) − V ( t − 1 ) = r ( t ) + V ( t ) ˙ = V ( t ) w i x i ( t ) ∑ = i V ( t ) ˙ w i [ x i ( t ) − x i ( t − 1 ) ] ∑ = i w i ˙ x i ( t ) ∑ = i r ( t ) + ∑ d( t ) w i ˙ x i ( t ) = i 11/03/19 Computational Models of Neural Systems 19

Bee Foraging Model x Y ,x B ,x N encode change in scene ˙ V ( t ) = w b x b ( t ) + w y x y ( t ) + w n x n ( t ) ˙ d( t ) = r ( t ) + V ( t ) Δ w i ( t ) = λ x i ( t − 1 ) ⋅ d( t ) 11/03/19 Computational Models of Neural Systems 20

Parameters w B and w Y are adaptable; w N fjxed at -0.5 1 Probability of reorienting: P r (d( t )) = 1 + exp ( m ⋅d( t )+ b ) Learning rate λ = 0.9 Volume of nectar reward determined by empirically derived utility curve. 11/03/19 Computational Models of Neural Systems 21

Theoretical Idea ● Unit P is analogous to VUMmx1. ● Nectar r(t) represents the reward, which can vary over time. ● At each time t, d (t) determines the bee's next action: continue on present heading, or reorient. ● Weights are adjusted on encounters with flowers: they are updated according to the nectar reward. ● Model best matches the bee when λ = 0.9. ● Graph shows bee response to switch in contingencies on trial 15. 11/03/19 Computational Models of Neural Systems 22

An Aside: Honeybee Operant Learning http://web.neurobio.arizona.edu/gronenberg/nrsc581 11/03/19 Computational Models of Neural Systems 23

Dopamine ● Involved in: – Addiction – Self-stimulation – Learning – Motor actions – Rewarding situations 11/03/19 Computational Models of Neural Systems 24

Responses of Dopamine Neurons in Macaques ● Burst for unexpected reward ● Response transfers to reward predictors ● Pause at time of missed reward 11/03/19 Computational Models of Neural Systems 25

1.5 to 3.5 second delay 11/03/19 Computational Models of Neural Systems 26

Correct and Error Trials 11/03/19 Computational Models of Neural Systems 27

Predictive Hebbian Learning Model 11/03/19 Computational Models of Neural Systems 28

Model Behavior Extinction phase 11/03/19 Computational Models of Neural Systems 29

TD Simulation 1 11/03/19 Computational Models of Neural Systems 30

TD Simulation 2 11/03/19 Computational Models of Neural Systems 31

Card Choice Task Deck B Deck A Magnitude of reward is a function of the % choices from deck A in the last 40 draws. Optimal strategy lies to the right of the crossover point, but human subjects generally get stuck around the crossover point 11/03/19 Computational Models of Neural Systems 32

Card Choice Model “Attention” alternates between decks A and B. Change in predicted reward determines P s , the probability of selecting the current deck. The model tends to get stuck at the crossover point, as humans do. 11/03/19 Computational Models of Neural Systems 33

Conclusions ● Specific neurons distribute a signal that represents information about future expected reward (VUMmx1; dopamine neurons). ● These neurons have access to the precise time at which a reward will be delivered. – Serial compound stimulus makes this possible. ● Fluctuations in activity levels of these neurons represent errors in predictions about future reward. ● Montague et al. (1996) present a model of how such errors could be computed in a real brain. ● The theory makes predictions about human choice behaviors in simple decision-making tasks. 11/03/19 Computational Models of Neural Systems 34

Predictive Hebbian Learning Computational Models of Neural Systems - PowerPoint PPT Presentation

Predictive Hebbian Learning Computational Models of Neural Systems Lecture 5.2 David S. Touretzky Based on slides by Mirella Lapata November, 2019 Outline The bee brain Classical conditioning in honeybees identification of VUMmx1

Hebbian Learning Algorithms for Training Convolutional Neural Networks Gabriele Lagani Computer

Hebbian Learning, Hebbian Learning Principal Component Analysis, and Independent Component

Predictive Hebbian Learning Computational Models of Neural Systems Lecture 5.2 David S.

Session 3 Upskilling for Predictive Analytics Travis M Short, FSA Upskilling for Predictive

Model Predictive Control Model Predictive Control of Hybrid Systems of Hybrid Systems Model

Beyond Vanilla LTP Spike-timing-dependent-plasticity or STDP Hebbian learning rule aSN W

Hebbian learning rule Wij (t) = xjxi Wij (t) = F(xj, xi, , t, ) Time? airpuff

Neural Networks and their applications The Hebbian rule in the brain Donald Hebb hypothesised

Predictive Analytics for Capacity Planning HIC 2015 Andrae Gaeth What is predictive

High-Fidelity Coupling of Predictive Plasma-Wall Models Goal: Develop a predictive model of the

Why the Best Predictive What Do We Mean by . . . Models Are Often Different Main Result: . . .

34/83 Pustejovsky - Brandeis Computational Event Models 35/83 Pustejovsky - Brandeis

CAS Ratemaking and Product Management Seminar Effective Predictive Models Senior Leadership

Predictive Models for Min-Entropy Estimation John Kelsey Kerry A. McKay Meltem S onmez Turan

Enhance Pricing and Predictive Models with Historical Exposure Data Visit www.advisenltd.com at

Overcoming big data bottlenecks in healthcare : a Predictive Modeling case study Predictive

management of SMA: Multidisciplinary perspectives SMA, spinal muscular atrophy. What are the

Understanding how activity in the brain produces movements Raghav Rajan, IISER Pune Nov 29 th

Josh Bonkowsky, MD, PhD Department of Pediatrics University of Utah School of Medicine 1.

Towards Programmable Microfluidics William Thies, Mats Cooper , David Wentzlaff, Todd Thorsen

Cortical activity in the null space: permitting preparation without movement Kaufman, Churchland,

Mining a Half-center oscillator database (HCO-db) Anca Doloc-Mihu Postdoctoral Fellow , Calabrese

MUSIC Mikael Djurfeldt, PDC/KTH HBP Neuromorphic SP Outline Interfaces in computatjonal

Lecture 27: Neural Networks and Deep Learning Mark Hasegawa-Johnson April 6, 2020 License:

Predictive Hebbian Learning Computational Models of Neural Systems - PowerPoint PPT Presentation

Predictive Hebbian Learning Computational Models of Neural Systems Lecture 5.2 David S. Touretzky Based on slides by Mirella Lapata November, 2019 Outline The bee brain Classical conditioning in honeybees identification of VUMmx1

Hebbian Learning Algorithms for Training Convolutional Neural Networks Gabriele Lagani Computer

Hebbian Learning, Hebbian Learning Principal Component Analysis, and Independent Component

Predictive Hebbian Learning Computational Models of Neural Systems Lecture 5.2 David S.

Session 3 Upskilling for Predictive Analytics Travis M Short, FSA Upskilling for Predictive

Model Predictive Control Model Predictive Control of Hybrid Systems of Hybrid Systems Model

Beyond Vanilla LTP Spike-timing-dependent-plasticity or STDP Hebbian learning rule aSN W

Hebbian learning rule Wij (t) = *xj*xi Wij (t) = F(xj, xi, , t, ) Time? airpuff

Neural Networks and their applications The Hebbian rule in the brain Donald Hebb hypothesised

Predictive Analytics for Capacity Planning HIC 2015 Andrae Gaeth What is predictive

High-Fidelity Coupling of Predictive Plasma-Wall Models Goal: Develop a predictive model of the

Why the Best Predictive What Do We Mean by . . . Models Are Often Different Main Result: . . .

34/83 Pustejovsky - Brandeis Computational Event Models 35/83 Pustejovsky - Brandeis

CAS Ratemaking and Product Management Seminar Effective Predictive Models Senior Leadership

Predictive Models for Min-Entropy Estimation John Kelsey Kerry A. McKay Meltem S onmez Turan

Enhance Pricing and Predictive Models with Historical Exposure Data Visit www.advisenltd.com at

Overcoming big data bottlenecks in healthcare : a Predictive Modeling case study Predictive

management of SMA: Multidisciplinary perspectives SMA, spinal muscular atrophy. What are the

Understanding how activity in the brain produces movements Raghav Rajan, IISER Pune Nov 29 th

Josh Bonkowsky, MD, PhD Department of Pediatrics University of Utah School of Medicine 1.

Towards Programmable Microfluidics William Thies*, Mats Cooper , David Wentzlaff*, Todd Thorsen

Cortical activity in the null space: permitting preparation without movement Kaufman, Churchland,

Mining a Half-center oscillator database (HCO-db) Anca Doloc-Mihu Postdoctoral Fellow , Calabrese

MUSIC Mikael Djurfeldt, PDC/KTH HBP Neuromorphic SP Outline Interfaces in computatjonal

Lecture 27: Neural Networks and Deep Learning Mark Hasegawa-Johnson April 6, 2020 License:

Hebbian learning rule Wij (t) = xjxi Wij (t) = F(xj, xi, , t, ) Time? airpuff

Towards Programmable Microfluidics William Thies, Mats Cooper , David Wentzlaff, Todd Thorsen