Hippocampus-inspired Adaptive Neural Algorithms
Brad Aimone, Center for Computing Research, Sandia National Laboratories; Albuquerque, NM
Sandia National Laboratories is a multi-mission laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-AC04-94AL85000. SAND NO. 2011-XXXXP
Can neural computing provide the next Moore’s Law?
Moore’s Law was based on scientific discovery and successive innovations (adapted from Wikipedia)
Each successive advance made more computing feasible (adapted from Wikipedia)
Better devices made better computers, which allowed engineering new devices… (circa 1980; images from Wikipedia)
Better devices made better computers, which allowed engineering new devices… (circa 2017; images from Wikipedia)
If we extrapolate capabilities out, it is not obvious that better devices are the answer… what comes next? [chart: computing intelligence per second per $ over time; conventional ANNs; deep nets became efficient around 2020; beyond that, computing devices? or neural knowledge?]
The cycle of computing scaling has already begun to influence neuroscience
Even if Moore’s Law ends, computing will continue to scale to be smarter [timeline: 1950-2020, 2010-???]
The reservoir of known neuroscience untapped for computing inspiration is enormous (James et al., BICA 2017)
The brain has many mechanisms for adaptation; which are important for computing? Current hardware focuses on synaptic plasticity, if anything
There are different algorithmic approaches to neural learning:
- In situ adaptation: incorporate “new” forms of known neural plasticity into existing algorithms
- Ex situ adaptation: design entirely new algorithms or algorithmic modules to provide cognitive learning abilities
Neurogenesis Deep Learning
Deep networks are a function of training sets
Deep networks often struggle to generalize outside of the training domain
Neurogenesis can be used to capture new information without disrupting old information
- The brain incorporates new neurons in a select number of regions
- Particularly critical for novelty detection and encoding of new information
- “Young” hippocampal neurons exhibit increased plasticity (learn more) and are dynamic in their representations
- “Old” hippocampal neurons appear to have reduced learning and maintain their representations
- Cortex does not have neurogenesis (or similar mechanisms) in adulthood, but does during development
(Aimone et al., Neuron 2011)
Hypothesis: can new neurons be used to facilitate adapting deep learning?
Neurogenesis Deep Learning (NDL) algorithm (a sketch follows this list):
- Stage 1: check autoencoder reconstruction to ensure appropriate representations
- Stage 2: if there is a mismatch, add and train new neurons; train the new nodes on the incoming novel inputs (with reduced learning for existing nodes), and intrinsically replay “imagined” training samples from top-level statistics to fine-tune representations
- Stage 3: repeat neurogenesis until reconstruction errors drop below threshold
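A minimal, self-contained sketch of this loop, assuming a single-hidden-layer tied-weight autoencoder. The class, hyperparameters, and the use of stored old data as a stand-in for intrinsic replay are illustrative assumptions, not the published NDL implementation.

```python
import numpy as np

class GrowingAutoencoder:
    """Toy tied-weight autoencoder that can grow new, highly plastic hidden units."""
    def __init__(self, n_visible, n_hidden, rng=None):
        self.rng = rng or np.random.default_rng(0)
        self.W = self.rng.normal(0, 0.1, size=(n_visible, n_hidden))
        self.new_mask = np.ones(n_hidden)            # 1 = "young" (plastic) unit

    def encode(self, X):
        return np.tanh(X @ self.W)

    def reconstruct(self, X):
        return self.encode(X) @ self.W.T

    def add_units(self, n_new):
        extra = self.rng.normal(0, 0.1, size=(self.W.shape[0], n_new))
        self.W = np.hstack([self.W, extra])
        n_old = self.W.shape[1] - n_new
        self.new_mask = np.concatenate([np.zeros(n_old), np.ones(n_new)])

    def train(self, X, lr_new=0.05, lr_old=0.001, epochs=50):
        # "Young" units learn quickly; mature units are nearly frozen.
        lr = lr_old + (lr_new - lr_old) * self.new_mask   # per-column learning rate
        for _ in range(epochs):
            H = self.encode(X)
            err = self.reconstruct(X) - X                 # reconstruction error
            dH = (err @ self.W) * (1 - H ** 2)            # backprop through tanh
            grad = X.T @ dH + err.T @ H                   # tied-weight gradient
            self.W -= (lr / len(X)) * grad

def ndl_adapt(ae, novel_X, old_X, threshold=0.02, step=8, max_rounds=10):
    """Stage 1: check error; Stage 2: grow and train (old data stands in for
    intrinsic replay); Stage 3: repeat until the error drops below threshold."""
    for _ in range(max_rounds):
        error = np.mean((ae.reconstruct(novel_X) - novel_X) ** 2)
        if error < threshold:
            break
        ae.add_units(step)
        ae.train(np.vstack([novel_X, old_X]))
    return ae

# Usage sketch: ae = GrowingAutoencoder(n_visible=784, n_hidden=64)
#               ae = ndl_adapt(ae, novel_X, old_X)   # arrays of shape (n, 784)
```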
Neurogenesis algorithm effectively balances stability and plasticity
NDL applied to NIST data set
A New View of the Hippocampus
Deep learning ≈ cortex; what ≈ hippocampus?
Can a new framework for studying the hippocampus help inspire computing?
Desired functions:
- Learn associations between cortical modalities
- Encoding of temporal, contextual, and spatial information into associations
- Ability for “one-shot” learning
- Cue-based retrieval of information
Desired properties:
- Compatible with spiking representations
- Network must be stable with adaptation
- Capacity should scale nicely
- Biologically plausible in the context of the extensive hippocampus literature
- Ability to formally quantify costs and performance
This requires a new model of CA3
[circuit diagram: entorhinal cortex → dentate gyrus → CA3 → CA1]
A formal model of DG provides lossless encoding of cortical inputs
- Constraining EC inputs to have “grid cell” structure sets DG size to the biological level of expansion (~10:1)
- A mixed code of broad-tuned (immature) neurons and narrow-tuned (mature) neurons confirms the predicted ability to encode novel information
(William Severa, NICE 2016; Severa et al., Neural Computation, 2017)
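For intuition only, a tiny numpy illustration of the idea (not the formal model from Severa et al.): an EC vector is expanded ~10:1 into a DG population mixing broad-tuned “immature” units with narrow-tuned “mature” units. The sizes, tuning widths, and Gaussian tuning form are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

def dg_encode(ec_input, n_ec=50, expansion=10, immature_frac=0.1,
              sigma_mature=0.2, sigma_immature=1.0):
    """Expand an EC input vector into a DG code with mixed tuning widths."""
    n_dg = n_ec * expansion
    n_immature = int(immature_frac * n_dg)
    centers = rng.normal(size=(n_dg, n_ec))        # each DG unit's preferred input
    widths = np.full(n_dg, sigma_mature)           # mature units: narrow tuning
    widths[:n_immature] = sigma_immature           # immature units: broad tuning
    dist = np.linalg.norm(centers - ec_input, axis=1) / np.sqrt(n_ec)
    return np.exp(-(dist / widths) ** 2)           # Gaussian tuning responses

# A familiar input mostly drives narrow-tuned units near their centers,
# while a novel input still evokes activity in the broad-tuned population.
activity = dg_encode(rng.normal(size=50))
```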
The classic model of CA3 uses Hopfield-like recurrent network attractors (Deng, Aimone, Gage, Nat Rev Neuro 2010)
Problems:
- “Auto-associative” attractors make more sense in a frequency-coding regime than in spiking networks
- Capacity of classic Hopfield networks is generally low
- Quite difficult to perform stable one-shot updates to recurrent networks
Moving away from the Hopfield “learned auto-association” function for CA3 (Hillar and Tran, 2014): Hopfield dynamics are discrete state transitions (a minimal sketch follows).
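To make “discrete state transitions” concrete, here is a minimal classic Hopfield recall sketch. The network size, number of stored patterns, and synchronous update rule are illustrative choices, not taken from the slide’s sources.

```python
import numpy as np

# Patterns are stored with a Hebbian outer-product rule; recall proceeds by
# deterministic sign updates until the state stops changing (a fixed point).
rng = np.random.default_rng(2)
n = 100
patterns = rng.choice([-1, 1], size=(3, n))          # 3 stored memories
W = (patterns.T @ patterns) / n                      # Hebbian weights
np.fill_diagonal(W, 0)

state = patterns[0].copy()
state[:30] *= -1                                     # corrupt the cue
for _ in range(50):                                  # synchronous updates
    new_state = np.sign(W @ state)
    new_state[new_state == 0] = 1
    if np.array_equal(new_state, state):             # reached a fixed point
        break
    state = new_state

print("overlap with stored pattern:", (state @ patterns[0]) / n)
```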
Spiking dynamics are inconsistent with fixed-point attractors in associative models: Hopfield dynamics are discrete state transitions, whereas biology uses sequences of spiking neurons. One can see how sequences could replace fixed populations.
Path attractors, such as orbits, are consistent with spiking dynamics (toy illustration below).
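As a contrast to the fixed point above, the following toy (all connectivity, sizes, and thresholds are assumptions for illustration, not the CA3 model) shows how asymmetric “ring” connectivity makes the stable object a repeating sequence of active groups, i.e., an orbit rather than a fixed population.

```python
import numpy as np

n_groups, group_size = 8, 10
n = n_groups * group_size
W = np.zeros((n, n))
for g in range(n_groups):
    src = slice(g * group_size, (g + 1) * group_size)
    dst_g = (g + 1) % n_groups
    dst = slice(dst_g * group_size, (dst_g + 1) * group_size)
    W[dst, src] = 1.0 / group_size               # group g excites group g+1

state = np.zeros(n)
state[:group_size] = 1.0                         # kick off group 0
for t in range(16):
    state = (W @ state > 0.5).astype(float)      # simple threshold units
    active_group = np.flatnonzero(state)[:1] // group_size
    print(f"t={t:2d} active group: {active_group}")
```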
A new dynamical model of CA3: orbits of spiking neurons (addressing the problems listed above)
Neuromodulation can shift the dynamics of recurrent networks (Carlson, Warrender, Severa, and Aimone; in preparation)
Cortex and subcortical inputs can modulate CA3 attractor access
- Modulation can be provided mechanistically by several sources
- The spatial distribution of CA3 synaptic inputs suggests EC inputs could be considered modulatory
- Metabotropic modulators (e.g., serotonin, acetylcholine) can bias neuronal timings and thresholds
- The attractor network can thus have many “memories”, but only a fraction are accessible within each context (see the toy sketch after this list)
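A toy sketch of context-gated attractor access, illustrating the idea only (not the model in preparation): a modulatory “context” raises the firing threshold of a subset of units, changing which stored state a cue can reach. All sizes, thresholds, and the Hopfield-style substrate are assumptions.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 120
patterns = rng.choice([-1, 1], size=(4, n))
W = (patterns.T @ patterns) / n
np.fill_diagonal(W, 0)

def recall(cue, threshold):
    """Iterate threshold updates from a cue; threshold is a per-unit bias."""
    state = cue.copy()
    for _ in range(50):
        new = np.where(W @ state > threshold, 1, -1)
        if np.array_equal(new, state):
            break
        state = new
    return state

context_bias = np.zeros(n)
context_bias[rng.choice(n, size=40, replace=False)] = 0.8   # suppress 40 units

cue = patterns[0] * rng.choice([1, -1], size=n, p=[0.8, 0.2])  # noisy cue
for name, thr in [("no modulation", np.zeros(n)), ("context A", context_bias)]:
    overlap = recall(cue, thr) @ patterns[0] / n
    print(f"{name}: overlap with memory 0 = {overlap:.2f}")
```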
A new modulated, dynamical model of CA3: orbits of spiking neurons plus context modulation (recapping the same three problems listed above)
CA1 encoding can integrate cortical input with transformed DG/CA3 input
- CA1 plasticity is dramatic: synapses appear to be structurally volatile and representations are temporally volatile, consistent with one-shot learning
- Can consider EC-CA1-EC loosely as an autoencoder, with DG/CA3 “guiding” what representation is used within CA1 (a sketch follows this list)
[diagram: current state + average state of the CA3 orbit → combined representation across time]
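A hedged sketch of the EC-CA1-EC “autoencoder” view, illustrative only: CA1 learns to reconstruct the EC input while its hidden code is pulled toward a target derived from the time-averaged CA3 orbit, so DG/CA3 “guides” which representation CA1 settles on. The layer sizes, linear/tanh form, guidance weight, and random “orbit” are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(4)
n_ec, n_ca1, guidance = 64, 32, 0.5

W_enc = rng.normal(0, 0.1, size=(n_ec, n_ca1))   # EC -> CA1
W_dec = rng.normal(0, 0.1, size=(n_ca1, n_ec))   # CA1 -> EC

ec_input = rng.normal(size=n_ec)                 # current cortical state
ca3_orbit = rng.normal(size=(20, n_ca1))         # 20 time steps of a CA3 orbit
ca1_target = np.tanh(ca3_orbit.mean(axis=0))     # average state of the orbit

lr = 0.05
for _ in range(500):
    ca1 = np.tanh(ec_input @ W_enc)
    recon_err = ca1 @ W_dec - ec_input           # reconstruct the EC input
    guide_err = ca1 - ca1_target                 # pull CA1 toward CA3 guidance
    d_ca1 = (recon_err @ W_dec.T + guidance * guide_err) * (1 - ca1 ** 2)
    W_dec -= lr * np.outer(ca1, recon_err)
    W_enc -= lr * np.outer(ec_input, d_ca1)
```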
A new modulated, dynamical model of CA3: orbits of spiking neurons plus context modulation plus Schaffer collateral (CA3-CA1) learning (recapping the same three problems listed above)