Real-time adaptive information-theoretic optimization of neurophysiology experiments


  1. Real-time adaptive information-theoretic optimization of neurophysiology experiments
     Presented by Alex Roper, March 5, 2009

  2. Goals
     ◮ How do neurons react to stimuli?
     ◮ What is a neuron’s preferred stimulus?
     ◮ Minimize the number of trials.
     ◮ Speed: the method must run in real time.
     ◮ Emphasis on dimensional scalability (e.g., vision).

  3. Challenges
     ◮ Typically high-dimensional:
       ◮ Model complexity (memory)
       ◮ Stimulus complexity (e.g., a visual bitmap)
     ◮ The Bayesian approach is expensive:
       ◮ Estimation
       ◮ Integration
       ◮ Multivariate optimization
     ◮ Limited firing capacity of a neuron (exhaustion)
     ◮ Essential issues:
       ◮ Update a posteriori beliefs quickly given new data
       ◮ Find the optimal stimulus quickly

  4. Neuron Model: p(r_t | {x_t, x_{t−1}, ..., x_{t−t_k}}, {r_{t−1}, ..., r_{t−t_a}})
     ◮ The response r_t to stimulus x_t depends on x_t itself, as well as on the history of stimuli and responses within a constant sliding window.
     ◮ This is needed to capture exhaustion, depletion, etc.
     ◮ The expected response is
       λ_t = E(r_t) = f( Σ_i Σ_{l=1}^{t_k} k_{i,t−l} x_{i,t−l} + Σ_{j=1}^{t_a} a_j r_{t−j} )
     ◮ The filter coefficients k_{i,t−l} represent dependence on the input itself.
     ◮ The coefficients a_j model dependence on observed recent activity.
     ◮ We summarize all unknown parameters as θ; this is what we’re trying to learn.

  5. Generalized Linear Models
     ◮ Distribution function (multivariate Gaussian).
     ◮ Linear predictor, θ.
     ◮ Link function (exponential).
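To make the neuron model and its GLM structure concrete, here is a minimal NumPy sketch of the conditional-intensity computation. The array shapes, variable names, the exponential nonlinearity f, and the Poisson draw for the spike count are illustrative assumptions; the presentation does not specify an implementation.

```python
import numpy as np

def conditional_intensity(k, a, x_hist, r_hist, f=np.exp):
    """Conditional intensity lambda_t = E(r_t) for the GLM neuron model.

    k      : (d, t_k) filter coefficients k_{i, t-l} over stimulus dimensions i
             and lags l = 1..t_k (shape convention is an assumption).
    a      : (t_a,) spike-history coefficients a_j.
    x_hist : (d, t_k) recent stimuli, column l-1 holding x_{t-l}.
    r_hist : (t_a,) recent responses, entry j-1 holding r_{t-j}.
    f      : pointwise nonlinearity; the exponential link is assumed here.
    """
    drive = np.sum(k * x_hist) + np.dot(a, r_hist)  # the GLM linear predictor
    return f(drive)

# Tiny usage example with made-up numbers.
rng = np.random.default_rng(0)
d, t_k, t_a = 4, 3, 2
k = rng.normal(scale=0.1, size=(d, t_k))
a = np.array([-0.5, -0.2])                 # negative weights mimic refractoriness
x_hist = rng.normal(size=(d, t_k))
r_hist = np.array([1.0, 0.0])

lam = conditional_intensity(k, a, x_hist, r_hist)
r_t = rng.poisson(lam)                     # Poisson spike count (assumed emission model)
print(lam, r_t)
```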

  6. Updating the Posterior
     ◮ Ideally, this runs in real time.
     ◮ Approximate the posterior as Gaussian.
       ◮ The posterior is the product of two smooth, log-concave terms (the GLM likelihood and the Gaussian prior).
     ◮ Use the Laplace approximation to construct a Gaussian approximation of the posterior:
       ◮ Set µ_t to the peak of the posterior.
       ◮ Set the covariance matrix C_t to the negative inverse of the Hessian of the log posterior at µ_t.
     ◮ Compute this directly?
       ◮ Complexity is O(td² + d³): O(td²) for the product of t likelihood terms, and O(d³) for inverting the Hessian.
     ◮ Instead, approximate p(θ_{t−1} | x_{t−1}, r_{t−1}) as Gaussian.
       ◮ The new likelihood term then depends on θ only through a single projection of the input, so Bayes’ rule reduces each update to a one-dimensional problem: O(d²).
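The O(d²) claim can be illustrated with a short sketch of the one-dimensional Laplace update, assuming a Poisson response with exponential link and an augmented input vector x_tilde that stacks the stimulus window and spike history, so the likelihood depends on θ only through one projection. All names and the specific solver below are assumptions, not the authors' code.

```python
import numpy as np

def update_posterior(mu, C, x_tilde, r):
    """One-step Gaussian (Laplace) posterior update in O(d^2).

    mu, C   : mean and covariance of the Gaussian approximation to
              p(theta | data up to t-1).
    x_tilde : (d,) augmented input (stimulus window and spike history stacked),
              so the likelihood depends on theta only through rho = x_tilde . theta.
    r       : observed spike count at time t.

    Assumes a Poisson response with exponential link, so the new log-likelihood
    term is r * rho - exp(rho) up to constants.
    """
    Cx = C @ x_tilde               # O(d^2)
    b = float(x_tilde @ mu)        # rho evaluated at the old mean
    q = float(x_tilde @ Cx)        # x_tilde^T C x_tilde > 0

    # The new mode lies on the line mu + s * C x_tilde; find s by solving the
    # scalar equation s = r - exp(b + s*q) with a few Newton steps.
    s = 0.0
    for _ in range(50):
        g = s - r + np.exp(b + s * q)        # monotone increasing in s
        dg = 1.0 + q * np.exp(b + s * q)
        step = g / dg
        s -= step
        if abs(step) < 1e-10:
            break

    mu_new = mu + s * Cx
    D = np.exp(b + s * q)          # -(second derivative of log-likelihood) at the mode

    # Rank-one (Woodbury) covariance update: (C^-1 + D x x^T)^-1, still O(d^2).
    C_new = C - np.outer(Cx, Cx) * (D / (1.0 + D * q))
    return mu_new, C_new

# Example: one update step with made-up values.
d = 10
mu, C = np.zeros(d), np.eye(d)
x_tilde = np.random.default_rng(2).normal(size=d) / np.sqrt(d)
mu, C = update_posterior(mu, C, x_tilde, r=3)
```

Because the new likelihood term is one-dimensional in θ, the mode search is a scalar root find and the covariance correction is rank one, which is what avoids the O(td² + d³) cost of recomputing the Laplace approximation from scratch.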

  7. Deriving the optimal stimulus
     ◮ Main idea: choose the next stimulus to maximize the conditional mutual information
       I(θ; r_{t+1} | x_{t+1}, x_t, r_t) = H(θ | x_t, r_t) − H(θ | x_{t+1}, r_{t+1}).
     ◮ Since the first term does not depend on the choice of x_{t+1}, this is equivalent to minimizing the conditional entropy H(θ | x_{t+1}, r_{t+1}).
     ◮ This yields an equation for the posterior covariance in terms of the observed Fisher information, J_obs.
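The slides stop at this high-level statement; as an illustration of where it leads in the Poisson/exponential-link case, the sketch below scores candidate augmented stimuli by a Monte Carlo estimate of the expected entropy reduction, which under the Gaussian posterior depends on each candidate only through the scalars x·µ and xᵀCx. The Monte Carlo average, the power constraint in the example, and all names here are assumptions for illustration, not the original procedure, which needs faster approximations than plain Monte Carlo to keep the search real-time.

```python
import numpy as np

def expected_info_gain(mu, C, x_tilde, n_samples=2000, rng=None):
    """Monte Carlo estimate of the expected entropy reduction for one candidate.

    Under the Gaussian posterior N(mu, C) and a Poisson/exponential-link GLM,
    the observed Fisher information J_obs = exp(rho) * x x^T is rank one, so the
    gain depends on the candidate only through m = x . mu and v = x^T C x.
    """
    rng = np.random.default_rng() if rng is None else rng
    m = float(x_tilde @ mu)
    v = float(x_tilde @ (C @ x_tilde))
    rho = rng.normal(m, np.sqrt(v), size=n_samples)   # predictive draws of the linear drive
    # Matrix determinant lemma: new entropy = old entropy - 0.5 * log(1 + exp(rho) * v).
    return 0.5 * np.mean(np.log1p(np.exp(rho) * v))

def pick_stimulus(mu, C, candidates, rng=None):
    """Greedy selection: score each candidate augmented input and take the best."""
    scores = [expected_info_gain(mu, C, x, rng=rng) for x in candidates]
    return candidates[int(np.argmax(scores))], scores

# Example: choose among random unit-norm candidates (power constraint assumed).
rng = np.random.default_rng(1)
d = 10
mu, C = np.zeros(d), np.eye(d)
cands = [c / np.linalg.norm(c) for c in rng.normal(size=(50, d))]
best, scores = pick_stimulus(mu, C, cands, rng=rng)
```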
