Information Geometric Nonlinear Filtering: a Hilbert Space Approach


1. Information Geometric Nonlinear Filtering: a Hilbert Space Approach. Nigel Newton (University of Essex). Information Geometry and its Applications IV, Liblice, June 2016. In honour of Shun-ichi Amari on the occasion of his 80th birthday.

2. Overview
• Nonlinear filtering (recursive Bayesian estimation), and the need for a proper state space for posterior distributions
• The infinite-dimensional Hilbert manifold of probability measures, M (and Banach variants)
• An M-valued Itô stochastic differential equation for the nonlinear filter
• Information geometric properties of the nonlinear filter

3-4. Nonlinear Filtering
• Markov "signal" process: $X = (X_t,\ t \in [0,\infty))$, $X_t \in \mathbb{X}$, where $\mathbb{X}$ is a metric space with reference probability measure $\mu$; e.g. $\mathbb{X} = \mathbb{R}^d$, $X_0 \sim N(0, I)$
• Partial "observation" process: $(Y_t \in \mathbb{R},\ t \in [0,\infty))$, where
  $Y_t = \int_0^t h(X_s)\,ds + W_t$
  and $W$ is a Brownian motion independent of $X$ (a simulation sketch of this model follows below)
• Estimate $X_t$ at each time $t$ from its prior distribution $P_t$ and the history of the observation, $Y_0^t := (Y_s,\ s \in [0,t])$
• The linear-Gaussian case yields the Kalman-Bucy filter
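To make the setup concrete, here is a minimal simulation sketch of the signal/observation pair. The model choices (a scalar Ornstein-Uhlenbeck signal with $dX_t = -X_t\,dt + dB_t$ and sensor function $h(x) = x$) are illustrative assumptions, not taken from the talk.

import numpy as np

# Simulate the signal/observation model Y_t = \int_0^t h(X_s) ds + W_t.
# Illustrative choices: dX = -X dt + dB (Ornstein-Uhlenbeck), h(x) = x.
rng = np.random.default_rng(0)
T, n = 5.0, 5000
dt = T / n

X = np.empty(n + 1)
Y = np.empty(n + 1)
X[0] = rng.standard_normal()      # X_0 ~ N(0, 1), the reference prior
Y[0] = 0.0
for k in range(n):
    dB = np.sqrt(dt) * rng.standard_normal()  # signal noise increment
    dW = np.sqrt(dt) * rng.standard_normal()  # observation noise, independent of X
    X[k + 1] = X[k] - X[k] * dt + dB          # Euler-Maruyama step for the signal
    Y[k + 1] = Y[k] + X[k] * dt + dW          # dY = h(X) dt + dW with h(x) = x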

5-6. Nonlinear Filtering
• Regular conditional (posterior) distribution: $\Pi_t \in \mathcal{P}(\mathbb{X})$,
  $\Pi_t(B) := \mathbf{P}(X_t \in B \mid Y_0^t)$
• $\Pi_t$ is a random probability measure evolving on $\mathcal{P}(\mathbb{X})$. How should we represent it?
• We could consider the conditional density (w.r.t. $\mu$), $p_t$, with its typical differential equation (Shiryaev, Wonham, Stratonovich, Kushner):
  $dp_t = \mathcal{A}^* p_t\,dt + p_t\,(h - \bar{h}_t)(dY_t - \bar{h}_t\,dt)$, where $\bar{h}_t := \int h(x)\,p_t(x)\,\mu(dx)$
  (a discretised sketch of this equation follows below)
• Spaces of densities are not necessarily optimal
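For intuition, a grid-based discretisation of the filtering density equation can be sketched as follows. This assumes the same illustrative model as before ($dX = -X\,dt + dB$, $h(x) = x$, so $\mathcal{A}^* p = \partial_x(xp) + \tfrac{1}{2}\partial_x^2 p$); it is an explicit Euler scheme, adequate only for small time steps, not a production implementation.

import numpy as np

def kushner_step(p, x, dx, dt, dY):
    # One explicit Euler step of dp = A*p dt + p (h - hbar)(dY - hbar dt)
    # on a grid x, for the illustrative model above.
    drift = np.gradient(x * p, dx)                    # d/dx (x p)
    diff = 0.5 * np.gradient(np.gradient(p, dx), dx)  # 0.5 d^2/dx^2 p
    h = x                                             # h(x) = x
    hbar = np.trapz(h * p, dx=dx)                     # \bar h_t = \int h p dmu
    p_new = p + (drift + diff) * dt + p * (h - hbar) * (dY - hbar * dt)
    p_new = np.clip(p_new, 0.0, None)                 # guard against negativity
    return p_new / np.trapz(p_new, dx=dx)             # renormalise to a density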

7-10. Mean-Square Errors
• Suppose $\mathbf{E}\,f(X_t)^2 < \infty$ for some $f : \mathbb{X} \to \mathbb{R}$
• Then $\Pi_t f := \mathbf{E}(f(X_t) \mid Y_0^t)$ minimises the mean-square error: for any $Y_0^t$-measurable estimate $\hat{f}_t$,
  $\mathbf{E}\big(f(X_t) - \hat{f}_t\big)^2 = \mathbf{E}\big(f(X_t) - \Pi_t f\big)^2 + \mathbf{E}\big(\Pi_t f - \hat{f}_t\big)^2$
  (estimation error plus approximation error; a short derivation follows below)
• If $\hat{f}_t = \hat{\Pi}_t f$ for some $\hat{\Pi}_t \in \mathcal{P}(\mathbb{X})$ with $\hat{\Pi}_t \ll \mu$, and $\Pi_t \ll \mu$, then
  $\mathbf{E}\big(\Pi_t f - \hat{\Pi}_t f\big)^2 \le \|f\|_{L^2(\mu)}^2\, \mathbf{E}\,\|p_t - \hat{p}_t\|_{L^2(\mu)}^2$,
  and so the $L^2(\mu)$ norm on densities may be useful
• Not if $f = \mathbf{1}_B$ and $\Pi_t(B)$ is very small (e.g. fault detection)
• When topologised in this way, $\mathcal{P}(\mathbb{X})$ has a boundary
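The decomposition follows from the orthogonality of the conditional mean; a brief derivation, consistent with the definitions above. Expanding the square,
$\mathbf{E}\big(f(X_t) - \hat{f}_t\big)^2 = \mathbf{E}\big(f(X_t) - \Pi_t f\big)^2 + 2\,\mathbf{E}\big[(f(X_t) - \Pi_t f)(\Pi_t f - \hat{f}_t)\big] + \mathbf{E}\big(\Pi_t f - \hat{f}_t\big)^2,$
and the cross term vanishes by the tower property, since $\Pi_t f - \hat{f}_t$ is $Y_0^t$-measurable and $\mathbf{E}\big(f(X_t) - \Pi_t f \mid Y_0^t\big) = 0$. The density bound is Cauchy-Schwarz:
$\big(\Pi_t f - \hat{\Pi}_t f\big)^2 = \Big(\int f\,(p_t - \hat{p}_t)\,d\mu\Big)^2 \le \|f\|_{L^2(\mu)}^2\,\|p_t - \hat{p}_t\|_{L^2(\mu)}^2.$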

11-12. Multi-Objective Mean-Square Errors
• Maximising the approximation error, relative to the estimation error, over square-integrable functions:
  $\mathcal{M}(\hat{\Pi}_t \mid \Pi_t) := \sup_{f \in L^2(\Pi_t)} \frac{\big(\Pi_t f - \hat{\Pi}_t f\big)^2}{\mathbf{E}\big((f(X_t) - \Pi_t f)^2 \mid Y_0^t\big)}$ (approximation error over estimation error)
  $= \sup_{f \in F_t} \Big(\mathbf{E}_{\Pi_t}\big[f\,\big(d\hat{\Pi}_t/d\Pi_t - 1\big)\big]\Big)^2 = \mathbf{E}_{\Pi_t}\big(d\hat{\Pi}_t/d\Pi_t - 1\big)^2,$
  where $F_t := \big\{ f \in L^2(\Pi_t) : \mathbf{E}_{\Pi_t} f = 0,\ \mathbf{E}_{\Pi_t} f^2 = 1 \big\}$ (the last equality is verified below)
• In time-recursive approximations, the accuracy of $\hat{\Pi}_t$ is affected by that of $\hat{\Pi}_s$ ($s < t$). This naturally induces multi-objective criteria at time $s$ (nonlinear dynamics).
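The last equality is Cauchy-Schwarz, under the reconstruction above. For $f \in F_t$,
$\Big(\mathbf{E}_{\Pi_t}\big[f\,(d\hat{\Pi}_t/d\Pi_t - 1)\big]\Big)^2 \le \mathbf{E}_{\Pi_t} f^2 \cdot \mathbf{E}_{\Pi_t}\big(d\hat{\Pi}_t/d\Pi_t - 1\big)^2 = \mathbf{E}_{\Pi_t}\big(d\hat{\Pi}_t/d\Pi_t - 1\big)^2,$
with equality when $f$ is proportional to $d\hat{\Pi}_t/d\Pi_t - 1$ (which has $\Pi_t$-mean zero), so the supremum is attained and equals Pearson's $\chi^2$ divergence between $\hat{\Pi}_t$ and $\Pi_t$.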

13-16. Geometric Sensitivity
• $\mathcal{M}$ is "geometrically sensitive": it requires small probabilities to be approximated with greater absolute accuracy than large probabilities (a numerical illustration follows below)
• When topologised by $\mathcal{M}$, $\mathcal{P}(\mathbb{X})$ does not have a boundary
• This is highly desirable in the context of recursive Bayesian estimation, where conditional probabilities are repeatedly multiplied by the likelihood functions of new observations
• $\mathcal{M}$ is Pearson's $\chi^2$ divergence. It belongs to the one-parameter family of $\alpha$-divergences: $\mathcal{M} = \mathcal{D}^{(3)}$
• It is too restrictive to use in practice
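A toy numerical illustration of this sensitivity (the distributions and error sizes are hypothetical, chosen only to make the point): the same absolute error costs far more under $\chi^2$ when it falls on a small probability than on a large one, whereas an $L^2$ criterion on densities treats the two errors identically.

import numpy as np

def chi2(p_hat, p):                 # Pearson chi^2 = E_p (dp_hat/dp - 1)^2
    return np.sum((p_hat - p) ** 2 / p)

p = np.array([1e-4, 0.5, 0.4999])   # "true" posterior; first cell is a rare event
delta = 5e-5                        # fixed absolute approximation error

err_rare = np.array([-delta, delta, 0.0])  # error hitting the rare cell
err_big = np.array([0.0, delta, -delta])   # same-size error on large cells

print(chi2(p + err_rare, p))  # ~2.5e-5: the rare-cell error is heavily penalised
print(chi2(p + err_big, p))   # ~1.0e-8: same absolute error, tiny chi^2 penalty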

17-19. α-Divergences
• As $|\alpha|$ becomes larger, $\mathcal{D}^{(\alpha)}$ becomes increasingly "geometrically sensitive" (one common parameterisation of the family is written out below)
• The case $\alpha = 0$ yields the Hellinger metric
• The case $\alpha = \pm 1$ yields the KL divergence:
  $\mathcal{D}^{(1)}(P \mid Q) := \mathcal{D}^{(-1)}(Q \mid P) = \mathbf{E}_Q\Big[\frac{dP}{dQ}\log\frac{dP}{dQ}\Big]$
• This is widely used in practice
• Symmetric error criteria may be appropriate, such as
  $\mathcal{D}(\Pi_t \mid \hat{\Pi}_t) + \mathcal{D}(\hat{\Pi}_t \mid \Pi_t)$
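For reference, in Amari's convention (one common parameterisation; the talk's normalisation and argument ordering may differ) the $\alpha$-divergence family is
$\mathcal{D}^{(\alpha)}(P \mid Q) = \frac{4}{1-\alpha^2}\Big(1 - \int p^{\frac{1-\alpha}{2}}\, q^{\frac{1+\alpha}{2}}\, d\mu\Big), \qquad \alpha \neq \pm 1,$
where $p = dP/d\mu$ and $q = dQ/d\mu$. Setting $\alpha = 3$ gives $\frac{1}{2}\int (q - p)^2/p\, d\mu$, Pearson's $\chi^2$ up to a constant factor; setting $\alpha = 0$ gives $2\int (\sqrt{p} - \sqrt{q})^2\, d\mu$, proportional to the squared Hellinger distance; and letting $\alpha \to \pm 1$ recovers the two KL divergences above.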
