CS 287 Lecture 12 (Fall 2019): Kalman Filtering. Lecturer: Ignasi Clavera. Slides by Pieter Abbeel, UC Berkeley EECS. Many slides adapted from Thrun, Burgard and Fox, Probabilistic Robotics.
Outline: Gaussians; Kalman filtering; Extended Kalman Filter (EKF); Unscented Kalman Filter (UKF) [aka "sigma-point filter"]
Multivariate Gaussians
Multivariate Gaussians
x ~ N(μ, Σ) with density p(x) = (2π)^{-n/2} |Σ|^{-1/2} exp(-(1/2) (x - μ)^T Σ^{-1} (x - μ)).
Mean: μ = E[x] = ∫ x p(x) dx (integral of a vector = vector of integrals of each entry).
Covariance: Σ = E[(x - μ)(x - μ)^T] = ∫ (x - μ)(x - μ)^T p(x) dx (integral of a matrix = matrix of integrals of each entry).
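To make the definitions above concrete, here is a minimal numpy sketch (my own illustration, not part of the original slides) that evaluates the density and checks the mean/covariance definitions by sampling; the names gaussian_pdf, mu, Sigma are mine.

```python
import numpy as np

def gaussian_pdf(x, mu, Sigma):
    """Evaluate the multivariate Gaussian density N(x; mu, Sigma)."""
    n = mu.shape[0]
    diff = x - mu
    norm_const = (2 * np.pi) ** (-n / 2) * np.linalg.det(Sigma) ** (-0.5)
    return norm_const * np.exp(-0.5 * diff @ np.linalg.solve(Sigma, diff))

mu = np.array([0.0, 0.0])
Sigma = np.array([[1.0, 0.8],
                  [0.8, 1.0]])

# Monte Carlo check of the mean / covariance definitions above
samples = np.random.multivariate_normal(mu, Sigma, size=100_000)
print(samples.mean(axis=0))           # ~ mu
print(np.cov(samples, rowvar=False))  # ~ Sigma
print(gaussian_pdf(np.array([0.0, 0.0]), mu, Sigma))
```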
Multivariate Gaussians: Examples — three density plots with µ = [1; 0], µ = [-0.5; 0], µ = [-1; -1.5], each with Σ = [1 0; 0 1].
Multivariate Gaussians: Examples — three density plots with µ = [0; 0] and Σ = [1 0; 0 1], Σ = [0.6 0; 0 0.6], Σ = [2 0; 0 2].
Multivariate Gaussians: Examples — three density plots with µ = [0; 0] and Σ = [1 0; 0 1], Σ = [1 0.5; 0.5 1], Σ = [1 0.8; 0.8 1].
Multivariate Gaussians: Examples — three density plots with µ = [0; 0] and Σ = [1 -0.5; -0.5 1], Σ = [1 -0.8; -0.8 1], Σ = [3 0.8; 0.8 1].
Partitioned Multivariate Gaussian
Consider a multivariate Gaussian and partition the random vector into (X, Y):
µ = [µ_X; µ_Y],   Σ = [Σ_XX Σ_XY; Σ_YX Σ_YY]
Partitioned Multivariate Gaussian: Dual Representation
Precision matrix: Γ = Σ^{-1}, partitioned the same way: Γ = [Γ_XX Γ_XY; Γ_YX Γ_YY]   (1)
Straightforward to verify from (1) that:
Σ_XX = (Γ_XX - Γ_XY Γ_YY^{-1} Γ_YX)^{-1},   Σ_XY = -(Γ_XX - Γ_XY Γ_YY^{-1} Γ_YX)^{-1} Γ_XY Γ_YY^{-1}
And swapping the roles of Σ and Γ:
Γ_XX = (Σ_XX - Σ_XY Σ_YY^{-1} Σ_YX)^{-1},   Γ_XY = -(Σ_XX - Σ_XY Σ_YY^{-1} Σ_YX)^{-1} Σ_XY Σ_YY^{-1}
Marginalization: p(x) = ?
We integrate out over y to find the marginal:
p(x) = ∫ p(x, y) dy = N(x; µ_X, Σ_XX)
Hence we have: X ~ N(µ_X, Σ_XX).
Note: if we had known beforehand that p(x) would be a Gaussian distribution, then we could have found the result more quickly: we would have just needed E[X] and Cov(X), which we had available through µ_X and Σ_XX.
Marginalization Recap
If (X, Y) ~ N([µ_X; µ_Y], [Σ_XX Σ_XY; Σ_YX Σ_YY])
Then X ~ N(µ_X, Σ_XX) and Y ~ N(µ_Y, Σ_YY).
Self-quiz
Conditioning: p(x | Y = y_0) = ?
We have p(x | Y = y_0) = p(x, y_0) / p(y_0).
Hence we have:
p(x | Y = y_0) = N(x; µ_X + Σ_XY Σ_YY^{-1} (y_0 - µ_Y), Σ_XX - Σ_XY Σ_YY^{-1} Σ_YX)
- The conditional mean moves according to the correlation and the variance of the measurement.
- The conditional covariance does not depend on y_0.
Conditioning Recap
If (X, Y) ~ N([µ_X; µ_Y], [Σ_XX Σ_XY; Σ_YX Σ_YY])
Then X | Y = y_0 ~ N(µ_X + Σ_XY Σ_YY^{-1} (y_0 - µ_Y), Σ_XX - Σ_XY Σ_YY^{-1} Σ_YX).
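A small numpy sketch (my own illustration, not from the slides) of the two recaps: marginalizing a joint Gaussian is just block slicing of the mean and covariance, and conditioning applies the formula above.

```python
import numpy as np

# Joint Gaussian over (X, Y); X = first two entries, Y = last entry
mu = np.array([1.0, 0.0, -0.5])
Sigma = np.array([[1.0, 0.5, 0.2],
                  [0.5, 2.0, 0.3],
                  [0.2, 0.3, 1.5]])
ix, iy = slice(0, 2), slice(2, 3)

# Marginalization: take the corresponding blocks
mu_x, Sigma_xx = mu[ix], Sigma[ix, ix]

# Conditioning on Y = y0
y0 = np.array([0.4])
Sigma_xy = Sigma[ix, iy]
Sigma_yy = Sigma[iy, iy]
K = Sigma_xy @ np.linalg.inv(Sigma_yy)
mu_cond = mu_x + K @ (y0 - mu[iy])
Sigma_cond = Sigma_xx - K @ Sigma_xy.T
print(mu_cond, Sigma_cond)
```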
Outline: Gaussians; Kalman filtering; Extended Kalman Filter (EKF); Unscented Kalman Filter (UKF) [aka "sigma-point filter"]
Kalman Filter
Kalman Filter = special case of a Bayes' filter in which the dynamics model and the sensory model are linear Gaussian:
x_{t+1} = A_t x_t + B_t u_t + w_t,   w_t ~ N(0, Q_t)
z_t = C_t x_t + v_t,   v_t ~ N(0, R_t)
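As a running illustration (mine, not from the slides), a numpy sketch that simulates this linear-Gaussian model for a hypothetical 2D state; the later update sketches reuse these A, B, C, Q, R matrices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Time-invariant example of the model above: 2D state, 1D control, 1D measurement
A = np.array([[1.0, 1.0],
              [0.0, 1.0]])       # dynamics matrix A_t
B = np.array([[0.5],
              [1.0]])            # control matrix B_t
C = np.array([[1.0, 0.0]])       # measurement matrix C_t
Q = 0.01 * np.eye(2)             # process noise covariance Q_t
R = np.array([[0.25]])           # measurement noise covariance R_t

def simulate(T, x0):
    """Roll out x_{t+1} = A x_t + B u_t + w_t, z_t = C x_t + v_t for T steps."""
    xs, us, zs = [x0], [], []
    x = x0
    for t in range(T):
        u = np.array([0.1])                                      # arbitrary control input
        x = A @ x + B @ u + rng.multivariate_normal(np.zeros(2), Q)
        z = C @ x + rng.multivariate_normal(np.zeros(1), R)
        xs.append(x); us.append(u); zs.append(z)
    return np.array(xs), np.array(us), np.array(zs)

xs, us, zs = simulate(T=50, x0=np.zeros(2))
```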
Time update
Assume we have the current belief for x_t: P(x_t | z_0, …, z_t).   [graphical model: X_t → X_{t+1}]
Then, after one time step passes:
P(x_{t+1} | z_0, …, z_t) = ∫ P(x_{t+1} | x_t) P(x_t | z_0, …, z_t) dx_t
Time Update: Finding the joint
We first form the joint P(x_t, x_{t+1} | z_0, …, z_t) = P(x_t | z_0, …, z_t) P(x_{t+1} | x_t). Now we can choose to continue by either of:
(i) mold it into a standard multivariate Gaussian format so we can read off the joint distribution's mean and covariance;
(ii) observe this is a quadratic form in x_t and x_{t+1} in the exponent, and the exponent is the only place they appear; hence we know this is a multivariate Gaussian, and we can directly compute its mean and covariance. [usually simpler!]
Time Update: Finding the joint
We follow (ii) and find the mean and covariance of the joint of (x_t, x_{t+1}) given z_0, …, z_t:
mean: [µ_t; A_t µ_t + B_t u_t]
covariance: [Σ_t, Σ_t A_t^T; A_t Σ_t, A_t Σ_t A_t^T + Q_t]
[Exercise: try to prove each of these without referring to this slide!]
Time Update Recap
Assume we have x_t | z_0, …, z_t ~ N(µ_t, Σ_t) and x_{t+1} = A_t x_t + B_t u_t + w_t, w_t ~ N(0, Q_t).
Then the joint of (x_t, x_{t+1}) given z_0, …, z_t is Gaussian with mean [µ_t; A_t µ_t + B_t u_t] and covariance [Σ_t, Σ_t A_t^T; A_t Σ_t, A_t Σ_t A_t^T + Q_t].
Marginalizing the joint, we immediately get x_{t+1} | z_0, …, z_t ~ N(A_t µ_t + B_t u_t, A_t Σ_t A_t^T + Q_t).
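A minimal sketch of this time update in numpy (my own illustration, assuming the A, B, Q matrices from the simulation sketch above):

```python
import numpy as np

def time_update(mu, Sigma, u, A, B, Q):
    """Propagate belief N(mu, Sigma) through x_{t+1} = A x_t + B u_t + w_t, w_t ~ N(0, Q)."""
    mu_pred = A @ mu + B @ u            # mean of the marginal over x_{t+1}
    Sigma_pred = A @ Sigma @ A.T + Q    # covariance of the marginal over x_{t+1}
    return mu_pred, Sigma_pred

# Example usage with the earlier matrices:
# mu_pred, Sigma_pred = time_update(np.zeros(2), np.eye(2), np.array([0.1]), A, B, Q)
```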
Generality!
Assume we have V ~ N(µ_V, Σ_V) and W = A V + b + noise, with the noise ~ N(0, Q) independent of V.
Then (V, W) is jointly Gaussian with mean [µ_V; A µ_V + b] and covariance [Σ_V, Σ_V A^T; A Σ_V, A Σ_V A^T + Q].
Marginalizing the joint, we immediately get W ~ N(A µ_V + b, A Σ_V A^T + Q).
Observation update
Assume we have: x_{t+1} | z_0, …, z_t ~ N(\bar{µ}_{t+1}, \bar{Σ}_{t+1}) and z_{t+1} = C_{t+1} x_{t+1} + v_{t+1}, v_{t+1} ~ N(0, R_{t+1}).
Then (x_{t+1}, z_{t+1}) given z_0, …, z_t is jointly Gaussian with mean [\bar{µ}_{t+1}; C_{t+1} \bar{µ}_{t+1}] and covariance [\bar{Σ}_{t+1}, \bar{Σ}_{t+1} C_{t+1}^T; C_{t+1} \bar{Σ}_{t+1}, C_{t+1} \bar{Σ}_{t+1} C_{t+1}^T + R_{t+1}].
And, by conditioning on Z_{t+1} = z_{t+1} (see the earlier slides on Gaussians), we readily get:
µ_{t+1} = \bar{µ}_{t+1} + \bar{Σ}_{t+1} C_{t+1}^T (C_{t+1} \bar{Σ}_{t+1} C_{t+1}^T + R_{t+1})^{-1} (z_{t+1} - C_{t+1} \bar{µ}_{t+1})
Σ_{t+1} = \bar{Σ}_{t+1} - \bar{Σ}_{t+1} C_{t+1}^T (C_{t+1} \bar{Σ}_{t+1} C_{t+1}^T + R_{t+1})^{-1} C_{t+1} \bar{Σ}_{t+1}
Complete Kalman Filtering Algorithm
At time 0: x_0 ~ N(µ_0, Σ_0)
For t = 1, 2, …:
- Dynamics update: \bar{µ}_t = A_{t-1} µ_{t-1} + B_{t-1} u_{t-1};   \bar{Σ}_t = A_{t-1} Σ_{t-1} A_{t-1}^T + Q_{t-1}
- Measurement update: µ_t = \bar{µ}_t + K_t (z_t - C_t \bar{µ}_t);   Σ_t = (I - K_t C_t) \bar{Σ}_t
Often written with K_t = \bar{Σ}_t C_t^T (C_t \bar{Σ}_t C_t^T + R_t)^{-1} (the Kalman gain) and z_t - C_t \bar{µ}_t (the "innovation").
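Putting the two updates together, a compact numpy sketch of one full Kalman filter iteration (my own rendering of the algorithm above; the matrices are assumed time-invariant, as in the simulation sketch):

```python
import numpy as np

def kalman_step(mu, Sigma, u, z, A, B, C, Q, R):
    """One Kalman filter iteration: dynamics update followed by measurement update."""
    # Dynamics update (predict)
    mu_bar = A @ mu + B @ u
    Sigma_bar = A @ Sigma @ A.T + Q

    # Measurement update (correct)
    S = C @ Sigma_bar @ C.T + R                 # innovation covariance
    K = Sigma_bar @ C.T @ np.linalg.inv(S)      # Kalman gain
    innovation = z - C @ mu_bar
    mu_new = mu_bar + K @ innovation
    Sigma_new = (np.eye(mu.shape[0]) - K @ C) @ Sigma_bar
    return mu_new, Sigma_new

# Example usage with the matrices and data from the simulation sketch:
# mu, Sigma = np.zeros(2), np.eye(2)
# for u, z in zip(us, zs):
#     mu, Sigma = kalman_step(mu, Sigma, u, z, A, B, C, Q, R)
```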
Kalman Filter Summary
- Highly efficient: polynomial in measurement dimensionality k and state dimensionality n: O(k^2.376 + n^2)
- Optimal for linear Gaussian systems!
Outline: Gaussians; Kalman filtering; Extended Kalman Filter (EKF); Unscented Kalman Filter (UKF) [aka "sigma-point filter"]
Nonlinear Dynamical Systems
Most realistic robotic problems involve nonlinear functions:
x_{t+1} = f(x_t, u_t) + w_t,   z_t = h(x_t) + v_t
Versus the linear setting:
x_{t+1} = A_t x_t + B_t u_t + w_t,   z_t = C_t x_t + v_t
Linearity Assumption Revisited [figure: a Gaussian p(x) pushed through a linear function; the resulting p(y) is again Gaussian]
Non-linear Function [figure: a Gaussian p(x) pushed through a nonlinear function; the resulting p(y) is non-Gaussian. The "Gaussian of p(y)" has the mean and variance of y under p(y).]
EKF Linearization (1)
EKF Linearization (2) p(x) has HIGH variance relative to region in which linearization is accurate.
EKF Linearization (3) p(x) has LOW variance relative to region in which linearization is accurate.
EKF Linearization: First Order Taylor Series Expansion
Dynamics model: for x_t "close to" µ_t we have
  f(x_t, u_t) ≈ f(µ_t, u_t) + F_t (x_t - µ_t),   with F_t = ∂f/∂x evaluated at (µ_t, u_t)
Measurement model: for x_t "close to" µ_t we have
  h(x_t) ≈ h(µ_t) + H_t (x_t - µ_t),   with H_t = ∂h/∂x evaluated at µ_t
EKF Algorithm
At time 0: x_0 ~ N(µ_0, Σ_0)
For t = 1, 2, …:
- Dynamics update: \bar{µ}_t = f(µ_{t-1}, u_{t-1});   \bar{Σ}_t = F_{t-1} Σ_{t-1} F_{t-1}^T + Q_{t-1}, with F_{t-1} = ∂f/∂x at (µ_{t-1}, u_{t-1})
- Measurement update: H_t = ∂h/∂x at \bar{µ}_t;   K_t = \bar{Σ}_t H_t^T (H_t \bar{Σ}_t H_t^T + R_t)^{-1};   µ_t = \bar{µ}_t + K_t (z_t - h(\bar{µ}_t));   Σ_t = (I - K_t H_t) \bar{Σ}_t
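A numpy sketch of one EKF iteration (my own rendering of the algorithm above); the functions f, h and their Jacobians f_jac, h_jac are assumed to be supplied by the user:

```python
import numpy as np

def ekf_step(mu, Sigma, u, z, f, f_jac, h, h_jac, Q, R):
    """One EKF iteration for x_{t+1} = f(x_t, u_t) + w_t, z_t = h(x_t) + v_t."""
    # Dynamics update: propagate the mean through f, the covariance through the Jacobian
    F = f_jac(mu, u)
    mu_bar = f(mu, u)
    Sigma_bar = F @ Sigma @ F.T + Q

    # Measurement update: linearize h around the predicted mean
    H = h_jac(mu_bar)
    S = H @ Sigma_bar @ H.T + R
    K = Sigma_bar @ H.T @ np.linalg.inv(S)
    mu_new = mu_bar + K @ (z - h(mu_bar))
    Sigma_new = (np.eye(mu.shape[0]) - K @ H) @ Sigma_bar
    return mu_new, Sigma_new
```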
EKF Summary
- Highly efficient: polynomial in measurement dimensionality k and state dimensionality n: O(k^2.376 + n^2)
- Not optimal!
- Can diverge if nonlinearities are large!
- Works surprisingly well even when all assumptions are violated!
Outline: Gaussians; Kalman filtering; Extended Kalman Filter (EKF); Unscented Kalman Filter (UKF) [aka "sigma-point filter"]
Linearization via Unscented Transform [figure comparing the EKF and UKF estimates of a transformed Gaussian]
UKF Sigma-Point Estimate (2) [figure: EKF vs. UKF]
UKF Sigma-Point Estimate (3) [figure: EKF vs. UKF]
UKF Sigma-Point Estimate (4)
UKF intuition: why it can perform better [Julier and Uhlmann, 1997]
- Assume we know the distribution over X and it has a mean \bar{x}
- Y = f(X)
- EKF approximates f to first order and ignores higher-order terms
- UKF uses f exactly, but approximates p(x).
Original Unscented Transform [Julier and Uhlmann, 1997]
Picks a minimal set of sample points that match the 1st, 2nd and 3rd moments of a Gaussian:
  X_0 = \bar{x},   W_0 = κ / (n + κ)
  X_i = \bar{x} + (\sqrt{(n + κ) P_xx})_i,   W_i = 1 / (2(n + κ)),   i = 1, …, n
  X_{i+n} = \bar{x} - (\sqrt{(n + κ) P_xx})_i,   W_{i+n} = 1 / (2(n + κ)),   i = 1, …, n
- \bar{x} = mean, P_xx = covariance, subscript i → i'th column, x in R^n
- κ: extra degree of freedom to fine-tune the higher order moments of the approximation; when x is Gaussian, n + κ = 3 is a suggested heuristic
- L = \sqrt{P_xx} can be chosen to be any matrix satisfying L L^T = P_xx
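A numpy sketch of this transform (my own illustration of the construction above): generate the 2n+1 sigma points and push them through a nonlinear function f to estimate the mean and covariance of Y = f(X).

```python
import numpy as np

def unscented_transform(x_bar, P_xx, f, kappa=None):
    """Estimate mean/covariance of Y = f(X) for X ~ N(x_bar, P_xx) via sigma points."""
    n = x_bar.shape[0]
    if kappa is None:
        kappa = 3.0 - n                    # heuristic: n + kappa = 3 for Gaussian x
    # Any L with L @ L.T = (n + kappa) * P_xx works; Cholesky is a common choice
    L = np.linalg.cholesky((n + kappa) * P_xx)

    # 2n + 1 sigma points and their weights
    sigma_points = [x_bar] + [x_bar + L[:, i] for i in range(n)] \
                           + [x_bar - L[:, i] for i in range(n)]
    weights = np.array([kappa / (n + kappa)] + [0.5 / (n + kappa)] * (2 * n))

    # Push sigma points through f, then take the weighted mean and covariance
    ys = np.array([f(p) for p in sigma_points])
    y_bar = weights @ ys
    diffs = ys - y_bar
    P_yy = (weights[:, None] * diffs).T @ diffs
    return y_bar, P_yy

# Example: an elementwise nonlinearity applied to a 2D Gaussian
# y_bar, P_yy = unscented_transform(np.array([0.0, 1.0]), np.eye(2), lambda x: np.sin(x))
```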
Unscented Kalman filter
- Dynamics update: can simply use the unscented transform and estimate the mean and covariance at the next time step from the sample points
- Observation update: use sigma points from the unscented transform to compute the covariance matrix between x_t and z_t; then do the standard update.
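A sketch of how these two updates could use sigma points, assuming additive noise (my own simplified illustration; the full algorithm, with an augmented state, is Table 3.4 in Probabilistic Robotics):

```python
import numpy as np

def ukf_step(mu, Sigma, u, z, f, h, Q, R, kappa=0.0):
    """One UKF iteration with additive process/measurement noise (simplified sketch)."""
    n = mu.shape[0]
    w = np.array([kappa / (n + kappa)] + [0.5 / (n + kappa)] * (2 * n))

    # Dynamics update: push sigma points of the current belief through f
    L = np.linalg.cholesky((n + kappa) * Sigma)
    pts = np.vstack([mu] + [mu + L[:, i] for i in range(n)] + [mu - L[:, i] for i in range(n)])
    pts_pred = np.array([f(p, u) for p in pts])
    mu_bar = w @ pts_pred
    Sigma_bar = (w[:, None] * (pts_pred - mu_bar)).T @ (pts_pred - mu_bar) + Q

    # Observation update: re-draw sigma points around the predicted belief, push through h
    Lb = np.linalg.cholesky((n + kappa) * Sigma_bar)
    pts_b = np.vstack([mu_bar] + [mu_bar + Lb[:, i] for i in range(n)] + [mu_bar - Lb[:, i] for i in range(n)])
    zs = np.array([h(p) for p in pts_b])
    z_hat = w @ zs
    S = (w[:, None] * (zs - z_hat)).T @ (zs - z_hat) + R          # innovation covariance
    Sigma_xz = (w[:, None] * (pts_b - mu_bar)).T @ (zs - z_hat)   # cross-covariance of x and z
    K = Sigma_xz @ np.linalg.inv(S)
    return mu_bar + K @ (z - z_hat), Sigma_bar - K @ S @ K.T
```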
[Table 3.4 in Probabilistic Robotics]
UKF Summary
- Highly efficient: same complexity as EKF, with a constant factor slower in typical practical applications
- Better linearization than EKF: accurate in the first two terms of the Taylor expansion (EKF only the first term) + captures more aspects of the higher order terms
- Derivative-free: no Jacobians needed
- Still not optimal!
Forthcoming
- How to estimate A_t, B_t, C_t, Q_t, R_t from data (z_{0:T}, u_{0:T}): EM algorithm
- How to compute P(x_t | z_{0:T}, u_{0:T}) (= smoothing) (note the capital "T")
Things to be aware of (but we won't cover)
- Square-root Kalman filter: keeps track of square roots of the covariance matrices; equally fast, numerically more stable (bit more complicated conceptually)
- Very large systems with sparsity structure: Sparse Information Filter
- Very large systems with low-rank structure: Ensemble Kalman Filter
- Kalman filtering over SE(3)
- How to estimate A_t, B_t, C_t, Q_t, R_t from data (z_{0:T}, u_{0:T}): EM algorithm
- How to compute P(x_t | z_{0:T}, u_{0:T}) (= smoothing) (note the capital "T")
Things to be aware of (but we won't cover)
- If A_t = A, Q_t = Q, C_t = C, R_t = R (time-invariant system): if the system is "observable" then the covariances and the Kalman gain will converge to steady-state values as t → ∞
- Can take advantage of this: pre-compute them, and only track the mean, which is done by multiplying the Kalman gain with the "innovation"
- A system is observable if and only if the following holds true: if there were zero noise, you could determine the initial state after a finite number of time steps
- Observable if and only if: rank([C; CA; CA^2; CA^3; …; CA^{n-1}]) = n
- Typically if a system is not observable you will want to add a sensor to make it observable
- The Kalman filter can also be derived as the (recursively computed) least-squares solution to a (growing) set of linear equations
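To make the steady-state remark concrete, a small numpy sketch (my own illustration) that iterates the covariance recursion of a time-invariant filter until it converges and reads off the steady-state Kalman gain; convergence assumes the system is observable:

```python
import numpy as np

def steady_state_gain(A, C, Q, R, tol=1e-9, max_iter=10_000):
    """Iterate the Kalman filter covariance recursion for a time-invariant system
    until the predicted covariance converges, then return the steady-state gain."""
    n = A.shape[0]
    Sigma_bar = np.eye(n)                          # any PSD initialization works
    for _ in range(max_iter):
        S = C @ Sigma_bar @ C.T + R
        K = Sigma_bar @ C.T @ np.linalg.inv(S)     # gain based on predicted covariance
        Sigma = (np.eye(n) - K @ C) @ Sigma_bar
        Sigma_bar_next = A @ Sigma @ A.T + Q
        if np.max(np.abs(Sigma_bar_next - Sigma_bar)) < tol:
            return K
        Sigma_bar = Sigma_bar_next
    return K
```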
Kalman filter property
- If the system is observable (= dual of controllable!) then the Kalman filter will converge to the true state.
- A system is observable if and only if: O = [C; CA; CA^2; …; CA^{n-1}] is full column rank   (1)
- Intuition: if there is no noise, we observe y_0, y_1, … and the unknown initial state x_0 satisfies:
  y_0 = C x_0,   y_1 = CA x_0,   …,   y_K = CA^K x_0
  This system of equations has a unique solution x_0 iff the matrix [C; CA; …; CA^K] has full column rank. Because any power of a matrix higher than n can be written in terms of lower powers of the same matrix (Cayley-Hamilton), condition (1) is sufficient to check (i.e., the column rank will not grow any more after having reached K = n-1).
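A short numpy check of condition (1) (my own illustration): stack C, CA, …, CA^{n-1} and test the column rank.

```python
import numpy as np

def is_observable(A, C):
    """Check rank([C; CA; ...; CA^{n-1}]) == n for the pair (A, C)."""
    n = A.shape[0]
    blocks = [C @ np.linalg.matrix_power(A, k) for k in range(n)]
    O = np.vstack(blocks)
    return np.linalg.matrix_rank(O) == n

# Example: the (A, C) pair from the earlier simulation sketch is observable
# print(is_observable(np.array([[1., 1.], [0., 1.]]), np.array([[1., 0.]])))
```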