Preconditioning Weighted Toeplitz Least Squares Problems
  1. Preconditioning Weighted Toeplitz Least Squares Problems Structured Numerical Linear Algebra Problems: Algorithms and Applications Cortona, Italy, September 19-24, 2004 Michele Benzi Emory University Atlanta, GA Thanks to: • NSF (MPS/Computational Mathematics) • M. Ng (Hong Kong) • G. Golub (Stanford), V. Simoncini (Bologna)

  2. Outline • The basic problem • An example: nonlinear image restoration • Equivalent formulations • Preconditioned Krylov methods • Constraint preconditioning • HSS preconditioning • Numerical examples • Conclusions Note: A technical report will soon be made available at http://www.mathcs.emory.edu/~benzi.

  3. Basic Problem Weighted regularized Toeplitz least squares problem: \min_x \| Ax - b \|_2^2, where A = \begin{bmatrix} DK \\ \mu L \end{bmatrix} and b = \begin{bmatrix} Df \\ 0 \end{bmatrix}. • K is m × n, Toeplitz or BTTB, m ≥ n • D is m × m, diagonal, nonnegative definite • f is m × 1, given • µ > 0 is a regularization parameter • L is n × n, a smoothing operator (here L = I_n) • We further assume that m and n are large
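
The stacked formulation on slide 3 can be sketched directly in NumPy. This is a minimal illustration with small random data (the sizes, the random entries, and the use of a dense solver are assumptions for demonstration only; the talk assumes m and n large):

```python
import numpy as np
from scipy.linalg import toeplitz

# Hypothetical small sizes for illustration; the talk assumes m, n large.
rng = np.random.default_rng(0)
m, n, mu = 8, 5, 0.1

K = toeplitz(rng.standard_normal(m), rng.standard_normal(n))  # m x n Toeplitz
d = rng.uniform(0.1, 2.0, size=m)                             # diagonal of D
D = np.diag(d)
L = np.eye(n)                                                 # smoothing operator L = I_n
f = rng.standard_normal(m)

# Stacked least-squares formulation: min_x ||Ax - b||_2^2
A = np.vstack([D @ K, mu * L])
b = np.concatenate([D @ f, np.zeros(n)])
x, *_ = np.linalg.lstsq(A, b, rcond=None)
```

At the minimizer, the least-squares optimality condition A^T(Ax − b) = 0 holds, which is how such a sketch can be sanity-checked.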

  4. Motivation Such problems arise in various applications, including: • Nonlinear image restoration • Seismography • Acoustics • Linear prediction See Å. Björck, Numerical Methods for Least Squares Problems, SIAM, 1996. Problem: The weighting matrix D destroys the Toeplitz structure. Note that D can be very ill-conditioned ⇒ fast Toeplitz solvers do not apply! If D = I or is nearly constant, efficient solvers exist.

  5. Example: Nonlinear Image Restoration Nonlinear image restoration problem: \min_x \| f - s(Kx) \|_2 • f is the observed image • x is the original image (unknown) • K is the blurring operator (m × n, m ≥ n) • s : R^m → R^m is a (separable) nonlinear map

  6. Example: Nonlinear Image Restoration Nonlinear image restoration problem: \min_x \| f - s(Kx) \|_2 • f is the observed image • x is the original image (unknown) • K is the blurring operator (m × n, m ≥ n) • s : R^m → R^m is a (separable) nonlinear map Discrete ill-posed problem ⇒ Tikhonov regularization: \min_x \| f - s(Kx) \|_2^2 + \mu \| x \|_2^2

  7. Example: Nonlinear Image Restoration Regularized nonlinear least-squares: \min_x \| f - s(Kx) \|_2^2 + \mu \| x \|_2^2

  8. Example: Nonlinear Image Restoration Regularized nonlinear least-squares: \min_x \| f - s(Kx) \|_2^2 + \mu \| x \|_2^2 Gauss-Newton linearization ⇒ sequence of weighted linear LS problems of the form \min_x \| D(f - Kx) \|_2^2 + \mu \| x \|_2^2 with D = D^{(k)} diagonal, positive definite and f = f^{(k)}. Note: D = D^{(k)} is the Jacobian of s evaluated at the current Newton approximation.
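
One such linearization step can be sketched concretely. In this sketch the separable nonlinearity s(v) = tanh(v), the random data, and the correction-form subproblem in dx are illustrative assumptions, not taken from the talk; it shows how the diagonal weight matrix D^(k) arises as the Jacobian of s at the current iterate:

```python
import numpy as np

# One Gauss-Newton linearization step for min_x ||f - s(Kx)||^2 + mu*||x||^2,
# with a hypothetical separable nonlinearity s(v) = tanh(v), so s'(v) = 1 - tanh(v)^2.
rng = np.random.default_rng(7)
m, n, mu = 12, 6, 1e-2
K = rng.standard_normal((m, n))
f = np.tanh(K @ rng.standard_normal(n))     # synthetic observed data
x = rng.standard_normal(n)                  # current Newton approximation x^(k)

v = K @ x
d = 1.0 - np.tanh(v) ** 2                   # diagonal of D^(k): Jacobian of s at v
r = f - np.tanh(v)                          # current nonlinear residual
D = np.diag(d)

# Weighted linear LS subproblem for the correction dx:
#   min_dx ||r - D K dx||^2 + mu*||x + dx||^2, written here in stacked form
A_ls = np.vstack([D @ K, np.sqrt(mu) * np.eye(n)])
b_ls = np.concatenate([r, -np.sqrt(mu) * x])
dx, *_ = np.linalg.lstsq(A_ls, b_ls, rcond=None)

# The same step from the subproblem's normal equations
dx_ne = np.linalg.solve(K.T @ D @ D @ K + mu * np.eye(n),
                        K.T @ (d * r) - mu * x)
x_new = x + dx
```

The stacked solve and the normal-equations solve produce the same correction, which is the equivalence exploited in the next slides.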

  9. Equivalent formulations Normal Equations: The regularized weighted least squares problem is equivalent to (K^T D^2 K + \mu I) x = K^T D^2 f, (1) an n × n symmetric positive definite linear system. Note again that the presence of D destroys any structure the problem may have. Also note that D contributes to making (1) more ill-conditioned. Solving (1) is quite a challenge. Unless the entries of D are nearly constant, standard Toeplitz solvers and preconditioners will fail.
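
Since system (1) is symmetric positive definite and only matrix-vector products with K are needed, it can be attacked matrix-free with the conjugate gradient method. A sketch, with random placeholder data (a fast Toeplitz matvec could be substituted for the dense products with K):

```python
import numpy as np
from scipy.linalg import toeplitz
from scipy.sparse.linalg import LinearOperator, cg

rng = np.random.default_rng(1)
m, n, mu = 60, 40, 0.1
K = toeplitz(rng.standard_normal(m), rng.standard_normal(n))  # m x n Toeplitz
d2 = rng.uniform(0.1, 1.0, size=m)       # entries of D^2 (far from constant)
f = rng.standard_normal(m)

# Matrix-free application of Sigma = K^T D^2 K + mu*I: only matvecs with K
# are needed, so structured (FFT-based) Toeplitz matvecs could be used here.
def matvec(x):
    return K.T @ (d2 * (K @ x)) + mu * x

Sigma = LinearOperator((n, n), matvec=matvec, dtype=float)
rhs = K.T @ (d2 * f)
x, info = cg(Sigma, rhs, maxiter=2000)   # info == 0 signals convergence
```

The catch, as the slide notes, is that D makes Sigma ill-conditioned, so unpreconditioned CG can be slow; the rest of the talk is about finding a good P.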

  10. Augmented system formulations Another equivalent formulation is the following: \begin{bmatrix} D^{-2} & K \\ K^T & -\mu I \end{bmatrix} \begin{bmatrix} y \\ x \end{bmatrix} = \begin{bmatrix} f \\ 0 \end{bmatrix} (2) where the auxiliary variable y = D(f − Kx) represents a weighted residual. The (m + n) × (m + n) coefficient matrix in (2) is symmetric indefinite. This system is equivalent to \begin{bmatrix} D^{-2} & K \\ -K^T & \mu I \end{bmatrix} \begin{bmatrix} y \\ x \end{bmatrix} = \begin{bmatrix} f \\ 0 \end{bmatrix} (3) where the system matrix is now nonsymmetric positive definite: the eigenvalues have positive real part.
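
The equivalence between the augmented system (2) and the normal equations (1) is easy to verify numerically. A small dense sketch with random placeholder data:

```python
import numpy as np

rng = np.random.default_rng(2)
m, n, mu = 7, 4, 0.5
K = rng.standard_normal((m, n))
d = rng.uniform(0.5, 2.0, size=m)           # diagonal of D
f = rng.standard_normal(m)

# Symmetric indefinite augmented matrix of system (2)
A_aug = np.block([[np.diag(d ** -2.0), K],
                  [K.T, -mu * np.eye(n)]])
sol = np.linalg.solve(A_aug, np.concatenate([f, np.zeros(n)]))
y, x = sol[:m], sol[m:]

# The x-block also solves the normal equations (K^T D^2 K + mu I) x = K^T D^2 f
x_ne = np.linalg.solve(K.T @ np.diag(d ** 2) @ K + mu * np.eye(n),
                       K.T @ (d ** 2 * f))
```

Eliminating y from the first block row of (2) and substituting into the second recovers (1), which is exactly what the check confirms.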

  11. Augmented system formulations Letting W = D^{-2} for simplicity, the augmented matrix can be factored as follows: \begin{bmatrix} W & K \\ K^T & -\mu I \end{bmatrix} = \begin{bmatrix} I & O \\ K^T W^{-1} & I \end{bmatrix} \begin{bmatrix} W & O \\ O & -\Sigma \end{bmatrix} \begin{bmatrix} I & W^{-1} K \\ O & I \end{bmatrix} where \Sigma = \mu I + K^T W^{-1} K is the Schur complement. Note that \Sigma is precisely the coefficient matrix of the normal equations. By Sylvester's Law of Inertia, the augmented matrix has m positive and n negative eigenvalues.
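
The inertia claim and the Schur complement identity can both be checked numerically. A sketch with small random placeholder data:

```python
import numpy as np

rng = np.random.default_rng(3)
m, n, mu = 6, 3, 0.2
K = rng.standard_normal((m, n))
W = np.diag(rng.uniform(0.5, 2.0, size=m))  # W = D^-2, symmetric positive definite

# Augmented matrix [[W, K], [K^T, -mu I]] and its inertia
A_aug = np.block([[W, K], [K.T, -mu * np.eye(n)]])
eigs = np.linalg.eigvalsh(A_aug)
n_pos = int(np.sum(eigs > 0))
n_neg = int(np.sum(eigs < 0))
# Sylvester's law of inertia predicts m positive and n negative eigenvalues

# The negated Schur complement is the normal-equations matrix Sigma, which is SPD
Sigma = mu * np.eye(n) + K.T @ np.linalg.solve(W, K)
```

Because the block-diagonal middle factor has W (SPD, m eigenvalues > 0) and −Σ (negative definite, n eigenvalues < 0), the congruence gives exactly the (m, n) inertia stated on the slide.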

  12. Augmented system formulations The nonsymmetric augmented matrix can be split as \begin{bmatrix} W & K \\ -K^T & \mu I \end{bmatrix} = \begin{bmatrix} W & O \\ O & \mu I \end{bmatrix} + \begin{bmatrix} O & K \\ -K^T & O \end{bmatrix} Since the symmetric part of the matrix is positive definite, the eigenvalues all have positive real part. Further, we note that the matrix is J-symmetric, i.e., it is symmetric with respect to the indefinite inner product associated with the (m + n) × (m + n) matrix J = \begin{bmatrix} I_m & O \\ O & -I_n \end{bmatrix}.
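
This splitting into a symmetric positive definite part plus a skew-symmetric part, and the resulting positive real parts of the eigenvalues, can be verified directly. A sketch with random placeholder data:

```python
import numpy as np

rng = np.random.default_rng(4)
m, n, mu = 6, 3, 0.2
K = rng.standard_normal((m, n))
W = np.diag(rng.uniform(0.5, 2.0, size=m))  # W = D^-2

# Splitting: [[W, K], [-K^T, mu I]] = diag(W, mu I) + skew-symmetric part
A_ns = np.block([[W, K], [-K.T, mu * np.eye(n)]])
H = 0.5 * (A_ns + A_ns.T)                   # symmetric part: diag(W, mu I)
S = 0.5 * (A_ns - A_ns.T)                   # skew-symmetric part
eigs = np.linalg.eigvals(A_ns)
```

Since the field of values of A lies in the right half-plane whenever the symmetric part H is positive definite, every eigenvalue of A_ns has positive real part, as the slide states.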

  13. Preconditioned Krylov methods Augmented systems from weighted least squares problems belong to the class of saddle point problems. In recent years, many new methods have been proposed for solving saddle point systems. In most cases, these methods have been designed for large, sparse problems. In particular, many preconditioners have been proposed. The Toeplitz case has not received much attention. An exception is the paper X.-Q. Jin, A preconditioner for constrained and weighted least squares problems with Toeplitz structure , BIT 36 (1996), pp. 101–109 where circulant-type preconditioners are considered.

  14. Preconditioned Krylov methods Preconditioning: Find an invertible matrix P such that Krylov methods applied to the preconditioned system P^{-1} A x = P^{-1} b will converge rapidly.

  15. Preconditioned Krylov methods Preconditioning: Find an invertible matrix P such that Krylov methods applied to the preconditioned system P^{-1} A x = P^{-1} b will converge rapidly. Rapid convergence is often associated with a clustered spectrum of P^{-1} A. However, characterizing the rate of convergence in general is not an easy matter.

  16. Preconditioned Krylov methods Preconditioning: Find an invertible matrix P such that Krylov methods applied to the preconditioned system P^{-1} A x = P^{-1} b will converge rapidly. Rapid convergence is often associated with a clustered spectrum of P^{-1} A. However, characterizing the rate of convergence in general is not an easy matter. To be effective, a preconditioner must significantly reduce the total amount of work: • P must be easy to compute • Evaluating z = P^{-1} r must be cheap
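
The two requirements above ("easy to compute", "cheap to apply") can be illustrated with a deliberately simple toy: a diagonal preconditioner supplied to GMRES as an operator that applies z = P^{-1} r. The system matrix, data, and choice of P here are hypothetical placeholders, not from the talk:

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, gmres

rng = np.random.default_rng(5)
n = 50
A = np.eye(n) + 0.1 * rng.standard_normal((n, n))   # toy nonsymmetric system
b = rng.standard_normal(n)

# A deliberately simple preconditioner: P = diag(A), so z = P^{-1} r is a
# single vector division -- cheap to form and cheap to apply.
dinv = 1.0 / np.diag(A)
P_inv = LinearOperator((n, n), matvec=lambda r: dinv * r, dtype=float)

x, info = gmres(A, b, M=P_inv)               # info == 0 signals convergence
```

Note that the Krylov method never needs P itself, only the action of P^{-1} on a vector; this is what makes fast structured solvers usable as preconditioners.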

  17. Preconditioned Krylov methods Available Krylov methods include: 1. Symmetric A : • MINRES (Paige & Saunders, SINUM ‘76) • SQMR (Freund & Nachtigal, APNUM ‘95) • Preconditioner must be SPD for MINRES • Preconditioner can be symm. indefinite for SQMR

  18. Preconditioned Krylov methods Available Krylov methods include: 1. Symmetric A : • MINRES (Paige & Saunders, SINUM ‘76) • SQMR (Freund & Nachtigal, APNUM ‘95) • Preconditioner must be SPD for MINRES • Preconditioner can be symm. indefinite for SQMR 2. Nonsymmetric A : • GMRES (Saad & Schultz, SISSC ‘86) • Bi-CGSTAB (van der Vorst, SISSC ‘91) • Preconditioner can be anything

  19. Preconditioned Krylov methods Available Krylov methods include: 1. Symmetric A : • MINRES (Paige & Saunders, SINUM ‘76) • SQMR (Freund & Nachtigal, APNUM ‘95) • Preconditioner must be SPD for MINRES • Preconditioner can be symm. indefinite for SQMR 2. Nonsymmetric A : • GMRES (Saad & Schultz, SISSC ‘86) • Bi-CGSTAB (van der Vorst, SISSC ‘91) • Preconditioner can be anything Recent trend : Use GMRES or Bi-CGSTAB with a non- symmetric preconditioner, even when A is symmetric!

  20. Preconditioners for saddle point systems Options include: 1. Multigrid methods

  21. Preconditioners for saddle point systems Options include: 1. Multigrid methods 2. Schur complement-based methods • Block diagonal preconditioning • Block triangular preconditioning • Uzawa preconditioning

  22. Preconditioners for saddle point systems Options include: 1. Multigrid methods 2. Schur complement-based methods • Block diagonal preconditioning • Block triangular preconditioning • Uzawa preconditioning 3. Constraint preconditioning

  23. Preconditioners for saddle point systems Options include: 1. Multigrid methods 2. Schur complement-based methods • Block diagonal preconditioning • Block triangular preconditioning • Uzawa preconditioning 3. Constraint preconditioning 4. Hermitian/Skew-Hermitian splitting (HSS)

  24. Preconditioners for saddle point systems Options include: 1. Multigrid methods 2. Schur complement-based methods • Block diagonal preconditioning • Block triangular preconditioning • Uzawa preconditioning 3. Constraint preconditioning 4. Hermitian/Skew-Hermitian splitting (HSS) Here we examine methods of type 3 and 4 (methods of type 2 did not work).

  25. Constraint Preconditioning Consider the symmetric augmented matrix A = \begin{bmatrix} W & K \\ K^T & -\mu I \end{bmatrix} and the preconditioning matrix P = \begin{bmatrix} cI & K \\ K^T & -\mu I \end{bmatrix} where c is a constant. For example, c could be the average value of the entries in W, or c = 1. Note that linear systems of the form P z = r must be solved at each iteration. Because P has a BTTB structure, we can use fast methods to solve P z = r.
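
A small dense sketch of this constraint preconditioner, with random placeholder data: a dense factorization of P stands in for the fast BTTB solver the slide refers to, and c is taken as the average of W's diagonal entries.

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, gmres

rng = np.random.default_rng(6)
m, n, mu = 30, 20, 0.1
K = rng.standard_normal((m, n))
w = rng.uniform(0.5, 1.5, size=m)            # diagonal of W = D^-2

A = np.block([[np.diag(w), K], [K.T, -mu * np.eye(n)]])
c = w.mean()                                  # e.g. the average of W's entries
P = np.block([[c * np.eye(m), K], [K.T, -mu * np.eye(n)]])

# Dense solve stands in for the fast BTTB solver applied to P z = r
P_inv = LinearOperator((m + n, m + n),
                       matvec=lambda r: np.linalg.solve(P, r), dtype=float)
b = rng.standard_normal(m + n)
x, info = gmres(A, b, M=P_inv)
```

Because P and A share the same off-diagonal blocks and differ only in the (1,1) block, the preconditioned spectrum is tightly clustered when the entries of W are close to c, and GMRES converges in few iterations.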
