Preconditioning for Nonsymmetry and Time-dependence Andy Wathen - PowerPoint PPT Presentation

Preconditioning for Nonsymmetry and Time-dependence Andy Wathen Oxford University, UK joint work with Jen Pestana and Elle McDonald Jeju, Korea, 2015 – p.1/24

Iterative methods For self-adjoint problems/symmetric matrices, iterative methods of choice exist: conjugate gradients for SPD, MINRES otherwise but many possible methods for non-self-adjoint problems/nonsymmetric matrices: GMRES , BICGSTAB , QMR , IDR , . . . Jeju, Korea, 2015 – p.2/24

Iterative methods Arises because of convergence guarantees: • for symmetric matrices: descriptive convergence bounds ⇒ a priori estimates of iterations for acceptable convergence; good preconditioning ensures fast convergence. • for nonsymmetric matrices: by contrast, to date there are no generally applicable and descriptive convergence bounds even for GMRES ; for any of the other nonsymmetric methods without a minimisation property, convergence theory is extremely limited ⇒ no good a priori way to identify what are the desired qualities of a preconditioner A major theoretical difficulty, but heuristic ideas abound! Jeju, Korea, 2015 – p.3/24

The situation is more severe than this: Theorem ( Greenbaum, Ptak and Strakos, 1996 ) Given any set of eigenvalues and any monotonic convergence curve, then for any b there exists a matrix B having those eigenvalues and an initial guess x 0 such that GMRES for Bx = b with x 0 as starting vector will give that convergence curve. In fact more extreme negative results than this exist. Jeju, Korea, 2015 – p.4/24

One way to address such questions: look for (non-standard) inner products in which a problem might be self-adjoint Jeju, Korea, 2015 – p.5/24

One way to address such questions: look for (non-standard) inner products in which a problem might be self-adjoint such inner products exist for a real nonsymmetric matrix B if and only if B is diagonalizable and has real eigenvalues but preconditioners still would have to have self-adjointness in any relevant non-standard inner product!! Jeju, Korea, 2015 – p.5/24

Convection-diffusion equation: ∂u Ω ⊂ R 2 or R 3 ∂t + b · ∇ u − ǫ ∇ 2 u = f in Ω × (0 , T ] , u given on ∂ Ω u (x , 0) = u 0 (x) , • arises widely e.g. as a subproblem in Navier-Stokes • is non-self-adjoint ⇒ nonsymmetric discretization matrix ⇒ convergence of Krylov subspace methods not easily described so no mathematical idea how to precondition Jeju, Korea, 2015 – p.6/24

For steady convection-diffusion b · ∇ u − ǫ ∇ 2 u = f the nonsymmetric issue remains even in 1-dimension u ′ − ǫu ′′ = f (though iterative methods not so crucial here!) Jeju, Korea, 2015 – p.7/24

u ′ − ǫu ′′ = f is − exp( − x/ǫ ) u ′ � ′ = 1 � ǫ exp( − x/ǫ ) f so continuous problem is self-adjoint in a highly distorted inner product based on this integrating factor (given certain simple boundary conditions) Discretizations however have matrices which are not self-adjoint in any inner product for large enough h/ǫ Jeju, Korea, 2015 – p.8/24

Recently ( Pestana & W, 2015 ) we have made progress for real nonsymmetric Toeplitz (constant diagonal) matrices regardless of nonnormality with a very simple observation: If B is a real Toeplitz matrix then     a 0 a − 1 · · a 1 − n 0 0 · 0 1 a 1 a 0 a − 1 · · 0 · 0 1 0         · a 1 a 0 · · · 0 1 0 ·         · · · · a − 1 · · · · · a n − 1 · · a 1 a 0 1 0 · 0 0 � �� B Y is the real symmetric matrix   a 1 − n · · a − 1 a 0 · · a − 1 a 0 a 1     · · a 0 a 1 ·     a − 1 · · · · a 0 a 1 · · a n − 1 Jeju, Korea, 2015 – p.9/24

Thus MINRES can be robustly applied to BY — it is symmetric but generally indefinite — and its convergence will depend only on eigenvalues. BUT preconditioning? – needs to be symmetric and positive definite for MINRES Fortunately it is well known that Toeplitz matrices are well preconditioned by related circulant matrices, C ( Strang, 1986, Chan, 1988 ) which are diagonalised by an FFT in O ( n log n ) work: C = U ⋆ Λ U . For many Toeplitz matrices we have that the Strang or Optimal (Chan) circulant C satisfy C − 1 B = I + R + E where R is of small rank and E is of small norm ⇒ eigenvalues clustered around 1 except for a few outliers Jeju, Korea, 2015 – p.10/24

To ensure symmetric and positive definite preconditioner for BY just use the circulant matrix | C | = U ⋆ | Λ | U which is real symmetric and positive definite Theorem ( Pestana & W, 2015 ) | C | − 1 BY = J + R + E where J is real symmetric and orthogonal with eigenvalues ± 1 , R is of small rank and E is of small norm ⇒ guaranteed fast convergence because MINRES convergence only depends on eigenvalues which are clustered around ± 1 except for few outliers! Jeju, Korea, 2015 – p.11/24

Example   1 0 . 01 1 1 0 . 01     ... ... ... ∈ R n × n B =     1 1 0 . 01   1 1 n Condition number of B preconditioned MINRES iters 10 14 6 100 207 6 2.6 × 10 6 1000 6 Similar ideas apply for block Toeplitz matrices—higher dimensions—but theory not so good Jeju, Korea, 2015 – p.12/24

Time-dependent Convection-Diffusion ∂u Ω ⊂ R 2 or R 3 ∂t + b . ∇ u − ǫ ∇ 2 u = f in Ω × (0 , T ] , u given on ∂ Ω u (x , 0) = u 0 (x) , Finite elements in space ( x ), θ time stepping gives M u k − u k − 1 � � + K θ u k + (1 − θ )u k − 1 = f k τ M ∈ R n × n : SPD mass matrix (identity operator, but same sparsity as K ) K ∈ R n × n : nonsymmetric discrete steady convection-diffusion matrix Jeju, Korea, 2015 – p.13/24

Rearranging: � � � � M + τ θ K u k = M − τ (1 − θ ) K u k − 1 + τ f k , k = 1 , 2 , . . . , N Nτ = T Recall for unconditional stability: 1 2 ≤ θ ≤ 1 θ = 1 θ = 1 : backwards Euler, 2 : Crank-Nicolson else need τ = O ( h 2 ) : very small time steps for explicit method Jeju, Korea, 2015 – p.14/24

� � � � M + τ θ K u k = M − τ (1 − θ ) K u k − 1 + τ f k , k = 1 , 2 , . . . , N Standard solution method: Solve the N separate n × n nonsymmetric linear systems (sequentially) for k = 1 , 2 , . . . , N . Here we use a geometric multigrid due to Ramage specifically for convection diffusion ⇒ r = 5 V-cycles for solution of each linear system to a relative residual tolerence of 10 − 6 Hence if we (quite reasonably) regard 1 V-cycle as the main unit of work ⇒ Nr V-cycles sequentially for the overall solution Jeju, Korea, 2015 – p.15/24

Alternatively: Write all timesteps at one go (all-at-once method):   u 1 u 2   A  = r.h.s .   . .  u N where A is the matrix   M + τθK 0 0 0 − M + τ (1 − θ ) K M + τθK 0 0     ... ...  0 0  0 0 − M + τ (1 − θ ) K M + τθK and r.h.s. = [ M − τ (1 − θ ) K u 0 + τ f 1 , τ f 2 , . . . , τ f N ] T nonsymmetric! Jeju, Korea, 2015 – p.16/24

  M + τθK 0 0 0 − M + τ (1 − θ ) K M + τθK 0 0   A =   ... ... 0 0   0 0 − M + τ (1 − θ ) K M + τθK A ∈ R L × L , L = Nn We propose to solve this huge linear system (for the solution at all time steps) by GMRES (or BICGSTAB ) with block diagonal preconditioner   ( M + τθK ) MG 0 0 0 0 ( M + τθK ) MG 0 0   P =   ... ... 0 0   0 0 0 ( M + τθK ) MG where ( M + τθK ) MG is one GMG V-cycle exactly as above. Any other approximate nonsymmetric solver could be used. Jeju, Korea, 2015 – p.17/24

Theory: If we used   ( M + τθK ) 0 0 0 0 ( M + τθK ) 0 0   P exact =   ... ...  0 0  0 0 0 ( M + τθK ) as preconditioner (no multigrid approximation) then we would have   I 0 0 0 J I 0 0   P − 1 exact A =  ,   ... ... 0 0  0 0 J I J = ( M + τθK ) − 1 ( − M + τ (1 − θ ) K ) Jeju, Korea, 2015 – p.18/24

For   I 0 0 0 J I 0 0   P − 1 exact A =  ,   ... ...  0 0 0 0 J I the minimum polynomial is (1 − s ) N , so GMRES would terminate (in exact arithmetic) in N iterations We observe that ( M + τθK ) MG is close to ( M + τθK ) so that convergence to a tolerance much less than the discretization error is achieved in N iterations also with P as preconditioner. Jeju, Korea, 2015 – p.19/24

Thus: N V-cycles for each of N GMRES iterations—hence N 2 ( > Nr ) overall. but with N processors, solution with P is (embarrasingly) parallel—block diagonal ⇒ independent computation. Thus parallel effort is N < Nr (= sequential effort). Jeju, Korea, 2015 – p.20/24

Convection-diffusion 1 0.8 0.6 0.4 0.2 0 1 0.5 1 0 -0.5 0 -1 -1 Jeju, Korea, 2015 – p.21/24

Convection-diffusion 10 2 GMRES BiCGSTAB 10 1 10 0 10 -1 10 -2 Residual 10 -3 10 -4 10 -5 10 -6 10 -7 10 -8 0 10 20 30 40 50 60 70 80 90 Jeju, Korea, 2015 – p.22/24 Iteration

Convection-diffusion −5 2.5 x 10 λ ( ˆ P − 1 A BE ) 2 λ ( ˆ P − 1 A CN ) λ ( ˆ P − 1 A BDF ) 1.5 1 0.5 Im( λ ) 0 −0.5 −1 −1.5 −2 −2.5 0.95 0.96 0.97 0.98 0.99 1 1.01 Re( λ ) Jeju, Korea, 2015 – p.23/24

Preconditioning for Nonsymmetry and Time-dependence Andy Wathen - PowerPoint PPT Presentation

Preconditioning for Nonsymmetry and Time-dependence Andy Wathen Oxford University, UK joint work with Jen Pestana and Elle McDonald Jeju, Korea, 2015 p.1/24 Iterative methods For self-adjoint problems/symmetric matrices, iterative

Measuring Dependence and Conditional Dependence with Kernels Kenji Fukumizu The Institute of

Linear dependence and independence Linear dependence 1 Definition (linear (in)dependence) Let {

Preconditioning and nonlinear time solvers for the JOREK MHD code E. Franck, A. Lessig, M. H

Treating Tobacco Treating Tobacco Treating Tobacco Treating Tobacco Dependence and Providing

Control-dependence Analysis 2 Control-dependence Analysis 1. Introduction (motivation, overview)

More refined representations Control dependence graph Problem: control-flow edges in CFG

Numerical Upscaling and Preconditioning of Flows in Highly Heterogeneous Porous Media R. Lazarov,

Original motivation for nonlinear preconditioning A nonlinear system F ( u ) = 0 may be

What is Responsible for the Low-level What is Responsible for the Low-level Moist Preconditioning

GPSCP: A General-Purpose Support-Circuit Preconditioning Approach to Large Scale SPICE Accurate

Priority and Particle Physics: structure, dependence, and moderation in all things Kerry McKenzie

Dependence: Theory and Practice Introduction to loop dependence and loop transformation 1 The

Data Dependence in Data Dependence in Combining Classifiers Combining Classifiers Mohamed

Local Dependence and Persistence in Discrete Sliding Window Processes Ohad N. Feldheim Joint

Dependence in Games & Dependence Games Davide Grossi (ILLC, University of Amsterdam) Paolo

Chapter 3 Structural breaks for models with path dependence 2 Chapter 3 Path dependence (p.

QCrypt: Implemenng a Next-Generaon Quantum Key Disllaon Engine in Pracce P. Junod

Optimizing R VM: Interpreter-level Specialization and Vectorization Haichuan Wang 1 , Peng Wu 2 ,

in the Finite Memory Length Regime Joint work of Yu-Chih Huang , National Chiao Tung University,

Regularization Methods for System Identification Input Design Biqiang MU Academy of Mathematics

Shorter Linear Straight-Line Programs for MDS Matrices Yet another XOR Count Paper Thorsten Kranz

Lapin (an efficient authentication protocol based on Ring-LPN) Stefan Heyse, Eike Kiltz, Vadim

Module 2 2.1 Transportation Planning Process The transportation planning process has a lot of

Tracing While Loops 7 January 2019 OSU CSE 1 Consider... int i = 4, j = 1, n = 0; while (i >

Preconditioning for Nonsymmetry and Time-dependence Andy Wathen - PowerPoint PPT Presentation

Preconditioning for Nonsymmetry and Time-dependence Andy Wathen Oxford University, UK joint work with Jen Pestana and Elle McDonald Jeju, Korea, 2015 p.1/24 Iterative methods For self-adjoint problems/symmetric matrices, iterative

Measuring Dependence and Conditional Dependence with Kernels Kenji Fukumizu The Institute of

Linear dependence and independence Linear dependence 1 Definition (linear (in)dependence) Let {

Preconditioning and nonlinear time solvers for the JOREK MHD code E. Franck, A. Lessig, M. H

Treating Tobacco Treating Tobacco Treating Tobacco Treating Tobacco Dependence and Providing

Control-dependence Analysis 2 Control-dependence Analysis 1. Introduction (motivation, overview)

More refined representations Control dependence graph Problem: control-flow edges in CFG

Numerical Upscaling and Preconditioning of Flows in Highly Heterogeneous Porous Media R. Lazarov,

Original motivation for nonlinear preconditioning A nonlinear system F ( u ) = 0 may be

What is Responsible for the Low-level What is Responsible for the Low-level Moist Preconditioning

GPSCP: A General-Purpose Support-Circuit Preconditioning Approach to Large Scale SPICE Accurate

Priority and Particle Physics: structure, dependence, and moderation in all things Kerry McKenzie

Dependence: Theory and Practice Introduction to loop dependence and loop transformation 1 The

Data Dependence in Data Dependence in Combining Classifiers Combining Classifiers Mohamed

Local Dependence and Persistence in Discrete Sliding Window Processes Ohad N. Feldheim Joint

Dependence in Games &amp; Dependence Games Davide Grossi (ILLC, University of Amsterdam) Paolo

Chapter 3 Structural breaks for models with path dependence 2 Chapter 3 Path dependence (p.

QCrypt: Implemenng a Next-Generaon Quantum Key Disllaon Engine in Pracce P. Junod

Optimizing R VM: Interpreter-level Specialization and Vectorization Haichuan Wang 1 , Peng Wu 2 ,

in the Finite Memory Length Regime Joint work of Yu-Chih Huang , National Chiao Tung University,

Regularization Methods for System Identification Input Design Biqiang MU Academy of Mathematics

Shorter Linear Straight-Line Programs for MDS Matrices Yet another XOR Count Paper Thorsten Kranz

Lapin (an efficient authentication protocol based on Ring-LPN) Stefan Heyse, Eike Kiltz, Vadim

Module 2 2.1 Transportation Planning Process The transportation planning process has a lot of

Tracing While Loops 7 January 2019 OSU CSE 1 Consider... int i = 4, j = 1, n = 0; while (i &gt;

Dependence in Games & Dependence Games Davide Grossi (ILLC, University of Amsterdam) Paolo

Tracing While Loops 7 January 2019 OSU CSE 1 Consider... int i = 4, j = 1, n = 0; while (i >