Projection methods: convergence and counterexamples 4 January 2019 - PowerPoint PPT Presentation

Projection methods: convergence and counterexamples 4 January 2019 Hangzhou Dianzi University Vera Roshchina School of Mathematics and Statistics UNSW Sydney v.roshchina@unsw.edu.au Based on joint work with Hong-Kun Xu , Roberto Cominetti and Andrew Williamson .

The method of alternating projections C 2 C 1

The method of alternating projections Let H be a Hilbert space, with inner product �· , ·� and norm � · � . For any closed convex set C ⊆ H and any x ∈ H there exists a unique point P C ( x ) ∈ C such that � x − P C ( x ) � = inf y ∈ C � x − y � . Given two closed convex sets C 1 , C 2 ⊆ H and x 0 ∈ H , let x 1 = P C 1 ( x 0 ) , x 2 = P C 2 ( x 1 ) , x 3 = P C 1 ( x 2 ) , x 4 = P C 2 ( x 3 ) , . . . . . . x 2 k +1 = P C 1 x 2 k , x 2 k +2 = P C 2 x 2 k +1 , . . . . . .

Convergence Let M 1 and M 2 be closed affine subspaces of H , M = M 1 ∩ M 2 . Theorem 1 (von Neumann 1933) . For each x ∈ H n →∞ � ( P M 2 P M 1 ) n ( x ) − P M ( x ) � = 0 . lim von Neumann, Functional Operators-Vol. II. The Geometry of Orthogonal Spaces, Annals of Math. Studies, 1950 (reprint of 1933 lectures). Theorem 2 (Bregman 1965) . For C = C 1 ∩ C 2 � = ∅ , where C 1 , C 2 ⊆ H are closed convex sets, the sequence of alternating projections converges weakly to a point in C . Bregman, The method of successive projection for finding a common point of convex sets, Sov. Math. Dokl., 1965. The question of whether convergence is always strong remained open until 2004, despite many works on sufficient conditions.

Counterexample of Hundal Theorem 3 (Hundal 2004) . There exist a Hilbert space H , closed convex sets C 1 , C 2 ⊂ H with intersection C 1 ∩ C 2 = { 0 } and a starting point x 0 such that n →∞ � ( P C 2 P C 1 ) n ( x 0 ) � > 0 . lim In a separable Hilbert space with an orthonormal basis { e i } ∞ i =1 , let C 1 = { x | � x, e 1 � ≤ 0 } , C 2 = cone { p ( t ) | t ≥ 0 } , p ( t ) = e ⌊ t ⌋ +2 cos( f ( t )) + e ⌊ t ⌋ +3 sin( f ( t )) + e 1 h ( t ) , t ≥ 0 , f ( t ) = π h ( t ) = e − 100 t 3 2( t − ⌊ t ⌋ ) , Hundal, An alternating projection that does not converge in norm. Nonlinear Anal. 2004.

Rate of convergence x 0 x 0 x 0 x 0

Angles between subspaces The Friedrichs angle between two closed linear subspaces M 1 and M 2 is α ∈ [0 , π 2 ] such that ( B H is a unit ball, M = M 1 ∩ M 2 ) c = cos α = sup |� x, y �| . x ∈ M 1 ∩ M ⊥ ∩ B H y ∈ M 2 ∩ M ⊥ ∩ B H Theorem 4 (Aronszajn, 1950) . For each x ∈ H and n ≥ 1 � ( P M 2 P M 1 ) n ( x ) − P M ( x ) � ≤ c 2 n − 1 � x � . We have c < 1 iff M 1 + M 2 is closed; in this case the method of alternating projections converges linearly. Aronszajn, Theory of reproducing kernels, Trans. Amer. Math. Soc., 1950. The constant is the smallest possible Kayalar and Weinert, Error bounds for the method of alternating projections, Math. Control Signals Systems, 1988. Generalisations to several sets Reich and Zalas, The optimal error bound for the method of simultaneous projections, J. Approx. Theory, 2017

What if c = 1? Theorem 5 (Bauschke, Borwein and Lewis) . For two closed affine subspaces M 1 , M 2 ∈ H exactly one of the alternatives holds. (1) M 1 + M 2 is closed. Then for each x the alternating projections converge linearly to P M 1 ∩ M 2 ( x ) with a rate c 2 . (2) M 1 + M 2 is not closed. Then for any sequence of positive real numbers 1 > λ 1 ≥ λ 2 ≥ · · · ≥ λ n → 0 there exists a point x λ ∈ H such that � ( P M 2 P M 1 ) n ( x λ ) − P M ( x λ ) � ≥ λ n ∀ n ∈ N . Bauschke, Borwein, and Lewis, The method of cyclic projections for closed convex sets in Hilbert space, Contemporary Mathematics, 1997. Bauschke, Deutsch, Hundal, Characterizing arbitrarily slow convergence in the method of alternating projections. Int. Trans. Oper. Res., 2009.

Special properties and convergence Regularity and the existence of Slater points Gubin, Polyak, Raik, The method of projections for finding the common point of convex sets, USSR Comput. Math. Math. Phys., 1967. Symmetry Bruck, Reich, Nonexpansive projections and resolvents of accretive operators in Banach spaces, Houston J. Math., 1977. Reich, A limit theorem for projections, Linear and Multilinear Algebra, 1983. Semialgebraic structure Borwein, Li, Yao, Analysis of the convergence rate for the cyclic projection algorithm applied to basic semialgebraic convex sets. SIAM J. Optim. 24, 498–527 (2014) Drusvyatskiy, Li, Wolkowicz, A note on alternating projections for ill-posed semidef- inite feasibility problems. Math. Program. 162 (2017), 537–548.

What if the problem is infeasible? Assume that C 1 , C 2 ∈ H are convex and closed, but possibly C 1 ∩ C 2 = ∅ . Define the distance between C 1 and C 2 as � y − x � . dist( C 1 , C 2 ) = inf x ∈ C 1 y ∈ C 2 The following sets may be empty, P 1 = { x ∈ C 1 | dist( x, C 2 ) = dist( C 1 , C 2 ) } , P 2 = { y ∈ C 2 | dist( y, C 1 ) = dist( C 1 , C 2 ) } . C 2 C 2 P 2 v v P 1 C 1 C 1

The displacement vector and convergence Define the displacement vector v = P C 2 − C 1 (0) , where C 2 − C 1 is the Minkowski difference, C 2 − C 1 = { y − x, x ∈ C 1 , y ∈ C 2 } . For the alternating projections we have x 2 k − x 2 k +1 → v, x 2 k +2 − x 2 k +1 → v. If P 1 and P 2 are empty, then � x n � → ∞ . Otherwise x 2 k +1 ⇀ ¯ x ∈ P 1 , x 2 k ⇀ ¯ y ∈ P 2 , and ¯ y − ¯ x = v . Bauschke, Borwein, On the Convergence of yon Neumann’s Alternating Projec- tion Algorithm for Two Sets, Set-Valued Analysis, 1993.

A helpful illustration C 2 C 1

What about more than two sets? For m ≥ 2 sets we can generalise alternating projections starting from x 0 ∈ H , and projecting cyclically onto each of the sets. For three sets C 1 , C 2 , C 3 , x 1 = P C 1 ( x 0 ) , x 2 = P C 2 ( x 1 ) , x 3 = P C 3 ( x 2 ) , x 4 = P C 1 ( x 3 ) , · · · u 0 u 1 u 2 C 2 u 5 C 1 u 4 u 3 u 6 C 3

There is no variational characterisation Under mild assumptions (e.g. one of the sets is bounded) cyclic projections converge weakly either to a point in the intersection C 1 ∩ C 2 ∩ · · · ∩ C m or to a fixed cycle if the intersection is empty. Bruck, Reich, Nonexpansive projections and resolvents of accretive operators in Banach spaces. Houston J. Math., 1977. Recall that for two sets this cycle realises the distance between the sets; however, for m ≥ 3 there is no function Φ : H m → R such that for any collection of compact convex sets C 1 , C 2 , . . . , C m ⊂ H the limit cycles are precisely the solutions to the minimisation problem min Φ( x 1 , x 2 , . . . , x m ) . x i ∈ C i Baillon, Combettes, Cominetti, There is no variational characterization of the cycles in the method of periodic projections. J. Funct. Anal., 2012.

Under-relaxed projections Fix α ∈ (0 , 1] and instead of P C ( x ) consider R ( x ) = (1 − α ) x + αP C ( x ) . true u projection C under-relaxed projection This leads to under-relaxed alternating and cyclic projections.

Under-relaxed projections C 2 C 1 C 3 Iterations for α = 0 . 75 and α = 0 . 35 (shown in red).

Two special limits Fix α ∈ (0 , 1] and instead of P C ( x ) consider R ( x ) = (1 − α ) x + α ( P C ( x ) − x ) . The under-relaxed cyclic projections converge weakly to a fixed cycle iff such a cycle exists (e.g. when one of the sets is bounded). Bruck, Reich, Nonexpansive projections and resolvents of accretive operators in Banach spaces. Houston J. Math., 1977. Consider the limit of such α -cycles as α ↓ 0 , or alternatively vary α , letting α k ↓ 0 , � k ∈ N α k = + ∞ .

De Pierro’s conjecture Conjecture 1. The least squares solution m � x − x i � 2 � S = Arg min min x i ∈ C i x ∈H i =1 exists iff both limits exist and solve this least squares problem. De Pierro, From parallel to sequential projection methods and vice versa in convex feasibility: results and conjectures, Stud. Comput. Math., 2001. The conjecture is true for affine subspaces of R n , Censor, Eggermont, Gordon, Strong underrelaxation in Kaczmarz’s method for in- consistent systems. Numer. Math., 1983. closed affine subspaces satisfying a metric regularity condition, Bauschke, Edwards, A conjecture by De Pierro is true for translates of regular subspaces, J. Nonlinear Convex Anal., 2005. and sets satisfying a certain geometric condition. Baillon, Combettes, Cominetti, Asymptotic behavior of compositions of under- relaxed nonexpansive operators, J. Dyn. Games, 2014.

A misleading example C 1 = co { ( − 2 , 2 , 1) , ( − 2 , 2 , − 1) } , C 2 = co { (2 , 2 , 1) , (2 , 2 , − 1) } , C 3 = { ( x, y, z ) | x 2 + y 2 ≤ 1 , | z | ≤ 1 } , �� 0 , 5 � � S = : | z | ≤ 1 3 , z . C 3 C 3 u 0 z 0 =0.5 S C 1 u 0 z 0 =-0.5 C 1 S C 2 C 2 Under-relaxed projections for α = 0 . 5 and different starting points.

Counterexample C 1 = co { ( − 2 , 2 , 1) , ( − 2 , 2 , − 1) } , C 2 = co { (2 , 2 , 1) , (2 , 2 , − 1) } , p k = (cos t k , sin t k , ( − 1) k ) . C 3 = co { p k | k ∈ N } , Here { t k } is increasing, t 1 = π 4 and t k → π 2 as k → ∞ . p 2 p 4 C 1 C 3 p 3 p 1 C 2

Counterexample For this three-set system the limits described earlier do not exist, however, the least-squares problem has a solution. p 2 p 4 C 1 C 3 p 3 p 1 C 2 Cominetti, Roshchina, Williamson, A counterexample to De Pierro’s conjecture on the convergence of under-relaxed cyclic projections, Optimization, 2018.

Projection methods: convergence and counterexamples 4 January 2019 - PowerPoint PPT Presentation

Projection methods: convergence and counterexamples 4 January 2019 Hangzhou Dianzi University Vera Roshchina School of Mathematics and Statistics UNSW Sydney v.roshchina@unsw.edu.au Based on joint work with Hong-Kun Xu , Roberto Cominetti

A Comparison of Emission Projection Methods for NOx and SO2 Emissions From Electricity Generating

Comparison of Projection Methods TU Berlin derived from Deflation, Domain Deflation Comparison

An Implementation and Analysis of the Refined Projection Method For (Jacobi-)Davidson Type Methods

Projection Methods for Generalized Eigenvalue Problems Christoph Conrads

Scaling the Hierarchical Topic Modeling Mountain Neural NMF and Iterative Projection Methods

Scaled gradient projection methods in image deblurring and denoising Mario Bertero 1 Patrizia

Projection Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU)

APPLIED MACHINE LEARNING Methods for Reduction of Dimensionality through Linear Projection

Radial Projection Techniques InfoVis SS2020 G4 12 05 2020 Radial Projection Basics Also

Robust PCA Yingjun Wu Preliminary: vector projection Scalar projection of a onto b: a1 could be

Comparing Temporal Smoothers for use in Demographic Estimation and Projection Monica Alexander

Improved Bounds on the Dot Product under Random Projection and Random Sign Projection Ata Kab

Computer Graphics CS 543 Lecture 6 (Part 2) Projection (Part I) Prof Emmanuel Agu Computer

ECS 231 Subspace projection methods for LS 1 / 38 Part I. Basics The landscape of solvers for

University of Cambridge Engineering Part IIB Module 4F12: Computer Vision Handout 3: Projection

VIDEO SIGNALS VIDEO SIGNALS Corners and Shapes PROJECTION OF VECTORS PROJECTION OF VECTORS

Principal Components Analysis Sargur Srihari University at Buffalo 1 Topics Projection

Stochastic Filtering by Projection The Example of the Quadratic Sensor John Armstrong (Kings

CS-184: Computer Graphics Lecture #8: Projection Prof. James OBrien University of

Probabilistic Projection of Subnational Total Fertility Rates Hana Sev c kov a,

DT and 3D CH Theorem: Let P ={ p 1 ,, p n } with p i =( a i , b i ,0). Let p i =( a i , b i ,

Nonlinear Methods Data often lies on or near a nonlinear low-dimensional curve aka manifold. 27

Projection (Part 2) : Derivation Created by Dr. Slim BECHIKH for SPSU course - CS4363 Computer

Time Projection Chamber Principles of operation and the ALICE example Max Lamparth, 18 th November