Phase synchronization An example of global optimality on manifolds Nicolas Boumal, Inria & ENS Paris Joint work with Afonso Bandeira and Amit Singer One-day workshop on Riemannian and nonsmooth optimization Sept. 25, 2015
Note for the reader A tighter version of the theorems in these slides will appear in an update to the arxiv paper 1411.3272 (version 1 from Nov. 2014 is less tight). The version in these slides was chosen as it is relatively simple to derive in the allotted time. In particular, the bound on π¨ β π¦ 2 can be reduced to π(π) and the tightness rate can be improved to π β€ π β π 1/4 as a result. Nicolas, September 28, 2015.
The goal: estimate individual orientations, from pairwise comparisons (up to global shift)
The goal: estimate individual orientations, from pairwise comparisons (up to global shift)
Estimate phases from relative info Unknowns π¨ 1 , β¦ , π¨ π β β , with π¨ 1 = β― = π¨ π = 1 π β 1 Data Noisy measurements of relative phases: β + ππ π· ππ = π¨ π π¨ ππ π
What does additive Gaussian noise mean here? Noise affects the phase of the measurement β + ππ π· ππ = π¨ π π¨ ππ π
Maximum likelihood estimation π· = π¨π¨ β + ππ π , π¨ β β 1 Under Gaussian noise, the MLE solves a least-squares 2 π· β π¦π¦ β π π¦ β π·π¦ min β‘ max F π π¦ββ 1 π¦ββ 1
β 2 ball of radius 12π π π¦ (the MLE) π¨ (the signal) level sets of π¦ β π·π¦
Lemma: The MLE π¦ is close to the signal π¨ The MLE is more likely than the signal: π¨ β π·π¨ β€ π¦ β π·π¦ . Hence, with π· = π¨π¨ β + ππ and π op β€ 3 π , π 2 + ππ¨ β ππ¨ β€ π¨ β π¦ 2 + ππ¦ β ππ¦ π 2 β π¨ β π¦ 2 β€ π π¦ β ππ¦ β π¨ β ππ¨ β€ π β 2π π op β€ 6ππ 3/2 Divide by π + π¨ β π¦ β₯ π . Implies: 2 = 2(π β |π¨ β π¦|) β€ 12π π π¨ β π¦π ππ min 2 π
The MLE is NP-hard to compute The problem π π¦ β π·π¦ max π¦ββ 1 has a quadratic cost π¦ β π·π¦ , and nonconvex quadratic constraints π¦ π 2 = 1 .
Surprisingly, global optimality can be tested (sometimes) π , consider Given π¦ β β 1 π = π π¦ = β ddiag π·π¦π¦ β β π· If π is positive semidefinite, then π¦ is optimal: π , 0 β€ π§ β ππ§ = π¦ β π·π¦ β π§ β π·π§ βπ§ β β 1
Optimization on manifolds finds a certified optimum quite often Donβt know Noise Level π Certified optimum: π β½ 0 Number of phases π
How is that possible? The problem is convex in a lifted space
Classic lifting trick: rewrite everything in terms of π = π¦π¦ β π π¦ β π·π¦ max π¦ββ 1 The cost π¦ β π·π¦ = Trace π¦ β π·π¦ = Trace(π·π) The constraints β = 1 β π ππ = 1 π¦ π 2 = 1 β π¦ π π¦ π The knot βπ¦: π = π¦π¦ β β π β½ 0, rank π = 1
Suggests a semidefinite relaxation Recast the QCQP π π¦ β π·π¦ max π¦ββ 1 Into πββ πΓπ Trace(π·π) max diag π = π π β½ 0 rank π = 1
Suggests a semidefinite relaxation Relax the QCQP π π¦ β π·π¦ max π¦ββ 1 Into the SDP πββ πΓπ Trace(π·π) max diag π = π π β½ 0 rank π = 1
The SDP seems tight roughly when Riemannian optimization succeeds Not tight Noise Level π Tight: SDP solution has rank 1 Number of phases π
We prove the SDP has a unique solution of rank 1 This is with high probability for large π , And if π β€ π β π 1/6 (empirically, π(π 1/2 ) ok).
General idea: dual certification πββ πΓπ Trace(π·π) max Lemma: diag π = π π β½ 0 π solves the SDP if and only if π π = β ddiag π·π β π· β½ 0 Proof via KKT conditions. Thus, the certificate works iff the SDP is tight.
General idea: dual certification Let π¦ be the MLE, solution of π π¦ β π·π¦ max π¦ββ 1 We aim to prove that π π¦π¦ β β½ 0 . Challenge : we donβt know π¦ .
Step 1: characterize the MLE π¦ π¦ is second-order critical, close to π¨ . π¦ (the MLE) π¨ (the signal) level sets of π¦ β π·π¦
Step 1: characterize the MLE π¦ The MLE problem lives on a smooth manifold π π¦ β π·π¦ max π¦ββ 1 π¦ is critical simply if its projected gradient vanishes: π¦ is critical β ππ¦ = 0 π = π π¦ = β ddiag π·π¦π¦ β β π·
Step 1: characterize the MLE π¦ π = π π¦ = β ddiag π·π¦π¦ β β π· π¦ is critical β ππ¦ = 0 β diag π·π¦π¦ β is real If π¦ is also second-order critical, then π is positive semidefinite on the tangent space . π¦ is second-order critical β diag π·π¦π¦ β β₯ π
Step 2: certify Assuming π¦ is second-order critical and close to π¨ , show that π = ddiag π·π¦π¦ β β π· β½ 0. For all π£ β β π with π£ β π¦ = 0 , seven lines give: 2 π β π 21 π + ππ¦ β π£ β ππ£ β₯ π£ 2 (Used π op β€ 3 π again.)
Step 2: certify Sufficient condition for tightness of the SDP: π β₯ π 21 π + ππ¦ β Note: π¦ and π are not independent. Suboptimal argument (whp): ππ¦ β β€ ππ¨ β + π π¦ β π¨ β β€ π log π + π op π¦ β π¨ 2 π log π + 3 12ππ 3/2 β€
Theorem (ArXiv 1411.3272) With high probability for large (finite) π , If π β€ π β π 1/6 , Then a second-order critical point π¦ close to π¨ is optimal, with certificate π , And π¦π¦ β is the unique solution of the SDP.
Recommend
More recommend