phase synchronization
play

Phase synchronization An example of global optimality on manifolds - PowerPoint PPT Presentation

Phase synchronization An example of global optimality on manifolds Nicolas Boumal, Inria & ENS Paris Joint work with Afonso Bandeira and Amit Singer One-day workshop on Riemannian and nonsmooth optimization Sept. 25, 2015 Note for the


  1. Phase synchronization An example of global optimality on manifolds Nicolas Boumal, Inria & ENS Paris Joint work with Afonso Bandeira and Amit Singer One-day workshop on Riemannian and nonsmooth optimization Sept. 25, 2015

  2. Note for the reader A tighter version of the theorems in these slides will appear in an update to the arxiv paper 1411.3272 (version 1 from Nov. 2014 is less tight). The version in these slides was chosen as it is relatively simple to derive in the allotted time. In particular, the bound on 𝑨 βˆ’ 𝑦 2 can be reduced to 𝑃(𝜏) and the tightness rate can be improved to 𝜏 ≀ 𝑑 β‹… π‘œ 1/4 as a result. Nicolas, September 28, 2015.

  3. The goal: estimate individual orientations, from pairwise comparisons (up to global shift)

  4. The goal: estimate individual orientations, from pairwise comparisons (up to global shift)

  5. Estimate phases from relative info Unknowns 𝑨 1 , … , 𝑨 π‘œ ∈ β„‚ , with 𝑨 1 = β‹― = 𝑨 π‘œ = 1 π‘œ β„‚ 1 Data Noisy measurements of relative phases: βˆ— + πœπ‘‹ 𝐷 π‘—π‘˜ = 𝑨 𝑗 𝑨 π‘—π‘˜ π‘˜

  6. What does additive Gaussian noise mean here? Noise affects the phase of the measurement βˆ— + πœπ‘‹ 𝐷 π‘—π‘˜ = 𝑨 𝑗 𝑨 π‘—π‘˜ π‘˜

  7. Maximum likelihood estimation 𝐷 = 𝑨𝑨 βˆ— + πœπ‘‹ π‘œ , 𝑨 ∈ β„‚ 1 Under Gaussian noise, the MLE solves a least-squares 2 𝐷 βˆ’ 𝑦𝑦 βˆ— π‘œ 𝑦 βˆ— 𝐷𝑦 min ≑ max F π‘œ π‘¦βˆˆβ„‚ 1 π‘¦βˆˆβ„‚ 1

  8. β„“ 2 ball of radius 12𝜏 π‘œ 𝑦 (the MLE) 𝑨 (the signal) level sets of 𝑦 βˆ— 𝐷𝑦

  9. Lemma: The MLE 𝑦 is close to the signal 𝑨 The MLE is more likely than the signal: 𝑨 βˆ— 𝐷𝑨 ≀ 𝑦 βˆ— 𝐷𝑦 . Hence, with 𝐷 = 𝑨𝑨 βˆ— + πœπ‘‹ and 𝑋 op ≀ 3 π‘œ , π‘œ 2 + πœπ‘¨ βˆ— 𝑋𝑨 ≀ 𝑨 βˆ— 𝑦 2 + πœπ‘¦ βˆ— 𝑋𝑦 π‘œ 2 βˆ’ 𝑨 βˆ— 𝑦 2 ≀ 𝜏 𝑦 βˆ— 𝑋𝑦 βˆ’ 𝑨 βˆ— 𝑋𝑨 ≀ 𝜏 β‹… 2π‘œ 𝑋 op ≀ 6πœπ‘œ 3/2 Divide by π‘œ + 𝑨 βˆ— 𝑦 β‰₯ π‘œ . Implies: 2 = 2(π‘œ βˆ’ |𝑨 βˆ— 𝑦|) ≀ 12𝜏 π‘œ 𝑨 βˆ’ 𝑦𝑓 π‘—πœ„ min 2 πœ„

  10. The MLE is NP-hard to compute The problem π‘œ 𝑦 βˆ— 𝐷𝑦 max π‘¦βˆˆβ„‚ 1 has a quadratic cost 𝑦 βˆ— 𝐷𝑦 , and nonconvex quadratic constraints 𝑦 𝑗 2 = 1 .

  11. Surprisingly, global optimality can be tested (sometimes) π‘œ , consider Given 𝑦 ∈ β„‚ 1 𝑇 = 𝑇 𝑦 = β„œ ddiag 𝐷𝑦𝑦 βˆ— βˆ’ 𝐷 If 𝑇 is positive semidefinite, then 𝑦 is optimal: π‘œ , 0 ≀ 𝑧 βˆ— 𝑇𝑧 = 𝑦 βˆ— 𝐷𝑦 βˆ’ 𝑧 βˆ— 𝐷𝑧 βˆ€π‘§ ∈ β„‚ 1

  12. Optimization on manifolds finds a certified optimum quite often Don’t know Noise Level 𝜏 Certified optimum: 𝑇 ≽ 0 Number of phases π‘œ

  13. How is that possible? The problem is convex in a lifted space

  14. Classic lifting trick: rewrite everything in terms of π‘Œ = 𝑦𝑦 βˆ— π‘œ 𝑦 βˆ— 𝐷𝑦 max π‘¦βˆˆβ„‚ 1 The cost 𝑦 βˆ— 𝐷𝑦 = Trace 𝑦 βˆ— 𝐷𝑦 = Trace(π·π‘Œ) The constraints βˆ— = 1 ⇔ π‘Œ 𝑗𝑗 = 1 𝑦 𝑗 2 = 1 ⇔ 𝑦 𝑗 𝑦 𝑗 The knot βˆƒπ‘¦: π‘Œ = 𝑦𝑦 βˆ— ⇔ π‘Œ ≽ 0, rank π‘Œ = 1

  15. Suggests a semidefinite relaxation Recast the QCQP π‘œ 𝑦 βˆ— 𝐷𝑦 max π‘¦βˆˆβ„‚ 1 Into π‘Œβˆˆβ„‚ π‘œΓ—π‘œ Trace(π·π‘Œ) max diag π‘Œ = 𝟐 π‘Œ ≽ 0 rank π‘Œ = 1

  16. Suggests a semidefinite relaxation Relax the QCQP π‘œ 𝑦 βˆ— 𝐷𝑦 max π‘¦βˆˆβ„‚ 1 Into the SDP π‘Œβˆˆβ„‚ π‘œΓ—π‘œ Trace(π·π‘Œ) max diag π‘Œ = 𝟐 π‘Œ ≽ 0 rank π‘Œ = 1

  17. The SDP seems tight roughly when Riemannian optimization succeeds Not tight Noise Level 𝜏 Tight: SDP solution has rank 1 Number of phases π‘œ

  18. We prove the SDP has a unique solution of rank 1 This is with high probability for large π‘œ , And if 𝜏 ≀ 𝑑 β‹… π‘œ 1/6 (empirically, 𝑃(π‘œ 1/2 ) ok).

  19. General idea: dual certification π‘Œβˆˆβ„‚ π‘œΓ—π‘œ Trace(π·π‘Œ) max Lemma: diag π‘Œ = 𝟐 π‘Œ ≽ 0 π‘Œ solves the SDP if and only if 𝑇 π‘Œ = β„œ ddiag π·π‘Œ βˆ’ 𝐷 ≽ 0 Proof via KKT conditions. Thus, the certificate works iff the SDP is tight.

  20. General idea: dual certification Let 𝑦 be the MLE, solution of π‘œ 𝑦 βˆ— 𝐷𝑦 max π‘¦βˆˆβ„‚ 1 We aim to prove that 𝑇 𝑦𝑦 βˆ— ≽ 0 . Challenge : we don’t know 𝑦 .

  21. Step 1: characterize the MLE 𝑦 𝑦 is second-order critical, close to 𝑨 . 𝑦 (the MLE) 𝑨 (the signal) level sets of 𝑦 βˆ— 𝐷𝑦

  22. Step 1: characterize the MLE 𝑦 The MLE problem lives on a smooth manifold π‘œ 𝑦 βˆ— 𝐷𝑦 max π‘¦βˆˆβ„‚ 1 𝑦 is critical simply if its projected gradient vanishes: 𝑦 is critical ⇔ 𝑇𝑦 = 0 𝑇 = 𝑇 𝑦 = β„œ ddiag 𝐷𝑦𝑦 βˆ— βˆ’ 𝐷

  23. Step 1: characterize the MLE 𝑦 𝑇 = 𝑇 𝑦 = β„œ ddiag 𝐷𝑦𝑦 βˆ— βˆ’ 𝐷 𝑦 is critical ⇔ 𝑇𝑦 = 0 ⇔ diag 𝐷𝑦𝑦 βˆ— is real If 𝑦 is also second-order critical, then 𝑇 is positive semidefinite on the tangent space . 𝑦 is second-order critical β‡’ diag 𝐷𝑦𝑦 βˆ— β‰₯ 𝟐

  24. Step 2: certify Assuming 𝑦 is second-order critical and close to 𝑨 , show that 𝑇 = ddiag 𝐷𝑦𝑦 βˆ— βˆ’ 𝐷 ≽ 0. For all 𝑣 ∈ β„‚ π‘œ with 𝑣 βˆ— 𝑦 = 0 , seven lines give: 2 π‘œ βˆ’ 𝜏 21 π‘œ + 𝑋𝑦 ∞ 𝑣 βˆ— 𝑇𝑣 β‰₯ 𝑣 2 (Used 𝑋 op ≀ 3 π‘œ again.)

  25. Step 2: certify Sufficient condition for tightness of the SDP: π‘œ β‰₯ 𝜏 21 π‘œ + 𝑋𝑦 ∞ Note: 𝑦 and 𝑋 are not independent. Suboptimal argument (whp): 𝑋𝑦 ∞ ≀ 𝑋𝑨 ∞ + 𝑋 𝑦 βˆ’ 𝑨 ∞ ≀ π‘œ log π‘œ + 𝑋 op 𝑦 βˆ’ 𝑨 2 π‘œ log π‘œ + 3 12πœπ‘œ 3/2 ≀

  26. Theorem (ArXiv 1411.3272) With high probability for large (finite) π‘œ , If 𝜏 ≀ 𝑑 β‹… π‘œ 1/6 , Then a second-order critical point 𝑦 close to 𝑨 is optimal, with certificate 𝑇 , And 𝑦𝑦 βˆ— is the unique solution of the SDP.

Recommend


More recommend