Rapid, Robust, and Reliable Blind Deconvolution via Nonconvex - PowerPoint PPT Presentation

Rapid, Robust, and Reliable Blind Deconvolution via Nonconvex Optimization Shuyang Ling Department of Mathematics, UC Davis Oct.18th, 2016 Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 1 / 31

Acknowledgements Research in collaboration with: Prof.Xiaodong Li (UC Davis) Prof.Thomas Strohmer (UC Davis) Dr.Ke Wei (UC Davis) This work is sponsored by NSF-DMS and DARPA. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 2 / 31

Outline Applications in image deblurring and wireless communication Mathematical models and convex approach A nonconvex optimization approach towards blind deconvolution Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 3 / 31

What is blind deconvolution? What is blind deconvolution? Suppose we observe a function y which consists of the convolution of two unknown functions, the blurring function f and the signal of interest g , plus noise w . How to reconstruct f and g from y ? y = f ∗ g + w . It is obviously a highly ill-posed bilinear inverse problem... Much more difficult than ordinary deconvolution...but has important applications in various fields. Solvability? What conditions on f and g make this problem solvable? How? What algorithms shall we use to recover f and g ? Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 4 / 31

Why do we care about blind deconvolution? Image deblurring Let f be the blurring kernel and g be the original image, then y = f ∗ g is the blurred image. Question: how to reconstruct f and g from y ? y g f w = + blurred original blurring noise image image kernel = + Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 5 / 31

Why do we care about blind deconvolution? Joint channel and signal estimation in wireless communication Suppose that a signal x , encoded by A , is transmitted through an unknown channel f . How to reconstruct f and x from y ? y = f ∗ Ax + w . y :received f :unknown A :Encoding x :unknown w : noise signal channel matrix signal = + Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 6 / 31

Subspace assumptions We start from the original model y = f ∗ g + w . As mentioned before, it is an ill-posed problem. Hence, this problem is unsolvable without further assumptions... Subspace assumption Both f and g belong to known subspaces: there exist known tall matrices B ∈ C L × K and A ∈ C L × N such that f = Bh 0 , g = Ax 0 , for some unknown vectors h 0 ∈ C K and x 0 ∈ C N . Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 7 / 31

Model under subspace assumption In the frequency domain, y = ˆ g + w = diag(ˆ ˆ f ⊙ ˆ f )ˆ g + w , where “ ⊙ ” denotes entry-wise multiplication. We assume y and ˆ y are both of length L . Subspace assumption Both ˆ f and ˆ g belong to known subspaces: there exist known tall matrices B ∈ C L × K and ˆ A ∈ C L × N such that ˆ ˆ f = ˆ g = ˆ Bh 0 , ˆ Ax 0 , for some unknown vectors h 0 ∈ C K and x 0 ∈ C N . Here ˆ B = FB and ˆ A = FA . The degree of freedom for unknowns: K + N ; number of constraint: L . To make the solution identifiable, we require L ≥ K + N at least. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 8 / 31

Remarks on subspace assumption 𝒛: 𝑀×1 𝑪: 𝑀×𝐿 𝒊: 𝐿×1 𝑦: 𝑂×1 𝒙: 𝑀×1 𝐵: 𝑀×𝑂 + = + ⊙ Subspace assumption is flexible and useful in applications. In imaging deblurring, B can be the support of the blurring kernel; A is a wavelet basis. In wireless communication, B corresponds to time-limitation of the channel and A is an encoding matrix. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 9 / 31

Mathematical model y = diag( Bh 0 ) Ax 0 + w , where w 1 2 N ( 0 , σ 2 I L ) + i 2 N ( 0 , σ 2 I L ) and d 0 = � h 0 �� x 0 � . d 0 ∼ √ √ Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 10 / 31

Mathematical model y = diag( Bh 0 ) Ax 0 + w , where w 1 2 N ( 0 , σ 2 I L ) + i 2 N ( 0 , σ 2 I L ) and d 0 = � h 0 �� x 0 � . d 0 ∼ √ √ One might want to solve the following nonlinear least squares problem, min F ( h , x ) := � diag( Bh ) Ax − y � 2 . Difficulties: 1 Nonconvexity: F is a nonconvex function; algorithms (such as gradient descent) are likely to get trapped at local minima. 2 No performance guarantees. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 10 / 31

Convex approach and lifting Two-step convex approach (a) Lifting: convert bilinear to linear constraints (b) Solving a SDP relaxation to recover hx ∗ . Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 11 / 31

Convex approach and lifting Two-step convex approach (a) Lifting: convert bilinear to linear constraints (b) Solving a SDP relaxation to recover hx ∗ . Step 1: lifting Let a i be the i -th column of A ∗ and b i be the i -th column of B ∗ . y i = ( Bh 0 ) i x ∗ 0 a i + w i = b ∗ i h 0 x ∗ 0 a i + w i , 0 and define the linear operator A : C K × N → C L as, X 0 := h 0 x ∗ Let A ( Z ) := { b ∗ i Za i } L i =1 = {� Z , b i a ∗ i �} L i =1 . Then, there holds y = A ( X 0 ) + w . In this way, A ∗ ( z ) = � L i : C L → C K × N . i =1 z i b i a ∗ Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 11 / 31

Convex relaxation and state of the art Step 2: nuclear norm minimization Consider the convex envelop of rank( Z ): nuclear norm � Z � ∗ = � σ i ( Z ). min � Z � ∗ s.t. A ( Z ) = A ( X 0 ) . Convex optimization can be solved within polynomial time. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 12 / 31

Convex relaxation and state of the art Step 2: nuclear norm minimization Consider the convex envelop of rank( Z ): nuclear norm � Z � ∗ = � σ i ( Z ). min � Z � ∗ s.t. A ( Z ) = A ( X 0 ) . Convex optimization can be solved within polynomial time. Theorem [Ahmed-Recht-Romberg 11] Assume y = diag( Bh 0 ) Ax 0 , A : L × N is a complex Gaussian random matrix, � b i � 2 ≤ µ 2 max K B ∗ B = I K , L � Bh 0 � 2 ∞ ≤ µ 2 , h , L the above convex relaxation recovers X = h 0 x ∗ 0 exactly with high probability if L C 0 max( µ 2 max K , µ 2 h N ) ≤ log 3 L . Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 12 / 31

Pros and Cons of Convex Approach Pros and Cons Pros: Simple and comes with theoretic guarantees Cons: Computationally too expensive to solve SDP Our Goal: rapid, robust, reliable nonconvex approach Rapid: linear convergence Robust: stable to noise Reliable: provable and comes with theoretical guarantees; number of measurements close to information-theoretic limits. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 13 / 31

A nonconvex optimization approach? An increasing list of nonconvex approach to various problems: Phase retrieval: by Cand´ es, Li, Soltanolkotabi, Chen, Wright, etc... Matrix completion: by Sun, Luo, Montanari, etc... Various problems: by Recht, Wainwright, Constantine, etc... Two-step philosophy for provable nonconvex optimization (a) Use spectral initialization to construct a starting point inside “the basin of attraction ” ; (b) Simple gradient descent method. The key is to build up “the basin of attraction”. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 14 / 31

Building “the basin of attraction” The basin of attraction relies on the following three observations. Observation 1: Unboundedness of solution If the pair ( h 0 , x 0 ) is a solution to y = diag( Bh 0 ) Ax 0 , then so is the pair ( α h 0 , α − 1 x 0 ) for any α � = 0. Thus the blind deconvolution problem always has infinitely many solutions of this type. We can recover ( h 0 , x 0 ) only up to a scalar. It is possible that � h � ≫ � x � (vice versa) while � h � · � x � = d 0 . Hence we define N d 0 to balance � h � and � x � : � � N d 0 := { ( h , x ) : � h � ≤ 2 d 0 , � x � ≤ 2 d 0 } . Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 15 / 31

Building “the basin of attraction” Observation 2: Incoherence Our numerical experiments have shown that the algorithm’s performance depends on how much b l and h 0 are correlated. = L max i | b ∗ h := L � Bh 0 � 2 i h 0 | 2 µ 2 ∞ , the smaller µ h , the better. � h 0 � 2 � h 0 � 2 Therefore, we introduce the N µ to control the incoherence: √ � N µ := { h : L � Bh � ∞ ≤ 4 µ d 0 } . “Incoherence” is not a new idea. In matrix completion, we also require the left and right singular vectors of the ground truth cannot be too “aligned” with those of measurement matrices { b i a ∗ i } 1 ≤ i ≤ L . The same philosophy applies here. Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 16 / 31

Building “the basin of attraction” Observation 3: “Close” to the ground truth We define N ε to quantify closeness of ( h , x ) to true solution, i.e., N ε := { ( h , x ) : � hx ∗ − h 0 x ∗ 0 � F ≤ ε d 0 } . We want to find an initial guess close to ( h 0 , x 0 ) . Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 17 / 31

Rapid, Robust, and Reliable Blind Deconvolution via Nonconvex - PowerPoint PPT Presentation

Rapid, Robust, and Reliable Blind Deconvolution via Nonconvex Optimization Shuyang Ling Department of Mathematics, UC Davis Oct.18th, 2016 Shuyang Ling (UC Davis) 16w5136, Oaxaca, Mexico Oct.18th, 2016 1 / 31 Acknowledgements Research in

Semi-blind deconvolution in 4Pi-microscopy Robert St uck Institute for numerical and applied

Near Real-Time Deconvolution for 3D Fluorescence Microscopy Marc Bruce, PhD CEO, Microvolution

Joint Blind Deconvolution and Blind Demixing via Nonconvex Optimization Shuyang Ling Department

Global Geometry of Multichannel Sparse Blind Deconvolution on the Sphere Yanjun Li Yoram Bresler

Blind galaxy survey images Deconvolution with Shape Constraint Jean-Luc Starck

The Convex Geometry of Blind Deconvolution Dominik Stger Technische Universitt Mnchen

Rapid Response Jobs are Alaskas Future Rapid Response Rapid Response Rapid Response is a

Old and New Algorithms for Blind Deconvolution Yair Weiss Hebrew University of Jerusalem joint

Nanocrafter Xylem vs. r pl r lv r pl r lv r pl r lv + d - d - d + d BLIND BLIND RATINGS

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

Identifiability of Blind Deconvolution with Subspace or Sparsity Constraints Yanjun Li Joint

Blind deconvolution of 3D data in wide field fluorescence microscopy Ferrol Soulez 1 , 2 Loc

Blind/Myopic Deconvolution Julian Christou UCSC CfAO PSF Reconstruction Workshop May 2004

Blind Image Deconvolution Need for Theoretical . . . Based on Sparsity: Need for Improvement

Rotation-Invariance State-of-the-Art . . . Can Further Improve Need for Improvement

Model REM Rapid Engineering Model What is REM? REM Rapid Engineering Model What is REM? REM

ROBUST LOCATION AND SCALE ESTIMATION WITH CENSORED OUTCOMES Jerome H. Friedman Stanford

Nonlinear Control Lecture # 28 Robust State Feedback Stabilization Nonlinear Control Lecture # 28

Nonlinear Control Lecture # 10 State Feedback Stabilization and Robust State Feedback

Robust Lower Bounds for Communication and Stream Computation Amit Chakrabarti Dartmouth

Web Crawling Introduction to Information Retrieval INF 141 Donald J. Patterson Content adapted

Dynamic Robust Utility P Beissner* F Maccheroni # M Marinacci # S Mukerji @ * ANU # U Bocconi and @

R-Packages for Robust Asymptotic Statistics Dr. Matthias Kohl Chair for Stochastics joint work

Trade-off between Efficiency and Robustness Doctoral Colloqium @ SenSys18, Shenzhen Robert