
MADMM: a generic algorithm for non-smooth optimization on manifolds
Michael Bronstein
Faculty of Informatics, University of Lugano, Switzerland
Perceptual Computing Group, Intel Corporation, Israel
Louvain-la-Neuve, 25 September 2015 1/54


Manifold ADMM
Non-smooth manifold optimization problem equivalently written as

    min_{X ∈ M, Z}  f(X) + g(Z)   s.t.  Z = AX

introducing an artificial variable Z and a linear constraint.
Apply the method of multipliers only to the constraint Z = AX:

    min_{X ∈ M, Z}  f(X) + g(Z) + (ρ/2) ‖AX − Z + U‖²_F

Solve by alternating w.r.t. X and Z, updating U ← U + AX − Z.
The problem breaks into:
- a smooth manifold optimization sub-problem w.r.t. X, and
- a non-smooth unconstrained sub-problem w.r.t. Z.
Hestenes 1969; Powell 1969; Kovnatsky, Glashoff, B 2015 11/54

MADMM
Algorithm 2: MADMM for non-smooth optimization on manifold M
  Initialize k ← 1, Z(1) = AX(1), U(1) = 0
  repeat
    X-step: X(k+1) = argmin_{X ∈ M}  f(X) + (ρ/2) ‖AX − Z(k) + U(k)‖²_F
    Z-step: Z(k+1) = argmin_Z  g(Z) + (ρ/2) ‖AX(k+1) − Z + U(k)‖²_F
    Update U(k+1) = U(k) + AX(k+1) − Z(k+1)
    k ← k + 1
  until convergence
Remarks:
- The solver and the number of optimization iterations in the X- and Z-steps can be chosen freely.
- In some problems, the X-step and Z-step have a closed form.
- The parameter ρ > 0 can be chosen fixed or adapted.
Kovnatsky, Glashoff, B 2015 12/54
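The iteration above is easy to transcribe. Below is a minimal numpy sketch of the generic MADMM loop, with the X- and Z-step solvers passed in as callbacks that encapsulate f, g, M and ρ; the names (`madmm`, `x_step`, `z_step`) are illustrative, not from the paper.

```python
import numpy as np

def madmm(x_step, z_step, A, X0, n_iter=100):
    """Generic MADMM loop (sketch).

    x_step(Z, U) -- approximately solves the smooth manifold sub-problem
                    argmin_{X in M} f(X) + rho/2 ||A X - Z + U||_F^2
    z_step(V)    -- solves argmin_Z g(Z) + rho/2 ||V - Z||_F^2,
                    i.e. the proximal operator of g/rho evaluated at V
    """
    X = X0
    Z = A @ X0                      # Z(1) = A X(1)
    U = np.zeros_like(Z)            # U(1) = 0
    for _ in range(n_iter):
        X = x_step(Z, U)            # X-step
        Z = z_step(A @ X + U)       # Z-step
        U = U + A @ X - Z           # dual update
    return X, Z
```

As a trivial sanity check, taking f = 0, M = Rⁿ, A = I and g = µ‖·‖₁ makes the X-step the identity map X = Z − U and the Z-step an element-wise shrinkage, and the iterates converge to zero.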

  Compressed modes 13/54

Laplacian eigenfunctions
The first k eigenfunctions of some Laplacian are used in spectral clustering, dimensionality reduction, and spectral distances.
Ng et al. 2001; Belkin, Niyogi 2001; Coifman, Lafon 2006 14/54

Laplacian eigenfunctions
Find the first k eigenfunctions of an n × n Laplacian matrix ∆:

    min_{Φ ∈ R^{n×k}}  tr(Φ⊺∆Φ)   s.t.  Φ⊺Φ = I

- tr(Φ⊺∆Φ) = Σ_{ij} w_ij ‖φ_i − φ_j‖², a.k.a. Dirichlet energy in physics
- Many efficient solvers with global optimality guarantees
- 1D Euclidean Laplacian eigenfunctions = Fourier basis: ∆e^{−iωx} = −ω² e^{−iωx}
- Globally supported!
15/54
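As a concrete check of the statement above, a small numpy example computing the first k eigenfunctions of a 1D Laplacian; the path-graph discretization with free ends is an assumption for illustration.

```python
import numpy as np

n, k = 100, 6
# 1D path-graph Laplacian (second differences with free ends)
main = np.full(n, 2.0); main[0] = main[-1] = 1.0
L = np.diag(main) - np.diag(np.ones(n - 1), 1) - np.diag(np.ones(n - 1), -1)

# The first k eigenvectors are the global optimum of
#   min tr(Phi' L Phi)  s.t.  Phi' Phi = I
w, V = np.linalg.eigh(L)          # eigenvalues in ascending order
Phi = V[:, :k]

assert np.allclose(Phi.T @ Phi, np.eye(k))                  # orthonormal basis
assert np.isclose(np.trace(Phi.T @ L @ Phi), w[:k].sum())   # Dirichlet energy
```

This also exhibits the global support mentioned above: the resulting eigenvectors oscillate over the whole domain, like a Fourier basis.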

Laplacian eigenfunctions: 1D example
[Plot: first six eigenfunctions φ₁, …, φ₆ of a 1D Euclidean Laplacian] 16/54

Laplacian eigenfunctions: non-Euclidean example
[Figure: first eigenfunctions of a Laplacian on a triangular mesh] Neumann et al. 2014 17/54

Compressed modes

    min_{Φ ∈ R^{n×k}}  tr(Φ⊺∆Φ) + µ‖Φ‖₁   s.t.  Φ⊺Φ = I

- Dirichlet energy = smoothness
- L₁-norm = sparsity
- Smoothness + sparsity = localization
Ozoliņš et al. 2013 18/54

Compressed modes: 1D example
[Plot: first six compressed modes φ₁, …, φ₆ of a 1D Euclidean Laplacian] Ozoliņš et al. 2013 19/54

Compressed modes: non-Euclidean example
[Figure: first compressed modes on a triangular mesh] Neumann et al. 2014 20/54

Compressed modes: non-Euclidean example
[Figure: first Laplacian eigenfunctions, shown for comparison] Neumann et al. 2014 20/54

Wannier functions
Maximally-localized Wannier functions in Si and GaAs crystals
Wannier 1937; Mostofi 2008 21/54

Splitting method for orthogonality constraints (SOC)
Original problem:

    min_{Φ ∈ R^{n×k}}  tr(Φ⊺∆Φ) + µ‖Φ‖₁   s.t.  Φ⊺Φ = I

Split with two auxiliary variables:

    min_{Φ,P,Q ∈ R^{n×k}}  tr(Φ⊺∆Φ) + µ‖Q‖₁   s.t.  P = Φ, Q = Φ, P⊺P = I

Algorithm 3: SOC method for computing compressed modes
  Initialize k ← 1, Φ(1), P(1) = Q(1) = Φ(1), U(1) = V(1) = 0
  repeat
    Φ(k+1) = argmin_Φ  tr(Φ⊺∆Φ) + (ρ/2)‖Φ − Q(k) + U(k)‖²_F + (ρ′/2)‖Φ − P(k) + V(k)‖²_F
    Q(k+1) = argmin_Q  µ‖Q‖₁ + (ρ/2)‖Φ(k+1) − Q + U(k)‖²_F
    P(k+1) = argmin_P  (ρ′/2)‖Φ(k+1) − P + V(k)‖²_F   s.t.  P⊺P = I
    U(k+1) = U(k) + Φ(k+1) − Q(k+1)
    V(k+1) = V(k) + Φ(k+1) − P(k+1)
    k ← k + 1
  until convergence
Lai, Osher 2014 22/54

Compressed modes as manifold optimization

    min_{Φ ∈ S(n,k)}  tr(Φ⊺∆Φ) + µ‖Φ‖₁

where S(n,k) = {X ∈ R^{n×k} : X⊺X = I} is the Stiefel manifold.
MADMM splitting:

    min_{Φ ∈ S(n,k), Z}  tr(Φ⊺∆Φ) + µ‖Z‖₁ + (ρ/2)‖Φ − Z + U‖²_F

- Sub-problem w.r.t. Φ: smooth manifold-constrained minimization of a quadratic function,
      min_{Φ ∈ S(n,k)}  tr(Φ⊺∆Φ) + (ρ/2)‖Φ − Z + U‖²_F
- Sub-problem w.r.t. Z: sparse coding (Lasso) problem,
      min_Z  µ‖Z‖₁ + (ρ/2)‖Φ + U − Z‖²_F
Kovnatsky, Glashoff, B 2015; Chen et al. 1995; Tibshirani 1996 23/54

Compressed modes by MADMM
Algorithm 4: MADMM for computing compressed modes
  Input: n × n Laplacian matrix ∆, parameter µ > 0
  Output: first k compressed modes of ∆
  Initialize k ← 1, Φ(1) ← some orthonormal matrix, Z(1) = Φ(1), U(1) = 0
  repeat
    Φ(k+1) = argmin_{Φ ∈ S(n,k)}  tr(Φ⊺∆Φ) + (ρ/2)‖Φ − Z(k) + U(k)‖²_F
    Z(k+1) = Shrink_{µ/ρ}(Φ(k+1) + U(k))
    Update U(k+1) = U(k) + Φ(k+1) − Z(k+1)
    k ← k + 1
  until convergence
where Shrink_α(x) = sign(x) max{0, |x| − α} is the shrinkage operator.
Kovnatsky, Glashoff, B 2015 24/54
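Algorithm 4 can be sketched in numpy. The Z-step and dual update are exactly as above; for the Φ-step, the sketch below substitutes a few projected-gradient iterations with an SVD-based projection onto the Stiefel manifold, in place of the Manopt solver used in the paper, and the step sizes and iteration counts are illustrative.

```python
import numpy as np

def shrink(X, alpha):
    # Shrink_alpha(x) = sign(x) * max{0, |x| - alpha}, applied element-wise
    return np.sign(X) * np.maximum(np.abs(X) - alpha, 0.0)

def project_stiefel(X):
    # Closest orthonormal matrix (polar factor) via SVD
    U, _, Vt = np.linalg.svd(X, full_matrices=False)
    return U @ Vt

def compressed_modes(Delta, k, mu=0.5, rho=2.0, n_iter=100, x_inner=10, step=1e-2):
    n = Delta.shape[0]
    rng = np.random.default_rng(0)
    Phi = project_stiefel(rng.standard_normal((n, k)))  # some orthonormal matrix
    Z, U = Phi.copy(), np.zeros((n, k))
    for _ in range(n_iter):
        # Phi-step (approximate): projected gradient on the smooth sub-problem
        for _ in range(x_inner):
            grad = 2.0 * Delta @ Phi + rho * (Phi - Z + U)
            Phi = project_stiefel(Phi - step * grad)
        # Z-step: element-wise shrinkage
        Z = shrink(Phi + U, mu / rho)
        # Dual update
        U = U + Phi - Z
    return Phi, Z
```

The projection keeps Φ exactly orthonormal at every iteration, while the shrinkage drives Z, and through the penalty also Φ, toward sparse, localized modes.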

Convergence
Convergence of MADMM with different random initializations (compressed modes problem of size n = 500, k = 10).
[Plot: cost vs. time (sec), log-log axes]
Kovnatsky, Glashoff, B 2015 25/54

Convergence
Convergence of MADMM with different X-step solvers, trust regions vs. conjugate gradients (compressed modes problem of size n = 500, k = 10).
[Plot: cost vs. time (sec)]
Kovnatsky, Glashoff, B 2015; Manopt: Boumal et al. 2014 26/54

Convergence
Example of convergence of different methods (compressed modes problem of size n = 8 × 10³, k = 10).
[Plot: cost vs. time (sec) for Lai & Osher, Neumann et al., and MADMM]
Kovnatsky, Glashoff, B 2015; Lai, Osher 2014; Neumann et al. 2014 27/54

Scalability
Complexity of different methods (compressed modes problem of size n, k = 10).
[Plot: time per iteration (sec) vs. problem size n for Lai & Osher, Neumann et al., and MADMM]
Kovnatsky, Glashoff, B 2015; Lai, Osher 2014; Neumann et al. 2014 28/54

  Functional correspondence 29/54

Applications of shape correspondence
Texture mapping; pose transfer.
B², Kimmel 2007; Sumner et al. 2004 30/54

Shape correspondence
- Point-wise map t : S → Q
- Minimum-distortion point-wise map t : S → Q (B², Kimmel 2006)
- Functional map T : F(S) → F(Q), a linear operator (Ovsjanikov et al. 2012)
31/54

Functional correspondence
Correspondence as a map of functions: g = Tf.
Expand in bases: f ≈ a₁φ₁ + a₂φ₂ + ⋯ + a_kφ_k and g ≈ b₁ψ₁ + b₂ψ₂ + ⋯ + b_kψ_k.
- Representation in a truncated Laplacian eigenbasis: T ≈ ΨCΦ⊺
- If T is area-preserving, C⊺C = I
- Represent C = XY⊺; then T ≈ Ψ̂Φ̂⊺ = ΨX(ΦY)⊺ (rotation of bases)
- Given Fourier coefficients A = Φ⊺F and B = Ψ⊺G of known corresponding functions F = (f₁, …, f_q) and G = (g₁, …, g_q), find C by solving the linear system CA = B
Ovsjanikov et al. 2012 32/54
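Recovering C from corresponding coefficients is a plain least-squares problem. A numpy sketch on synthetic data (all matrices here are random, for illustration only):

```python
import numpy as np

rng = np.random.default_rng(1)
k, q = 20, 50                             # basis size, number of correspondences

# Synthetic ground-truth functional map and Fourier coefficients
C_true = rng.standard_normal((k, k))
A = rng.standard_normal((k, q))           # A = Phi^T F
B = C_true @ A                            # B = Psi^T G

# CA = B  <=>  A^T C^T = B^T: solve in the least-squares sense
C = np.linalg.lstsq(A.T, B.T, rcond=None)[0].T

assert np.allclose(C, C_true, atol=1e-8)  # exact recovery when q >= k
```

With q ≥ k generic correspondences the system is (over)determined and C is recovered exactly; with noisy correspondences the same call returns the least-squares fit.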

Functional correspondence in a shape collection
For a collection S₁, S₂, …, S_L, pairwise functional maps satisfy A_ij C_ij ≈ B_ij; parametrizing them by per-shape bases X₁, …, X_L gives A_ij X_i ≈ B_ij X_j.
Kovnatsky, B², Glashoff, Kimmel 2013; Kovnatsky, Glashoff, B 2015 33/54

Functional correspondence as manifold optimization

    min_{(X₁, …, X_L) ∈ S^L(k,k)}  Σ_{i≠j} ‖A_ij X_i − B_ij X_j‖_{2,1} + µ Σ_{i=1}^L tr(X_i⊺ Λ_i X_i)

where A_ij, B_ij are Fourier coefficients of given corresponding functions on shapes S_i, S_j, and Λ_i are the first k eigenvalues of the Laplacian ∆_{S_i}.
- Joint diagonalization of Laplacians: find new bases Φ̂_i = Φ_i X_i that approximately diagonalize ∆_i and are coupled
- Optimization on a product of Stiefel manifolds S^L(k,k)
- The L_{2,1}-norm makes it possible to cope with outliers in the correspondence data
- X-step: manifold-constrained minimization of a quadratic function
- Z-step: one iteration of shrinkage
Eynard, Kovnatsky, B², Glashoff 2012; Kovnatsky, Glashoff, B 2015 34/54
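For the L₂,₁-norm, the Z-step mentioned above is a row-wise (group) shrinkage rather than the element-wise one used for compressed modes. A small numpy sketch of this proximal operator (the function name is illustrative):

```python
import numpy as np

def shrink_rows(V, alpha):
    """Proximal operator of alpha * ||.||_{2,1}: row-wise group shrinkage.

    Each row v is mapped to max(0, 1 - alpha/||v||_2) * v, so rows with
    small norm are zeroed out entirely, which is what makes the L_{2,1}
    penalty robust to outlier correspondences.
    """
    norms = np.linalg.norm(V, axis=1, keepdims=True)
    scale = np.maximum(0.0, 1.0 - alpha / np.maximum(norms, 1e-12))
    return scale * V
```

For example, `shrink_rows(np.array([[3.0, 4.0], [0.1, 0.0]]), 1.0)` scales the first row by 1 − 1/5 = 0.8 and zeroes the second row, whose norm is below the threshold.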

Correspondence data
Example of correspondence data (10% outliers, shown in red)
Kovnatsky, Glashoff, B 2015 35/54

Correspondence quality
[Figure: least-squares vs. robust (MADMM) correspondence]
Kovnatsky, Glashoff, B 2015 36/54

Correspondence quality
Correspondence quality evaluated using the Princeton protocol.
[Plot: % of correspondences vs. geodesic error (% of geodesic diameter), for LS and MADMM]
Kovnatsky, Glashoff, B 2015; data: B², Kimmel 2008; benchmark: Kim et al. 2011 37/54

Convergence
Convergence of different methods.
[Plot: cost vs. time (sec) for smoothing and MADMM]
Kovnatsky, Glashoff, B 2015 38/54

Multimodal spectral clustering
Clustering accuracy with uncoupled and coupled bases:
- Uncoupled (no outliers): 100%
- Coupled, L₂ (no outliers): 53%
- Coupled, L₂ (10% outliers): 72%
- Coupled, L₂,₁ (10% outliers): 82%
Kovnatsky, Glashoff, B 2015; Eynard, Kovnatsky, B², Glashoff 2012 39/54

  Multidimensional scaling 40/54

Multidimensional scaling
MDS problem: given an n × n (squared) distance matrix D, find a k-dimensional configuration of points X ∈ R^{n×k} such that ‖x_i − x_j‖² ≈ d_ij.
[Figure: example distance matrix D and point configuration X]
Cayton, Dasgupta 2006 41/54

Similarity vs. distance
Equivalence between distances and similarities:
- similarity from (squared) distances: B = −½HDH, where H = I − (1/n)11⊺ is the double-centering matrix
- (squared) distances from similarity: d_ij = b_ii + b_jj − 2b_ij
- PSD similarity matrices correspond to Euclidean distance matrices (EDM); projection onto the PSD cone: B* = UΛ₊U⊺
Schönberg 1938; Dattorro 2005; Cayton, Dasgupta 2006 42/54
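The equivalence above is easy to verify numerically: double-center a squared Euclidean distance matrix and recover it from the resulting similarity. A small numpy check on random points (sizes arbitrary):

```python
import numpy as np

rng = np.random.default_rng(2)
n, k = 8, 2
X = rng.standard_normal((n, k))

# Squared Euclidean distance matrix d_ij = ||x_i - x_j||^2
D = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)

# Double centering: similarity B = -1/2 H D H with H = I - (1/n) 11^T
H = np.eye(n) - np.ones((n, n)) / n
B = -0.5 * H @ D @ H

# Recover squared distances: d_ij = b_ii + b_jj - 2 b_ij
b = np.diag(B)
assert np.allclose(b[:, None] + b[None, :] - 2 * B, D)
```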

Classical MDS
Algorithm 5: Classical MDS
  Input: squared distance matrix D
  Compute similarity by double centering: B = −½HDH
  Perform eigendecomposition B = UΛU⊺ and take the largest k positive eigenvalues Λ_k and corresponding eigenvectors U_k
  Output: X = U_k Λ_k^{1/2}
Classical MDS as an optimization problem: minimize the strain

    min_{X ∈ R^{n×k}}  ‖B − XX⊺‖²_F,   B = −½HDH

Young, Householder 1938 43/54
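Algorithm 5 is a few lines of numpy. A sketch, with a usage example recovering a planar point configuration from its squared distances (the embedding is unique only up to rotation and translation, so we compare distance matrices rather than coordinates):

```python
import numpy as np

def classical_mds(D, k):
    """Classical MDS: double centering + truncated eigendecomposition."""
    n = D.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * H @ D @ H                      # similarity by double centering
    w, V = np.linalg.eigh(B)                  # ascending eigenvalues
    idx = np.argsort(w)[::-1][:k]             # largest k eigenvalues
    w_k = np.maximum(w[idx], 0.0)             # keep only positive part
    return V[:, idx] * np.sqrt(w_k)           # X = U_k Lambda_k^{1/2}

# Usage: squared distances of 2D points are reproduced exactly with k = 2
rng = np.random.default_rng(3)
X0 = rng.standard_normal((10, 2))
D = ((X0[:, None, :] - X0[None, :, :]) ** 2).sum(-1)
X = classical_mds(D, 2)
D_rec = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
assert np.allclose(D, D_rec)
```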

Sensitivity to outliers
Error dispersion by double centering: a single error ε in the (squared) distance matrix is spread by B = −½HDH over the entire similarity matrix, with magnitude O(ε) at the corrupted entry, O(ε/n) in its row and column, and O(ε/n²) everywhere else.
Cayton, Dasgupta 2006 44/54
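The dispersion pattern above can be checked numerically by double-centering a matrix containing a single error ε (the indices and sizes below are arbitrary):

```python
import numpy as np

n, eps = 100, 1.0
E = np.zeros((n, n))
E[3, 7] = E[7, 3] = eps                 # one corrupted squared distance

H = np.eye(n) - np.ones((n, n)) / n     # double-centering matrix
dB = -0.5 * H @ E @ H                   # resulting error in the similarity

# The single error touches every entry of the similarity matrix:
assert np.count_nonzero(np.abs(dB) > 1e-12) == n * n
assert abs(dB[3, 7]) > 0.4 * eps        # O(eps) at the corrupted entry
assert abs(dB[3, 0]) < 0.02 * eps       # O(eps/n) in its row and column
assert abs(dB[0, 1]) < 2e-4 * eps       # O(eps/n^2) everywhere else
```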
