Efficient First-Order Algorithms for Adaptive Signal Denoising - PowerPoint PPT Presentation

Efficient First-Order Algorithms for Adaptive Signal Denoising Dmitrii Ostrovskii * Zaid Harchaoui † ∗ INRIA Paris, Ecole Normale Sup´ erieure † University of Washington ICML 2018 Stockholm

Signal denoising problem Recover discrete-time signal x = ( x τ ) ∈ C 2 n +1 from noisy observations y τ = x τ + σξ τ , τ = − n , ..., n , where ξ τ are i.i.d. standard Gaussian random variables. 2 5 1.5 1 0.5 0 0 -0.5 -1 -1.5 -2 -5 0 20 40 60 80 100 0 20 40 60 80 100 Difficulty: unknown structure D. Ostrovskii, Z. Harchaoui 1 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

Adaptive denoising: background* Linear time-invariant estimator : convolution of y with filter ϕ ∈ C n +1 : � � x t = [ ϕ ∗ y ] t := ϕ τ y t − τ , 0 ≤ t ≤ n , 0 ≤ τ ≤ n • Suppose x satisfies discrete ODE (sines, polynomials, exponentials): P (∆) x ≈ 0 , where [∆ x ] t := x t − 1 , and operator P (∆) = � d k =1 p k ∆ k is unknown . • Then there exists ϕ o with near-optimal risk and small ℓ 1 -norm of Discrete Fourier transform F n [ ϕ o ]: r �F n [ ϕ o ] � 1 ≤ √ n + 1 , r = poly(deg( P )) . ϕ ( y ) with similar properties to ϕ o . Goal: construct adaptive filter � ϕ = � *[Juditsky and Nemirovski, 2009, 2010; Harchaoui et al., 2015; Ostrovsky et al., 2016] D. Ostrovskii, Z. Harchaoui 2 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

Estimators � � � F n [ y − ϕ ∗ y ] 2 n � minimize Res p ( ϕ ) := n p � � r subject to ϕ ∈ Φ( r ) := �F n [ ϕ ] � 1 ≤ √ n + 1 . Least Squares [Ostrovsky et al., Uniform Fit [Harchaoui et al., 2016]: 2015]: p = 2 ( ⇒ ℓ 2 -loss guarantees) p = ∞ ( ⇒ ℓ ∞ -loss guarantees) simple constraint: proximal mapping computed in O ( n ); first-order oracle: computed in O ( n log n ) by reducing to FFT; low accuracy: are crude approximate solutions sufficient? First-order methods D. Ostrovskii, Z. Harchaoui 3 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

Strategies b = F n [[ y ] 2 n A u := F n [[ y ∗ ϕ ] 2 n Fourier-domain: u := F n [ ϕ ] , n ] , n ] . Least Squares: quadratic problem on ℓ 1 -ball: �A u − b � 2 min 2 . r � u � 1 ≤ √ n +1 • Fast Gradient Method: O (1 / T 2 ) convergence after T iterations.* Uniform Fit : reduced to a bilinear saddle-point problem: min �A u − b � ∞ = min � v � 1 ≤ 1 � v , A u � − � v , b � . max r r � u � 1 ≤ � u � 1 ≤ √ n +1 √ n +1 • Mirror Prox: O (1 / T ) convergence after T iterations.* ℓ 1 -adapted geometry, dual certificates, adaptive step, proximal terms. *[Nesterov and Nemirovski, 2013; Juditsky and Nemirovski, 2011] D. Ostrovskii, Z. Harchaoui 4 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

Statistical accuracy: theoretical result Let � x � n , p be the “estimation norm” with the right scaling: � � 1 / p 2 n � 1 | x t | p � x � n , p = . n + 1 t = n • Exact solutions [Harchaoui et al., 2015; Ostrovsky et al., 2016]: � � � log( n /δ ) � x − � ϕ LS ∗ y � n , 2 ≥ C σ r ≤ δ, P n + 1 � � � log( n /δ ) ϕ UF ∗ y � n , ∞ ≥ C σ r 2 P � x − � ≤ δ. n + 1 • We extend these results to approximate solutions : Theorem A Approximate solutions ˜ ϕ with accuracy ε ∗ = σ r for Uniform Fit and ε ∗ = σ 2 r 2 for Least Squares admit the same bounds as the exact ones. D. Ostrovskii, Z. Harchaoui 5 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

Experiment: early stopping Comparison of ℓ 2 -loss and computation time in two scenarios: sum of sines with 4 random frequencies and 2 pairs of close frequencies (right) ∗ . 10 0 10 0 0.5 0.5 ` 2 -error ` 2 -error 0.25 0.25 Lasso Lasso 10 -1 10 -1 Coarse Coarse 0.05 0.05 Fine Fine 0.025 0.025 0.06 0.12 0.25 0.5 1 2 4 0.06 0.12 0.25 0.5 1 2 4 SNR ! 1 SNR ! 1 10 1 10 1 CPU time (s) Lasso CPU time (s) Lasso 10 0 10 0 Coarse Coarse Fine Fine 10 -1 10 -1 10 -2 10 -2 10 -3 10 -3 0.06 0.12 0.25 0.5 1 2 4 0.06 0.12 0.25 0.5 1 2 4 SNR ! 1 SNR ! 1 • Coarse: crude Least Squares solution with accuracy ε ∗ = σ 2 r 2 ; • Fine: near-optimal Least Squares solution with accuracy 0 . 01 ε ∗ ; • Lasso: 10-fold oversampled Lasso estimator [Bhaskar et al., 2013]. Code available at https://github.com/ostrodmit/AlgoRec D. Ostrovskii, Z. Harchaoui 6 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

Algorithmic complexity Theorem B To reach the statistical accuracy ε ∗ , in each case it is sufficient to perform T ∗ = O (PSNR + 1) steps of the corresponding algorithm. 10 2 10 2 T $ 10 1 T $ 10 1 CMP- ` 2 FGM- ` 2 10 0 10 0 10 -2 10 0 10 2 10 -2 10 0 10 2 SNR SNR Iteration at which accuracy ε ∗ is attained experimentally on the sum of sines with 4 random frequencies: Uniform Fit (left), Least Squares (right). D. Ostrovskii, Z. Harchaoui 7 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

Thank you and see you at poster B#51 Where I will also show how to solve some non-smooth problems in O (1 / T 2 ) . D. Ostrovskii, Z. Harchaoui 8 / 8 Efficient First-Order Algorithms for Adaptive Signal Denoising

References Bhaskar, B., Tang, G., and Recht, B. (2013). Atomic norm denoising with applications to line spectral estimation. IEEE Trans. Signal Processing , 61(23):5987–5999. Harchaoui, Z., Juditsky, A., Nemirovski, A., and Ostrovsky, D. (2015). Adaptive recovery of signals by convex optimization. In Proceedings of The 28th Conference on Learning Theory (COLT) 2015, Paris, France, July 3-6, 2015 , pages 929–955. Juditsky, A. and Nemirovski, A. (2009). Nonparametric denoising of signals with unknown local structure, I: Oracle inequalities. Appl. & Comput. Harmon. Anal. , 27(2):157–179. Juditsky, A. and Nemirovski, A. (2010). Nonparametric denoising signals of unknown local structure, II: Nonparametric function recovery. Appl. & Comput. Harmon. Anal. , 29(3):354–367. Juditsky, A. and Nemirovski, A. (2011). First-order methods for nonsmooth convex large-scale optimization, II: Utilizing problem structure. Optimization for Machine Learning , pages 149–183. Nesterov, Y. and Nemirovski, A. (2013). On first-order algorithms for ℓ 1 /nuclear norm minimization. Acta Numerica , 22:509–575. Ostrovsky, D., Harchaoui, Z., Juditsky, A., and Nemirovski, A. (2016). Structure-blind signal recovery. In Advances in Neural Information Processing Systems , pages 4817–4825.

Convergence: numerical experiment Constrained uniform-fit Constrained least-squares (Mirror Prox) (Fast Gradient Method) 10 1 10 3 Absolute accuracy 10 2 10 0 10 1 10 0 10 -1 10 -1 10 -2 CMP- ` 2 FGM- ` 2 10 -3 CMP- ` 2 -Gap FGM- ` 2 -Gap 10 -2 10 -4 1 10 1 10 2 1 10 1 10 2 Convergence of the residual (95% upper confidence bound) for a sum of s = 4 sinusoids with random frequencies and amplitudes, SNR = 4. Dashed: online accuracy bounds via the dual certificate.

Efficient First-Order Algorithms for Adaptive Signal Denoising - PowerPoint PPT Presentation

Efficient First-Order Algorithms for Adaptive Signal Denoising Dmitrii Ostrovskii * Zaid Harchaoui INRIA Paris, Ecole Normale Sup erieure University of Washington ICML 2018 Stockholm Signal denoising problem Recover discrete-time

Efficient signal processing using Haskell and LLVM Henning Thielemann 2016-09-15 Efficient

Neural Nets for Adaptive Filter and Adaptive Neural Nets as Adaptive Filters Pattern Recognition

Adaptive Control Chapter 1: Introduction to Adaptive Control Adaptive Control Landau, Lozano,

Adaptive Control Chapter 11: Direct Adaptive Control 1 Adaptive Control Landau, Lozano,

DFW Experience With Rhythm Adaptive Signal Control Rhythm Adaptive Signal Control April 13 2012

Adaptive Signal Processing Stephen Casey American University scasey@american.edu February 21th,

Tx Signal: 1000 Hz sine wave; Attenuation; Random noise with 0.5ms spike Tx Signal Noise Rx

Adaptive Control Chapter 12: Indirect Adaptive Control 1 Adaptive Control Landau, Lozano,

Efficient audio signal processing using LLVM and Haskell Henning Thielemann 2013-04-30

ADAPTIVE SIGNAL CONTROL - When is it the right choice? JOINT TSITE/ITS TN FALL MEETING

Adaptive Control Chapter 13: Multimodel adaptive control with switching Chapter 13: Multimodel

Adaptive Control Chapter 14: Adaptive regulation Rejection of unknown disturbances 1

Signal Adaptive Frame Theory Stephen Casey American University & Norbert Wiener Center,

Reorder Buffer Method Issue Execute Write Classic 5-stage pipeline In-order In-order

Speech Processing 15-492/18-492 Speech Synthesis Signal Processing Signal Manipulation Signal

Waveform Generation Fundamental part of signal processing is the signal. Within the

Investor Community Conference Call Q2 2008 2008 2008 2008 Risk Review Tom Flynn Executive

DM550 / DM857 Introduction to Programming Peter Schneider-Kamp petersk@imada.sdu.dk

Valley Clean Energy Board Meeting November 12, 2020 Via Teleconference Item 15 2020 Local

SQL: The Query Language, Part I COMS4037 Databases Dmitry Shkatov Slides courtesy of Prof. Joe

Simple, Credible, and Approximately- Optimal auctions Constantinos Daskalakis*, Brendan Lucier#,

Important Notice The past performance of Keppel REIT is not necessarily indicative of its future

Automated Fixing of Programs with Contracts Yi Wei, Yu Pei, Carlo A. Furia, Lucas S. Silva,

Fixing Boolean networks asynchronously Juilio Aracena and Lilian Salinas Universidad de

Efficient First-Order Algorithms for Adaptive Signal Denoising - PowerPoint PPT Presentation

Efficient First-Order Algorithms for Adaptive Signal Denoising Dmitrii Ostrovskii * Zaid Harchaoui INRIA Paris, Ecole Normale Sup erieure University of Washington ICML 2018 Stockholm Signal denoising problem Recover discrete-time

Efficient signal processing using Haskell and LLVM Henning Thielemann 2016-09-15 Efficient

Neural Nets for Adaptive Filter and Adaptive Neural Nets as Adaptive Filters Pattern Recognition

Adaptive Control Chapter 1: Introduction to Adaptive Control Adaptive Control Landau, Lozano,

Adaptive Control Chapter 11: Direct Adaptive Control 1 Adaptive Control Landau, Lozano,

DFW Experience With Rhythm Adaptive Signal Control Rhythm Adaptive Signal Control April 13 2012

Adaptive Signal Processing Stephen Casey American University scasey@american.edu February 21th,

Tx Signal: 1000 Hz sine wave; Attenuation; Random noise with 0.5ms spike Tx Signal Noise Rx

Adaptive Control Chapter 12: Indirect Adaptive Control 1 Adaptive Control Landau, Lozano,

Efficient audio signal processing using LLVM and Haskell Henning Thielemann 2013-04-30

ADAPTIVE SIGNAL CONTROL - When is it the right choice? JOINT TSITE/ITS TN FALL MEETING

Adaptive Control Chapter 13: Multimodel adaptive control with switching Chapter 13: Multimodel

Adaptive Control Chapter 14: Adaptive regulation Rejection of unknown disturbances 1

Signal Adaptive Frame Theory Stephen Casey American University &amp; Norbert Wiener Center,

Reorder Buffer Method Issue Execute Write Classic 5-stage pipeline In-order In-order

Speech Processing 15-492/18-492 Speech Synthesis Signal Processing Signal Manipulation Signal

Waveform Generation Fundamental part of signal processing is the signal. Within the

Investor Community Conference Call Q2 2008 2008 2008 2008 Risk Review Tom Flynn Executive

DM550 / DM857 Introduction to Programming Peter Schneider-Kamp petersk@imada.sdu.dk

Valley Clean Energy Board Meeting November 12, 2020 Via Teleconference Item 15 2020 Local

SQL: The Query Language, Part I COMS4037 Databases Dmitry Shkatov Slides courtesy of Prof. Joe

Simple, Credible, and Approximately- Optimal auctions Constantinos Daskalakis*, Brendan Lucier#,

Important Notice The past performance of Keppel REIT is not necessarily indicative of its future

Automated Fixing of Programs with Contracts Yi Wei, Yu Pei, Carlo A. Furia, Lucas S. Silva,

Fixing Boolean networks asynchronously Juilio Aracena and Lilian Salinas Universidad de

Signal Adaptive Frame Theory Stephen Casey American University & Norbert Wiener Center,