nonconvex variance reduced optimization with arbitrary
play

Nonconvex Variance Reduced Optimization with Arbitrary Sampling - PowerPoint PPT Presentation

Nonconvex Variance Reduced Optimization with Arbitrary Sampling Samuel Horvth Peter Richtrik Empirical Risk Minimization n x R d f ( x ) := 1 X min f i ( x ) n i =1 Empirical Risk Minimization n x R d f ( x ) := 1 X min f i (


  1. Nonconvex Variance Reduced Optimization with Arbitrary Sampling Samuel Horváth Peter Richtárik

  2. Empirical Risk Minimization n x ∈ R d f ( x ) := 1 X min f i ( x ) n i =1

  3. Empirical Risk Minimization n x ∈ R d f ( x ) := 1 X min f i ( x ) n i =1 n is big

  4. Empirical Risk Minimization n x ∈ R d f ( x ) := 1 X min f i ( x ) n i =1 n is big non-convex, L i -smooth kr f i ( x ) � r f i ( y ) k  L i k x � y k

  5. Baseline Variance Reduced SGD Methods SVRG Johnson & Zhang NIPS 2013 SAGA Defazio, Bach & Lacoste-Julien NIPS 2014 SARAH Nguyen, Liu, Scheinberg & Takáč ICML 2017

  6. Baseline Variance Reduced SGD Methods SVRG Johnson & Zhang NIPS 2013 Uniform sampling SAGA Defazio, Bach & Lacoste-Julien NIPS 2014 Uniform sampling SARAH Nguyen, Liu, Scheinberg & Takáč ICML 2017 Uniform sampling

  7. Baseline Variance Reduced SGD Methods–Mini-batch SVRG Konečný & Richtárik FAMS 2017 SAGA Reddi, Hefny, Sra, Poczos, Smola CDC 2016 SARAH Nguyen, Liu, Scheinberg & Takáč 2017

  8. Baseline Variance Reduced SGD Methods–Mini-batch Mini-batch size SVRG Konečný & Richtárik FAMS 2017 Uniform sampling SAGA Reddi, Hefny, Sra, Poczos, Smola CDC 2016 Uniform sampling SARAH Nguyen, Liu, Scheinberg & Takáč 2017 Uniform sampling

  9. Contributions Analysis of SVRG, SAGA and SARAH in the arbitrary sampling paradigm • Construction of optimal minibatch sampling •

  10. Richtárik & Takáč (OL 2016; arXiv 2013) Contributions Qu, Richtárik & Zhang (NIPS 2015) Qu & Richtárik (COAP 2016) Chambolle, Ehrhardt, Richtárik & Schoenlieb (SIOPT 2018) Hanzely & Richtárik (AISTATS 2019) Qian, Qu & Richtárik (ICML 2019) Gower, Loizou, Qian, Sailanbayev, Shulgin & Richtárik (ICML 2019) Analysis of SVRG, SAGA and SARAH in the arbitrary sampling paradigm • Construction of optimal minibatch sampling •

  11. Richtárik & Takáč (OL 2016; arXiv 2013) Contributions Qu, Richtárik & Zhang (NIPS 2015) Qu & Richtárik (COAP 2016) Chambolle, Ehrhardt, Richtárik & Schoenlieb (SIOPT 2018) Hanzely & Richtárik (AISTATS 2019) Qian, Qu & Richtárik (ICML 2019) Gower, Loizou, Qian, Sailanbayev, Shulgin & Richtárik (ICML 2019) Analysis of SVRG, SAGA and SARAH in the arbitrary sampling paradigm • Construction of optimal minibatch sampling • First optimal/importance sampling for minibatches!

Recommend


More recommend