SLIDE 49 Experiments: Influence of κ
5 10 15 20 25 30 −10 −9 −8 −7 −6 −5 −4 −3 −2
Number of gradient evaluations Relative function value covtype, logistic, µ=1/100 n
QuickeNing-SVRG κ = 0.001 κ0 QuickeNing-SVRG κ = 0.01 κ0 QuickeNing-SVRG κ = 0.1 κ0 QuickeNing-SVRG κ = κ0 QuickeNing-SVRG κ = 10 κ0 QuickeNing-SVRG κ = 100 κ0 QuickeNing-SVRG κ = 1000 κ0
5 10 15 20 25 30 35 40 −10 −9 −8 −7 −6 −5 −4 −3 −2
Number of gradient evaluations Relative function value covtype, lasso, λ= 10 / n
QuickeNing-SVRG κ = 0.001 κ0 QuickeNing-SVRG κ = 0.01 κ0 QuickeNing-SVRG κ = 0.1 κ0 QuickeNing-SVRG κ = κ0 QuickeNing-SVRG κ = 10 κ0 QuickeNing-SVRG κ = 100 κ0 QuickeNing-SVRG κ = 1000 κ0
5 10 15 20 25 30 −7 −6 −5 −4 −3 −2 −1
Number of gradient evaluations Relative function value rcv1, logistic, µ=1/100 n
QuickeNing-SVRG κ = 0.001 κ0 QuickeNing-SVRG κ = 0.01 κ0 QuickeNing-SVRG κ = 0.1 κ0 QuickeNing-SVRG κ = κ0 QuickeNing-SVRG κ = 10 κ0 QuickeNing-SVRG κ = 100 κ0 QuickeNing-SVRG κ = 1000 κ0
5 10 15 20 25 30 35 40 −10 −8 −6 −4 −2
Number of gradient evaluations Relative function value rcv1, lasso, λ= 10 / n
QuickeNing-SVRG κ = 0.001 κ0 QuickeNing-SVRG κ = 0.01 κ0 QuickeNing-SVRG κ = 0.1 κ0 QuickeNing-SVRG κ = κ0 QuickeNing-SVRG κ = 10 κ0 QuickeNing-SVRG κ = 100 κ0 QuickeNing-SVRG κ = 1000 κ0
κ0 is the parameter (same as in Catalyst) used in all experiments; QuickeNing slows down when using κ > κ0; here, for SVRG, QuickeNing is robust to small values of κ!
Julien Mairal QuickeNing 28/30