A Closer Look at Adaptive Regret Dmitry Adamskiy Joint work with - PowerPoint PPT Presentation

A Closer Look at Adaptive Regret Dmitry Adamskiy Joint work with Wouter Koolen, Volodya Vovk and Alexey Chernov Department of Computer Science Royal Holloway, University of London 15/11/2014 Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 1 / 21

Outline Why adaptive regret? 1 Setup 2 Results 3 Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 2 / 21

Why adaptive regret? 1 Setup 2 Results 3 Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 3 / 21

Weather forecasting: adaptivity Predictor Expert Expert Expert Nature Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor Expert 30% Expert 90% Expert 20% Nature Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% Expert 30% Expert 90% Expert 20% Nature Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% Expert 30% 40% Expert 90% 70% Expert 20% 65% Nature Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% Expert 30% 40% Expert 90% 70% Expert 20% 65% Nature Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% Expert 30% 40% . . . Expert 90% 70% . . . Expert 20% 65% . . . Nature Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% . . . Expert 30% 40% . . . Expert 90% 70% . . . Expert 20% 65% . . . Nature Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% . . . Expert 30% 40% . . . 90% 70% . . . Expert Expert 20% 65% . . . Nature . . . Goal: close to the best expert overall (solution: AA) Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% . . . Expert 30% 40% . . . is bad on foggy days! 90% 70% . . . Expert Expert 20% 65% . . . Nature . . . Goal: close to the best expert overall (solution: AA) Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% . . . Expert 30% 40% . . . is bad on foggy days! 90% 70% . . . Expert Expert 20% 65% . . . drunk on weekends! Nature . . . Goal: close to the best expert overall (solution: AA) Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% . . . Expert 30% 40% . . . is bad on foggy days! 90% 70% . . . goes on training! Expert Expert 20% 65% . . . drunk on weekends! Nature . . . Goal: close to the best expert overall (solution: AA) Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Weather forecasting: adaptivity Predictor 55% 65% . . . Expert 30% 40% . . . is bad on foggy days! Expert 90% 70% . . . goes on training! Expert 20% 65% . . . drunk on weekends! Nature . . . Goal: close to the best expert overall (solution: AA) Adaptive goal : close to the best expert on every time interval Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 4 / 21

Example continued Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 5 / 21

Example continued Non-adaptive predictor would lose trust in the first guy. Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 5 / 21

Adaptive algorithms We studied several approaches to adaptivity: Blowing up the set of experts to compete with virtual sleeping experts [DA, Koolen, Chernov, Vovk, 2012] Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 6 / 21

Adaptive algorithms We studied several approaches to adaptivity: Blowing up the set of experts to compete with virtual sleeping experts [DA, Koolen, Chernov, Vovk, 2012] Turned out to be Fixed Share [Herbster, Warmuth 1998]! Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 6 / 21

Adaptive algorithms We studied several approaches to adaptivity: Blowing up the set of experts to compete with virtual sleeping experts [DA, Koolen, Chernov, Vovk, 2012] Turned out to be Fixed Share [Herbster, Warmuth 1998]! Restarting existing algorithms and combining their predictions [Hazan, Seshadhri, 2009] Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 6 / 21

Adaptive algorithms We studied several approaches to adaptivity: Blowing up the set of experts to compete with virtual sleeping experts [DA, Koolen, Chernov, Vovk, 2012] Turned out to be Fixed Share [Herbster, Warmuth 1998]! Restarting existing algorithms and combining their predictions [Hazan, Seshadhri, 2009] Also turned out to be Fixed Share! Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 6 / 21

Adaptive properties of Fixed Share: results Fixed Share is known for tracking . Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 7 / 21

Adaptive properties of Fixed Share: results Fixed Share is known for tracking . L FS [ 1 , T ] − L S [ 1 , T ] ≤ ln N +( m − 1 ) ln ( N − 1 ) − ( m − 1 ) ln α − ( T − m ) ln ( 1 − α ) , Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 7 / 21

Adaptive properties of Fixed Share: results Fixed Share is known for tracking . L FS [ 1 , T ] − L S [ 1 , T ] ≤ ln N +( m − 1 ) ln ( N − 1 ) − ( m − 1 ) ln α − ( T − m ) ln ( 1 − α ) , What about its adaptivity? Our results Figured out the Worst-Case adaptive regret of Fixed Share 1 Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 7 / 21

Adaptive properties of Fixed Share: results Fixed Share is known for tracking . L FS [ 1 , T ] − L S [ 1 , T ] ≤ ln N +( m − 1 ) ln ( N − 1 ) − ( m − 1 ) ln α − ( T − m ) ln ( 1 − α ) , What about its adaptivity? Our results Figured out the Worst-Case adaptive regret of Fixed Share 1 Proved the optimality of Fixed Share — “no algorithm could have 2 better guarantees on all time intervals” Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 7 / 21

Why adaptive regret? 1 Setup 2 Results 3 Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 8 / 21

Protocol: Mix loss for t = 1 , 2 , . . . do Learner announces probability vector � w t ∈ △ N Reality announces loss vector � ℓ t ∈ [ −∞ , ∞ ] N t e − ℓ n n w n Learner suffers loss ℓ t := − ln � t end for Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 9 / 21

Adaptive Regret Goal: On every time interval [ t 1 , t 2 ] we want to be not much worse than the best expert on that interval. We are interested in small adaptive regret Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 10 / 21

Adaptive Regret Goal: On every time interval [ t 1 , t 2 ] we want to be not much worse than the best expert on that interval. We are interested in small adaptive regret Definition The adaptive regret of the algorithm on the interval [ t 1 , t 2 ] is the loss of the algorithm there minus the lost of the best expert there: L j R [ t 1 , t 2 ] := L [ t 1 , t 2 ] − min [ t 1 , t 2 ] j Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 10 / 21

AA and Fixed Share Aggregating Algorithm [Vovk 1990] updates weights as: t e − ℓ n w n t w n t + 1 := t . t e − ℓ n n w n � Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 11 / 21

AA and Fixed Share Aggregating Algorithm [Vovk 1990] updates weights as: t e − ℓ n w n t w n t + 1 := t . t e − ℓ n n w n � Fixed Share family is defined by the sequence of “switching rates” α t . Then the weight update is t e − ℓ n w n α t + 1 � N � t w n t + 1 := N − 1 + 1 − N − 1 α t + 1 t . t e − ℓ n n w n � Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 11 / 21

AA and Fixed Share Aggregating Algorithm [Vovk 1990] updates weights as: t e − ℓ n w n t w n t + 1 := t . t e − ℓ n n w n � Fixed Share family is defined by the sequence of “switching rates” α t . Then the weight update is t e − ℓ n w n α t + 1 � N � t w n t + 1 := N − 1 + 1 − N − 1 α t + 1 t . t e − ℓ n n w n � Adaptivity hides in the first term. Adamskiy (RHUL) A Closer Look at Adaptive Regret GTP-2014 Guanajuato Mexico 11 / 21

A Closer Look at Adaptive Regret Dmitry Adamskiy Joint work with - PowerPoint PPT Presentation

A Closer Look at Adaptive Regret Dmitry Adamskiy Joint work with Wouter Koolen, Volodya Vovk and Alexey Chernov Department of Computer Science Royal Holloway, University of London 15/11/2014 Adamskiy (RHUL) A Closer Look at Adaptive Regret

CLOSER 2019, May., 2-4, Heraklion, Greece 1 CLOSER 2019, May., 2-4, Heraklion, Greece 2 Cloud

Collection #1 LOOk 1/8 LOOk 2/8 LOOk 3/8 LOOk 4/8 LOOk 5/8 LOOk 6/8

Housing in the Netherlands Housing in the Netherlands A closer look at the Social A closer look

Neural Nets for Adaptive Filter and Adaptive Neural Nets as Adaptive Filters Pattern Recognition

Adaptive Control Chapter 1: Introduction to Adaptive Control Adaptive Control Landau, Lozano,

Adaptive Control Chapter 11: Direct Adaptive Control 1 Adaptive Control Landau, Lozano,

Counterfactual Regret Minimization and Domination in Extensive-Form Games Richard Gibson

No-Regret Learning in Convex Games Geoff Gordon, Amy Greenwald, Casey Marks, and Martin Zinkevich

On adaptive regret bounds for non- stochastic bandits Gergely Neu INRIA Lille, SequeL team

TS 83 DORMA DORMA TS 83 Easy-action Door Closer Easy-action door closer Data and features TS

Adaptive Control Chapter 12: Indirect Adaptive Control 1 Adaptive Control Landau, Lozano,

TAKING A CLOSER LOOK AT IRATI Supervisors: Koen Veelenturf Marijke Kaat - SURFnet

Wireless data networks More on 802.11 Martin Heusse A closer look at 802.11 1 Channels

Using the Network Layering DAG CS 118 Computer Network Fundamentals Peter Reiher Lecture 13 CS

Adaptive Control Chapter 13: Multimodel adaptive control with switching Chapter 13: Multimodel

Adaptive Control Chapter 14: Adaptive regulation Rejection of unknown disturbances 1

Robust Multilingual Statistical Morphology Generation Models Ondej Duek and Filip Jurek

Hashing Index Scheme for Persistent Memory Pengfei Zuo , Yu Hua, Jie Wu Huazhong University of

Fern Green Open House (CCE) CCE Structure Total Defence Leadership for Day All

An Overview of Human Computation Dr. Ling-Jyh Chen (cclljj@iis.sinica.edu.tw) Institute of

Theorie der Informatik 6. Formale Sprachen und Grammatiken Malte Helmert Gabriele R oger

Whole Person Care Los Angeles Clemens Hong MD MPH Director, Whole Person Care Medical

For the Open-Set Data Classification Zhuoyi Wang, Bo Dong, Yu Lin, Yigong Wang, Latifur Khan

Protecting Westminsters Means of Grace Ministry! Organization, Administration & Safety

Sambuz

Useful Links

Newsletter

Mail Us

A Closer Look at Adaptive Regret Dmitry Adamskiy Joint work with - PowerPoint PPT Presentation

A Closer Look at Adaptive Regret Dmitry Adamskiy Joint work with Wouter Koolen, Volodya Vovk and Alexey Chernov Department of Computer Science Royal Holloway, University of London 15/11/2014 Adamskiy (RHUL) A Closer Look at Adaptive Regret

CLOSER 2019, May., 2-4, Heraklion, Greece 1 CLOSER 2019, May., 2-4, Heraklion, Greece 2 Cloud

Collection #1 LOOk 1/8 LOOk 2/8 LOOk 3/8 LOOk 4/8 LOOk 5/8 LOOk 6/8

Housing in the Netherlands Housing in the Netherlands A closer look at the Social A closer look

Neural Nets for Adaptive Filter and Adaptive Neural Nets as Adaptive Filters Pattern Recognition

Adaptive Control Chapter 1: Introduction to Adaptive Control Adaptive Control Landau, Lozano,

Adaptive Control Chapter 11: Direct Adaptive Control 1 Adaptive Control Landau, Lozano,

Counterfactual Regret Minimization and Domination in Extensive-Form Games Richard Gibson

No-Regret Learning in Convex Games Geoff Gordon, Amy Greenwald, Casey Marks, and Martin Zinkevich

On adaptive regret bounds for non- stochastic bandits Gergely Neu INRIA Lille, SequeL team

TS 83 DORMA DORMA TS 83 Easy-action Door Closer Easy-action door closer Data and features TS

Adaptive Control Chapter 12: Indirect Adaptive Control 1 Adaptive Control Landau, Lozano,

TAKING A CLOSER LOOK AT IRATI Supervisors: Koen Veelenturf Marijke Kaat - SURFnet

Wireless data networks More on 802.11 Martin Heusse A closer look at 802.11 1 Channels

Using the Network Layering DAG CS 118 Computer Network Fundamentals Peter Reiher Lecture 13 CS

Adaptive Control Chapter 13: Multimodel adaptive control with switching Chapter 13: Multimodel

Adaptive Control Chapter 14: Adaptive regulation Rejection of unknown disturbances 1

Robust Multilingual Statistical Morphology Generation Models Ondej Duek and Filip Jurek

Hashing Index Scheme for Persistent Memory Pengfei Zuo , Yu Hua, Jie Wu Huazhong University of

Fern Green Open House (CCE) CCE Structure Total Defence Leadership for Day All

An Overview of Human Computation Dr. Ling-Jyh Chen (cclljj@iis.sinica.edu.tw) Institute of

Theorie der Informatik 6. Formale Sprachen und Grammatiken Malte Helmert Gabriele R oger

Whole Person Care Los Angeles Clemens Hong MD MPH Director, Whole Person Care Medical

For the Open-Set Data Classification Zhuoyi Wang, Bo Dong, Yu Lin, Yigong Wang, Latifur Khan

Protecting Westminsters Means of Grace Ministry! Organization, Administration &amp; Safety

Sambuz

Useful Links

Newsletter

Mail Us

Protecting Westminsters Means of Grace Ministry! Organization, Administration & Safety