Open Problem: Parameter-Free and Scale-Free Online Algorithms Francesco Orabona D´ avid P´ al Yahoo Research, New York June 25, 2016 COLT 2016
Online Linear Optimization Given a convex set K ⊆ R N For t = 1, 2, . . . • predict w t ∈ K • receive loss vector ℓ t ∈ R N • suffer loss � ℓ t , w t � T T ∑ ∑ Regret T ( u ) = � ℓ t , w t � − � ℓ t , u � t = 1 t = 1 � �������� �� �������� � � ������ �� ������ � algorithm’s loss competitor’s loss We focus on: 1 K = R N 2 K = ∆ N = { x ∈ R N : x ≥ 0, � x � 1 = 1 }
Two Types of Adaptivity 1 Adaptivity to competitor u (parameter-free, quantile bounds, ...) 2 Adaptivity to scale of ℓ 1 , ℓ 2 , . . . , ℓ T (scale-free, second-order bounds, ...) Open Problem (Informal) Design efficient doubly adaptive algorithms.
FTRL Bound Theorem (CBL’06, SS’11) If R : K → R is a non-negative 1 -strongly convex function w.r.t. �·� , then FTRL with regularizer R and learning rate η > 0 satisfies T Regret T ( u ) ≤ R ( u ) � ℓ t � 2 ∑ ∀ u ∈ K + η ∗ η t = 1 � R ( u ) / ∑ T t = 1 � ℓ t � 2 With learning rate η = ∗ � � T � � ℓ t � 2 ∑ Regret T ( u ) ≤ � R ( u ) ∗ t = 1 Two cheats 1 Bound holds only for fixed u t = 1 � ℓ t � 2 2 Need to know ∑ T ∗
Existing Results for K = ∆ N � � 1 Regularizer R ( u ) = D ( u � π ) sup u ∈ ∆ N R ( u ) = max i ln π i 1 For any ℓ 1 , ℓ 2 , . . . , ℓ T ∈ R N [deREGK’11, OP’15] � � 1 � T � � � ℓ t � 2 ∑ Regret T ( u ) ≤ � max ln ∞ i π i t = 1 2 Assuming that � ℓ t � ∞ ≤ 1 [CFH’09, CV’10, LS’14, LS’15, KE’15, FRS’15, OP’16] � Regret T ( u ) ≤ T ( 1 + D ( u � π )) 3 For any ℓ 1 , ℓ 2 , . . . , ℓ T ∈ R N [FRS’15+OP’15] � � T � � ℓ t � 2 ∑ Regret T ( u ) ≤ � ( 1 + D ( u � π )) ∞ t = 1 O ( N max i log log 1 π i ) memory and time per round
Existing Results for K = R N 2 � u � 2 Regularizer R ( u ) = 1 sup u ∈ R N R ( u ) = + ∞ 2 1 For any ℓ 1 , ℓ 2 , . . . , ℓ T ∈ R N [OP’15] � � √ T � � � 1 + � u � 2 � ℓ t � 2 ∑ Regret T ( u ) ≤ 2 + 1 ≤ t ≤ T � ℓ t � 2 � T max 2 t = 1 2 Assuming that � ℓ t � 2 ≤ 1 [SM’12, O’13, MA’13, MO’14, O’14, OP’16] � � � 1 + � u � 1 Regret T ( u ) ≤ T log ( 1 + � u � 2 ) 2
Open Problems Open Problem #1 Find an algorithm for K = ∆ N with O ( N ) per-round time complexity such that for any π ∈ ∆ N and any ℓ 1 , ℓ 2 , . . . , ℓ T � � T � � ℓ t � 2 ∑ ∀ u ∈ ∆ N Regret T ( u ) ≤ � ( 1 + D ( u � π )) ∞ t = 1 Open Problem #2 — Reward $100 for positive solution Find an algorithm for K = R N with O ( N ) per-round time complexity such that for any ℓ 1 , ℓ 2 , . . . , ℓ T � � T � � ℓ t � 2 ∀ u ∈ R N ∑ Regret T ( u ) ≤ ( 1 + � u � 2 · polylog ( 1 + � u � 2 )) � 2 t = 1
Recommend
More recommend