Data-Dependent Algorithms for Bandit Convex Optimization


  1. Data-Dependent Algorithms for Bandit Convex Optimization
Mehryar Mohri (Google, New York University) and Scott Yang (New York University)
NIPS Easy Data II, Dec 10, 2015

  2. Learning Scenario and Set-Up: Bandit Convex Optimization
A sequential optimization problem: K ⊂ R^n is a compact action space, and the f_t are convex loss functions.
At time t, the learner chooses an action x_t and suffers the loss f_t(x_t).
Goal: minimize the regret
    max_{x ∈ K} Σ_{t=1}^{T} ( f_t(x_t) − f_t(x) )
This is a zeroth-order convex optimization problem: the learner has no access to gradient information!
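To make the protocol concrete, here is a minimal sketch of the bandit feedback loop and of the regret quantity above; the learner interface (play/observe), the losses list, and the finite comparator grid are illustrative assumptions, not part of the talk.

```python
def bco_protocol(learner, losses, T):
    """Run the bandit convex optimization loop for T rounds.
    `learner` is a hypothetical object exposing play() / observe(loss);
    losses[t] is the convex loss f_t. Only the scalar f_t(x_t) is
    revealed to the learner -- no gradient information."""
    values = []
    for t in range(T):
        x_t = learner.play()      # choose an action x_t in K
        loss = losses[t](x_t)     # suffer the loss f_t(x_t)
        learner.observe(loss)     # bandit feedback: a single scalar
        values.append(loss)
    return values

def regret(values, losses, comparator_grid):
    """Regret against the best fixed action in a finite grid of K:
    sum_t f_t(x_t) - min_x sum_t f_t(x)."""
    best_fixed = min(sum(f(x) for f in losses) for x in comparator_grid)
    return sum(values) - best_fixed
```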

  3. Historical Results
Summary of existing work:
1. Lipschitz losses [Flaxman et al., 2005]: O(T^{3/4})
2. Smooth and strongly convex losses [Levy et al., 2014]: O(√T)
3. Smooth losses [Dekel et al., 2015]: O(T^{5/8})
4. Strongly convex losses [Agarwal et al., 2010]: O(T^{2/3})
5. etc.
Remarks:
1. These results are not data-dependent.
2. The algorithms require a priori knowledge of the regularity of the loss functions.

  4. General Framework for BCO Algorithms
Idea:
1. Use zeroth-order information to estimate the gradient.
2. Feed the gradient estimate into a standard convex optimization algorithm (see the sketch after this slide).
Key part: estimating the gradient!
Suppose we want to play x_t. Instead, sample and play a point y_t on an ellipse E_t around x_t. Then
    ∇f_t(x_t) ≈ ∇ E_{y ∈ E_t}[f_t(y)] ≈ ∇̃f_t(y_t),
where ∇̃f_t(y_t) denotes the estimate built from the single observed value f_t(y_t).
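A minimal sketch of this two-step recipe, assuming the classical one-point spherical estimator of Flaxman et al. (a sphere of radius delta as the ellipse E_t) fed into online gradient descent over a Euclidean-ball action set; the step size, radius, and projection step are simplifying assumptions, not the algorithm from the talk.

```python
import numpy as np

def spherical_gradient_estimate(f_value, u, delta, n):
    """One-point gradient estimate (Flaxman et al., 2005): if
    y_t = x_t + delta * u with u uniform on the unit sphere, then
    (n / delta) * f_t(y_t) * u is an unbiased estimate of the gradient
    of a delta-smoothed version of f_t at x_t."""
    return (n / delta) * f_value * u

def bco_ogd(losses, x0, T, delta=0.1, eta=0.01, radius=1.0):
    """Gradient estimates fed into online gradient descent, with K
    taken to be a Euclidean ball of the given radius (an assumption
    made so that the projection step stays simple)."""
    n = x0.shape[0]
    x = x0.copy()
    total_loss = 0.0
    for t in range(T):
        u = np.random.randn(n)
        u /= np.linalg.norm(u)        # uniform direction on the unit sphere
        y = x + delta * u             # play y_t on the sphere E_t around x_t
        value = losses[t](y)          # zeroth-order feedback only
        total_loss += value
        g = spherical_gradient_estimate(value, u, delta, n)
        x = x - eta * g               # standard online gradient descent step
        norm = np.linalg.norm(x)
        if norm > radius - delta:     # keep x_t deep enough inside K
            x *= (radius - delta) / norm
    return x, total_loss
```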

  5. Data-Dependent Sampling
Remarks:
The scaling of the ellipse and the learning rate both factor into the regret bound.
Historically, both have been tuned for worst-case data, so the algorithms do not adapt to easier data.
Questions:
Can we derive algorithms that learn faster on easier data?
Can we characterize what easier data means for BCO problems?
Can we construct algorithms that consolidate some of the existing regret bounds?

  6. Data-Dependent Sampling
Idea: scale the ellipse and the learning rate optimally according to the actual data that we see.
Consequences:
A data-dependent regret bound in terms of the average curvature of the ellipsoid.
Adaptively attains the smooth, strongly convex, etc. regret bounds as worst-case results.
For more details, please stop by the poster. Thank you!
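The talk defers the algorithm's details to the poster, but as a rough illustration of scaling the learning rate according to the data actually seen, here is a generic AdaGrad-style schedule driven by the observed gradient estimates; the function name, the eta0 parameter, and the schedule itself are illustrative assumptions, not the authors' method.

```python
import numpy as np

def adaptive_step_sizes(grad_estimates, eta0=1.0, eps=1e-8):
    """AdaGrad-style data-dependent learning rates: eta_t shrinks with
    the cumulative squared norm of the gradient estimates observed so
    far, so easy data (small observed gradients) yields larger steps
    and faster learning. A generic adaptive device, not the authors'
    algorithm."""
    cum_sq = 0.0
    etas = []
    for g in grad_estimates:
        cum_sq += float(np.dot(g, g))
        etas.append(eta0 / np.sqrt(cum_sq + eps))
    return etas
```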
