Online Convex Optimization in Adversarial MDPs Aviv Rosenberg - PowerPoint PPT Presentation

Mar 27, 2024 •378 likes •421 views

Poster #150 Online Convex Optimization in Adversarial MDPs Aviv Rosenberg Yishay Mansour Motivation: MDPs are very popular but dont consider time -changing environments BGP Routing is a great motivating example Adversarial MDP is an

Poster #150 Online Convex Optimization in Adversarial MDPs Aviv Rosenberg Yishay Mansour Motivation: ▪ MDPs are very popular but don’t consider time -changing environments ▪ BGP Routing is a great motivating example Adversarial MDP is an Model: MDP in which the losses might change arbitrarily ▪ Episodic MDP ▪ Transition Function is fixed but unknown to the learner ▪ Sequence of loss functions is chosen by an adversary ▪ Success is measures by the regret – comparing to the best policy in hindsight
Poster #150 Online Convex Optimization in Adversarial MDPs Aviv Rosenberg Yishay Mansour Problem Reformulation: ▪ The learner picks policies or occupancy measures equivalently ▪ Picking occupancy measures makes this an instance of online convex optimization Occupancy measure is a probability distribution Algorithm: over the state-action pairs ▪ Basic idea: run online mirror descent ▪ Problem: unknow transition function means we don’t know if an occupancy measure is legal ▪ Solution: maintain confidence sets that contain the MDP with high probability
Poster #150 Online Convex Optimization in Adversarial MDPs Aviv Rosenberg Yishay Mansour Challenges: Performance criterion is a ▪ Efficient implementation of the algorithm function that aggregates all the losses of a single episode. ▪ Regret analysis Examples involve risk-sensitivity and robustness. Contributions: Previous state-of-the-art: ▪ handling performance criteria that are convex • Based on Follow the Perturbed with respect to the occupancy measures Leader • Regret bound of 𝑃 𝐼 𝑇 𝐵 𝑈 ▪ High confidence regret bound of 𝑃 𝐼 𝑇 𝐵 𝑈 in expectation

Recommend

Convex Hell 362 dnc CS 16: Convex Hull Whoops, I mean... Convex Hull Whats a Convex Hull?

CS 16: Convex Hull Welcome to... Convex Hell 362 dnc CS 16: Convex Hull Whoops, I mean... Convex Hull Whats a Convex Hull? 363 dnc CS 16: Convex Hull What is the Convex Hull? Let S be a set of points in the plane. Intuition: Imagine

720 views • 19 slides

Planning and Optimization December 4, 2019 G1. Factored MDPs G1.1 Factored MDPs Planning and

Planning and Optimization December 4, 2019 G1. Factored MDPs G1.1 Factored MDPs Planning and Optimization G1. Factored MDPs G1.2 Probabilistic Planning Tasks Malte Helmert and Thomas Keller G1.3 Complexity Universit at Basel G1.4

278 views • 9 slides

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Optimization Problems Instructor:

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Optimization Problems Instructor: Shaddin Dughmi Outline Convex Optimization Basics 1 Common Classes 2 Interlude: Positive Semi-Definite Matrices 3 More Convex Optimization

897 views • 62 slides

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Optimization Problems

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Optimization Problems Instructor: Shaddin Dughmi Outline Convex Optimization Basics 1 Common Classes 2 Interlude: Positive Semi-Definite Matrices 3 More Convex Optimization

887 views • 61 slides

constrained convex optimization virgil pavlu 1 convex set a set X in a vector space is convex if

constrained convex optimization virgil pavlu 1 convex set a set X in a vector space is convex if for any w 1 , w 2 X and [0 , 1] we have w 1 + (1 ) w 2 X 2 convex function a function f is convex(concave) on X Dom ( f

200 views • 18 slides

Convex Optimization 4. Convex Optimization Problems Prof. Ying Cui Department of Electrical

Convex Optimization 4. Convex Optimization Problems Prof. Ying Cui Department of Electrical Engineering Shanghai Jiao Tong University 2018 SJTU Ying Cui 1 / 64 Outline Optimization problems Convex optimization Linear optimization

816 views • 64 slides

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Sets Instructor: Shaddin Dughmi

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Sets Instructor: Shaddin Dughmi Outline Convex sets, Affine sets, and Cones 1 Examples of Convex Sets 2 Convexity-Preserving Operations 3 Separation Theorems 4 Convex Sets

958 views • 38 slides

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Functions Instructor: Shaddin

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Functions Instructor: Shaddin Dughmi Outline Convex Functions 1 Examples of Convex and Concave Functions 2 Convexity-Preserving Operations 3 Convex Functions A function f : R n

932 views • 54 slides

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Sets Instructor: Shaddin Dughmi

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Sets Instructor: Shaddin Dughmi Outline Convex sets, Affine sets, and Cones 1 Examples of Convex Sets 2 Convexity-Preserving Operations 3 Separation Theorems 4 Convex Sets A

515 views • 38 slides

CS675: Convex and Combinatorial Optimization Fall 2014 Convex Functions Instructor: Shaddin

CS675: Convex and Combinatorial Optimization Fall 2014 Convex Functions Instructor: Shaddin Dughmi Outline Convex Functions 1 Examples of Convex and Concave Functions 2 Convexity-Preserving Operations 3 Convex Functions A function f : R n

572 views • 54 slides

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback Alekh Agarwal

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback Alekh Agarwal Ofer Dekel Lin Xiao UC Berkeley Microsoft Research Online Convex Optimization (Full-Info) Adversary Player Online Convex Optimization

762 views • 37 slides

Convex hull 1 - 1 Convex hull 1 - 2 Convex hull 1 - 3 Convex hull Definition, extremal

Convex hull 1 - 1 Convex hull 1 - 2 Convex hull 1 - 3 Convex hull Definition, extremal point Jarvis algorithm Orientation predicate Buggy degenerate example Real RAM model and general position hypothesis Graham

1.41k views • 114 slides

CS133 Computational Geometry Convex Hull 1 Convex Hull Given a set of n points, find the

CS133 Computational Geometry Convex Hull 1 Convex Hull Given a set of n points, find the minimal convex polygon that contains all the points 2 Convex Hull Properties 3 Convex Hull Representation The convex hull is represented by all

1.07k views • 82 slides

Some Recent Advances in Non-convex Optimization Purushottam Kar IIT KANPUR Outline of the Talk

Some Recent Advances in Non-convex Optimization Purushottam Kar IIT KANPUR Outline of the Talk Recap of Convex Optimization Why Non-convex Optimization? Non-convex Optimization: A Brief Introduction Robust Regression : A

1.23k views • 88 slides

A Primer in Convex Optimization Moritz Diehl partly based on material by Colin Jones, Stephen

A Primer in Convex Optimization Moritz Diehl partly based on material by Colin Jones, Stephen Boyd and Lieven Vandenberghe Overview Convex sets Convex functions Operations that preserve convexity Convex optimization Convex Sets

711 views • 35 slides

16. Review of convex optimization Convex sets and functions Convex programming models

CS/ECE/ISyE 524 Introduction to Optimization Spring 201718 16. Review of convex optimization Convex sets and functions Convex programming models Network flow problems Least squares problems Regularization and tradeoffs

714 views • 28 slides

Welcome to the Internet Seminar Case Studies to Assess Poten/al Impacts of

Welcome to the Internet Seminar Case Studies to Assess Poten/al Impacts of Hydraulic Fracturing on Drinking Water Resources Sponsored by: EPA Office of Research and

409 views • 6 slides

Stina Sderqvist, PhD. Cogmed R&D stina.soderqvist@pearson.com Overview Cogmed Progress

Cogmed Progress Indicator, Cogmed Questionnaire & Variable Protocols - What do they tell us? Stina Sderqvist, PhD. Cogmed R&D stina.soderqvist@pearson.com Overview Cogmed Progress Indicator Why and how was it developed?

504 views • 32 slides

Advanced Techniques for Web-based Comics @RachelNabors .com a spectrum of storytelling a

Advanced Techniques for Web-based Comics @RachelNabors .com a spectrum of storytelling a spectrum of storytelling madefire.com goo.gl/1o8zZu Storyboard for Ferdinand the Bull taken at the Disney Family Home Museum a spectrum of

1.02k views • 63 slides

Lect 14a - Line Arrangements: Definitions and Zone Theorem Lect 14b - Line Arrangements:

Lect 14a - Line Arrangements: Definitions and Zone Theorem Lect 14b - Line Arrangements: Definitions and Zone Theorem Lect 14c - Line Arrangements: Definitions and Zone Theorem

432 views • 5 slides

Engaging in Logical Code Reasoning with an Activity-Based Online Tool Computer Science n School of

Engaging in Logical Code Reasoning with an Activity-Based Online Tool Computer Science n School of Computing n Clemson University Jason O. Hallstrom (Florida Atlantic University) Joseph E. Hollingsworth (Rose-Hulman), Megan Fowler, Eileen T.

337 views • 19 slides

Data Warehousing Outline Overview of data warehousing Dimensional Modeling Online

Data Warehousing Outline Overview of data warehousing Dimensional Modeling Online Analytical Processing From OLTP to the Data Warehouse Traditionally, database systems stored data relevant to current business processes

414 views • 14 slides

The Glass Half Full Using Programmable Hardware Accelerators in Analytical Databases Zsolt

The Glass Half Full Using Programmable Hardware Accelerators in Analytical Databases Zsolt Istvn IMDEA Software Institute 1 IM IMDEA Soft ftware In Institute 16 Faculty in the areas of: Program Analysis and Verification

509 views • 32 slides

CloudDB: A Data Store for all Sizes in the Cloud Hakan Hacigumus Data Management Research NEC

CloudDB: A Data Store for all Sizes in the Cloud Hakan Hacigumus Data Management Research NEC Laboratories America http://www.nec-labs.com/dm www.nec-labs.com What I will try to cover Historical perspective and motivation (

911 views • 42 slides