Distributionally Robust Stochastic Optimization and Learning
Models/Algorithms for Data-Driven Optimization and Learning

Yinyu Ye
Department of Management Science and Engineering
Institute for Computational and Mathematical Engineering
Stanford University, Stanford

US & Mexico Workshop on Optimization and its Applications in Honor of Don Goldfarb
January 8-12, 2018
Outline

Computation and Sample Complexity of Solving Markov Decision/Game Processes

Distributionally Robust Optimization under Moment, Likelihood and Wasserstein Bounds, and its Applications

Goal: analyze and develop tractable and provable models and algorithms for optimization with uncertain and sampled data.
Table of Contents

1. Computation and Sample Complexity of Solving Markov Decision/Game Processes

2. Distributionally Robust Optimization under Moment, Likelihood and Wasserstein Bounds, and its Applications
The Markov Decision/Game Process

Markov decision processes (MDPs) provide a mathematical framework for modeling sequential decision-making in situations where outcomes are partly random and partly under the control of a decision maker.

Markov game processes (MGPs) provide a mathematical framework for modeling the sequential decision-making of a two-person turn-based zero-sum game.

MDPs/MGPs are useful for studying a wide range of optimization/game problems solved via dynamic programming, and have been studied at least since the 1950s (cf. Shapley 1953, Bellman 1957).

Modern applications include dynamic planning under uncertainty, reinforcement learning, social networking, and almost all other stochastic dynamic/sequential decision/game problems in the Mathematical, Physical, Management and Social Sciences.
The Markov Decision Process/Game (continued)

At each time step, the process is in some state i = 1, ..., m, and the decision maker chooses an action j ∈ A_i available in state i, incurring an immediate cost c_j.

The process responds at the next time step by randomly moving into a new state i′. The probability that the process enters i′ depends on the action chosen in state i: it is given by the state-transition probability distribution p_j ∈ R^m.

Given the state/action pair, the distribution of the next state is conditionally independent of all previous states and actions; in other words, the state transitions of an MDP possess the Markov property.
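As a minimal formal restatement of the Markov property in the slide's notation (the symbols s_t for the state at time t and j_t for the action chosen at time t are introduced here for illustration only):

\[
\Pr\bigl(s_{t+1} = i' \mid s_t = i,\ j_t = j,\ s_{t-1}, j_{t-1}, \dots, s_0, j_0\bigr)
= \Pr\bigl(s_{t+1} = i' \mid s_t = i,\ j_t = j\bigr)
= (p_j)_{i'},
\]

where (p_j)_{i′} denotes the i′-th entry of the transition distribution p_j.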
MDP Stationary Policy and Cost-to-Go Value

A stationary policy for the decision maker is a function π = {π_1, π_2, ..., π_m} that specifies an action π_i ∈ A_i that the decision maker will always choose in state i; a policy also leads to a cost-to-go value for each state.

The MDP problem is to find a stationary policy that minimizes/maximizes the expected discounted sum of costs over the infinite horizon with a discount factor 0 ≤ γ < 1.

If the states are partitioned into two sets, where one player minimizes and the other maximizes the discounted sum, then the process becomes a two-person turn-based zero-sum stochastic game.
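The cost-to-go values just described satisfy the classical Bellman equations; as a sketch in the slide's notation (with v^π ∈ R^m collecting the per-state values of policy π, introduced here for illustration):

\[
v^{\pi}_i = c_{\pi_i} + \gamma\, p_{\pi_i}^{\top} v^{\pi},
\qquad
v^{*}_i = \min_{j \in A_i} \bigl\{ c_j + \gamma\, p_j^{\top} v^{*} \bigr\},
\qquad i = 1, \dots, m.
\]

A minimal value-iteration sketch in Python follows; the container names (costs, P, actions) are hypothetical placeholders for the data c_j, p_j, and A_i above, not part of the talk:

import numpy as np

def value_iteration(costs, P, actions, gamma=0.9, tol=1e-8):
    # Approximate the optimal cost-to-go v* of a discounted MDP.
    # costs[j]   : immediate cost c_j of action j
    # P[j]       : transition distribution p_j over the m states
    # actions[i] : list of actions A_i available in state i
    # gamma      : discount factor, 0 <= gamma < 1
    m = len(actions)
    v = np.zeros(m)
    while True:
        # Bellman update: v_i <- min_{j in A_i} { c_j + gamma * p_j^T v }
        v_new = np.array([min(costs[j] + gamma * P[j] @ v for j in actions[i])
                          for i in range(m)])
        if np.max(np.abs(v_new - v)) < tol:
            return v_new
        v = v_new

# Toy instance: m = 2 states; actions 0 and 1 are available in state 0,
# action 2 in state 1 (all numbers are made up for illustration).
costs = np.array([1.0, 2.0, 0.5])
P = np.array([[0.5, 0.5],   # p_0
              [0.9, 0.1],   # p_1
              [0.2, 0.8]])  # p_2
print(value_iteration(costs, P, actions=[[0, 1], [2]]))

Since 0 ≤ γ < 1, each sweep is a γ-contraction in the sup-norm, so the iterates converge geometrically to v*.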