Fast Convergence to Wardrop Equilibria by Adaptive Sampling Methods - PowerPoint PPT Presentation

Introduction and Wardrop’s traffic model Exploration replication policy Symmetric games Lower bounds . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Methods . . . . . Gouleakis Themistoklis June 2, 2011 . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 1 / 20

Introduction and Wardrop’s traffic model Problem definition Exploration replication policy Wardrop’s traffic model Symmetric games Potential function Lower bounds . Problem definition . The problem we are going to deal with has the following properties: The game is a selfish routing game divided into rounds. There is an infinite number of agents each responsible for an infinitesimal amount of traffic. In each round, each agent samples an alternative routing path and compares the latency on this path with its current latency. In the next round all the agents have the opportunity to choose a different path (simultaneusly). . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 2 / 20

Introduction and Wardrop’s traffic model Problem definition Exploration replication policy Wardrop’s traffic model Symmetric games Potential function Lower bounds Problem:The latency of some agent may increase! Even worse: the game may get stuck in oscillations (and never reach an equilibrium). Solution: Let the agents sample alternative routes at random and migrate with a probability depending on the observed latency difference. . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 3 / 20

Introduction and Wardrop’s traffic model Problem definition Exploration replication policy Wardrop’s traffic model Symmetric games Potential function Lower bounds . Wardrop’s traffic model . We consider a model for selfish routing where an infinite population of agents carries an infinitesimal amount of load each Let E denote a set of resources (edges). Continuous, non-decreasing latency functions e : [0 , 1] → R + . A set of commodities with flow demands or rates r i , i ∈ [ k ] such that ∑ k i =1 r i = 1. For every commodity i ∈ [ k ] let P i ⊆ 2 E denote a set of strategies (paths) available for commodity i. Let P = ∪ i ∈ [ k ] P i and let L = max p ∈ P | p | . An instance is symmetric if k = 1 and asymmetric otherwise. An instance is single-resource if for all p ∈ P , | p | = 1. . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 4 / 20

Introduction and Wardrop’s traffic model Problem definition Exploration replication policy Wardrop’s traffic model Symmetric games Potential function Lower bounds . Definition: Wardrop equilibrium . . . A feasible flow vector ( f p ) p ∈ P is at a Wardrop equilibrium for the instance Γ if for every commodity i ∈ [ k ] and every p , p ′ ∈ P i with f p > 0 it holds that l p ( f ) ≤ l p ′ ( f ). . . . . . Potential function: ∫ f e Φ( f ) = ∑ 0 l ( x ) dx e ∈ E The set of allocations in equilibrium coincides with the set of allocations minimizing the potential function. Our goal is the design of distributed rerouting policies that approximate the Wardrop equilibrium. . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 5 / 20

Introduction and Wardrop’s traffic model Problem definition Exploration replication policy Wardrop’s traffic model Symmetric games Potential function Lower bounds . Shifted potential . Observe, however, for certain instances of the routing game, Φ ∗ might be zero. In this case, we suggest to shift the potential by some positive additive term. So, we get an α -shifted potential. Φ ∗ + α is strictly positive. This is equivalent to adding a virtual amount of to the latency observed on every path. . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 6 / 20

Introduction and Wardrop’s traffic model Problem definition Exploration replication policy Wardrop’s traffic model Symmetric games Potential function Lower bounds . Definition: Relative slope . . . A differentiable latency function l has relative slope d at x if l ′ ( x ) ≤ d · l ( x ) x . A latency function has relative slope d if it has relative slope d over the entire range [0 , 1] and a class of latency functions L has relative slope d if every l ∈ L has relative slope d . . . . . . Related to the derivative of xl ( x ). Examples: polynomials and exponentials. . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 7 / 20

Introduction and Wardrop’s traffic model Exploration replication policy Formal definition Symmetric games Convergence Lower bounds . Rerouting policy . In every round, an agent is activated with constant probability λ = 1/32. Then he performs the following two steps: . . . Sampling: With probability (1 − β ) perform step 1(a) and with 1 probability β perform step 1(b). (a) Proportional sampling: Sample path Q ∈ P i with probability f Q r i . (b) Uniform sampling: Sample path Q ∈ P i with probability 1 | P i | . . . . Migration: If l Q < l P , migrate to path Q with probabillity 2 l p − l Q d ( l p + α ) The parameter β must be chosen subject to the constraint min p ∈ P l p (0) + α β ≤ (1) L ∗ max e ∈ E max x ∈ [0 ,β ] l ′ e ( x ) . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 8 / 20

Introduction and Wardrop’s traffic model Exploration replication policy Formal definition Symmetric games Convergence Lower bounds . Definition: Exploration - replication policy . . . For an instance Γ let d ≥ 1 be an upper bound on the relative slope of the latency functions and let β be chosen as in Equation (1). For every commodity i ∈ [ k ] and every path P , Q ∈ P i with l Q ≤ l P , the ( α, β )-exploration-replication policy migrates a fraction of ) l P − l Q ( µ PQ = λ · 1 (1 − β ) · f Q 1 + β · d r i |P i | l P + α with λ = 1 32 agents from path P to path Q . . . . . . . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 9 / 20

Introduction and Wardrop’s traffic model Exploration replication policy Formal definition Symmetric games Convergence Lower bounds . Fact . . . Let Γ be an instance of the congestion game and let Γ + α be an instance that we obtain from Γ by inserting a new resource e P for every P ∈ P with constant latency function l e P ( x ) = α . Let Φ and Φ + α denote the respective potential functions. . . . 1 The ( α, β )-exploration-replication policy behaves on Gamma precisely as the (0 , β )-exploration-replication policy does on Γ+ α . . . . 2 If Φ + α ( f ) ≤ (1 + ǫ )(Φ + α ), then Φ( f ) ≤ (1 + ǫ )Φ + ǫα . . . . . . . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 10 / 20

Introduction and Wardrop’s traffic model Exploration replication policy Formal definition Symmetric games Convergence Lower bounds . Definition . . . For two flow vectors f and f’ of consecutive rounds, the virtual potential gain is the potential gain that would occur if the latencies were fixed at the beginning of the round, i. e. ∑ V ( f , f ′ ) = l e ( f )( f ′ e = f e ) e ∈ E By our policy, this value is always negative. . . . . . . Lemma . . . Consider an instance Γ and the ( α, β )-exploration-replication policy changing the flow vector from f to f ′ in one step. Then we have P , Q ∈P µ PQ ( l Q − l P ) = V ( f , f ′ ) ∆Φ = Φ( f ′ ) − Φ( f ) ≥ 1 ∑ . . 2 2 . . . . . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 11 / 20

Introduction and Wardrop’s traffic model Bicriteria approximation Exploration replication policy Approximation of the potential Symmetric games Asymmetric games Lower bounds . Definition: δ − ǫ equilibrium . . . For a flow vector f let P + ( δ ) = { P ∈ P| l P ( f ) ≥ (1 + δ ) l ( f ) } denote the set of δ -expensive strategies and let P ( δ ) = { P ∈ P| l P ( f ) ≤ (1 − δ ) l ( f ) } denote the set of δ -cheap strategies. The population f is in a δ − ǫ -equilibrium iff at most ǫ agents utilize δ -expensive and δ -cheap strategies. We write P + and P − if δ is clear from the context. . . . . . . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 12 / 20

Introduction and Wardrop’s traffic model Bicriteria approximation Exploration replication policy Approximation of the potential Symmetric games Asymmetric games Lower bounds . Theorem . . . Consider a symmetric congestion game Γ and an initial flow vector f init . For the ( α, β ) -exploration-replication policy, the number of rounds in which the population vector is not δ − ǫ -equilibrium w.r.t Γ + α (as defined in Fact 3) is bounded from above by: ( d ( Φ( f init ) + α )) O ǫδ 2 log Φ ∗ + α . . . . . . . . . . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Metho Gouleakis Themistoklis 13 / 20

Fast Convergence to Wardrop Equilibria by Adaptive Sampling Methods - PowerPoint PPT Presentation

Introduction and Wardrops traffic model Exploration replication policy Symmetric games Lower bounds . . Fast Convergence to Wardrop Equilibria by Adaptive Sampling Methods . . . . . Gouleakis Themistoklis June 2, 2011 . . . . .

Wardrop Equilibria and Price of Stability in Bottleneck Games With Splittable Traffic Vladimir

Wardrop s equilibrium and adjustment of OD matrices Farhad Hatami Universitat Autnoma de

Chemistry 2000 Slide Set 19b: Organic acids Acid dissociation equilibria Marc R. Roussel March

Sustainable Equilibria I Myerson (1996) argued informally for a new refinement concept that he

Neural Nets for Adaptive Filter and Adaptive Neural Nets as Adaptive Filters Pattern Recognition

Adaptive Control Chapter 1: Introduction to Adaptive Control Adaptive Control Landau, Lozano,

Adaptive Control Chapter 11: Direct Adaptive Control 1 Adaptive Control Landau, Lozano,

Adaptive Control Chapter 12: Indirect Adaptive Control 1 Adaptive Control Landau, Lozano,

Equilibria in large one-arm bandit games A. Salomon Universit e Paris 13 HEC Paris November

From Cliques to Equilibria: From Cliques to Equilibria: The Dominant- -Set Framework for

Fast Bayesian automatic Fast Bayesian automatic adaptive quadrature adaptive quadrature Gh.

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

Women en in in Agr gricu icultu lture: e: my my journey and our future Caroline Wardrop

Adaptive Control Chapter 13: Multimodel adaptive control with switching Chapter 13: Multimodel

Adaptive Control Chapter 14: Adaptive regulation Rejection of unknown disturbances 1

Convergence of Adaptive and Interacting MCMC algorithms Gersende FORT LTCI / CNRS - TELECOM

Incremental Maintenance for Non-Distributive Aggregate Functions Themis Palpanas (U of Toronto)

TASM: Top- k Approximate Subtree Matching Nikolaus Augsten 1 Denilson Barbosa 2 ohlen 3 Themis

Riding the Big IoT Data Wave Complex Analytics for IoT Data Series Themis Palpanas Paris

MarsSIpulami V3.0 E-Mars Web-GIS data server Sommaire 1. Context 2. Design architecture 3.

On the origin of plasma sheet evolution during the substorm growth phase Evgeny Gordeev (SPSU)

WORKING IN THE FIELD The use of technology in conservation Dr Liz Greengrass, Head of

What are IoTs? 2 More Than The Sum Of Its Things SEC 20 What are IoT Middlewares? 3 More

The Carers (Scotland)Act 2016 (implemented 1 April 2018) Eileen McShane Senior Officer