Algorithms for finding Nash Equilibria Ethan Kim School of Computer - PowerPoint PPT Presentation

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Algorithms for finding Nash Equilibria Ethan Kim School of Computer Science McGill University Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Outline 1 Definition of bimatrix games 2 Simplifications 3 Setting up polytopes 4 Lemke-Howson algorithm 5 Lifting simplifications Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Bimatrix Games • Given a bimatrix game ( A , B ) with m × n payoff matrices A and B , a mixed strategy for player 1 is a vector x ∈ R m with nonnegative components that sum to 1. For player 2, a mixed strategy is a vector y ∈ R n . • The support of a mixed strategy is the set of pure strategies that have positive probability. A best response to y is a mixed strategy x that maximizes the expected payof x T Ay , and vice versa. A Nash equilibrium is a pair of mutual best responses. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Best Response Condition Lemma A mixed strategy x is a best response to a mixed strategy y if and only if all pure strategies in its support are pure best responses to y (And vice versa). Proof. Let ( Ay ) i be the i th component of Ay , which is the expected payoff to player 1 when playing row i . Let u = max i ( Ay ) i . Then, x T Ay = � � � x i ( Ay ) i = x i ( u − ( u − ( Ay ) i )) = u − x i ( u − ( Ay ) i ) . i i i Since the sum � i x i ( u − ( Ay ) i ) is nonnegative (for x i ≥ 0, u − ( Ay ) i ≥ 0), x T Ay ≤ u . The expected payoff x T Ay achieves the maximum u iff that sum is 0. So if x i > 0, then ( Ay ) i = u . Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Some simplifications.. • Symmetry assumption: We first assume that the game is symmetric . So the payoff matrix C is an n × n matrix C = A = B T . Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Some simplifications.. • Symmetry assumption: We first assume that the game is symmetric . So the payoff matrix C is an n × n matrix C = A = B T . • Nondegeneracy assumption: A bimatrix game is nondegenerate if the # of pure best responses to any mixed strategy never exceeds the size of its support. → the submatrices induced by the supports are full-rank. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Some simplifications.. • Symmetry assumption: We first assume that the game is symmetric . So the payoff matrix C is an n × n matrix C = A = B T . • Nondegeneracy assumption: A bimatrix game is nondegenerate if the # of pure best responses to any mixed strategy never exceeds the size of its support. → the submatrices induced by the supports are full-rank. • So in a symmetric, nondegenerate game, a NE has support size equal to the # of pure best responses. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions An Example of Symmetric Games Consider the payoff matrices:  0 3 0   = A = B T C = 0 0 3  2 2 2 Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Best Response Condition gives a polyhedron.. • By the Best Response Condition, an equilibrium is given if any pure strategy is either a best response (to a mixed strategy) or is played with probability 0. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Best Response Condition gives a polyhedron.. • By the Best Response Condition, an equilibrium is given if any pure strategy is either a best response (to a mixed strategy) or is played with probability 0. • This can be captured by polytopes whose facets represent pure strategies, either as best responses, or having probability zero. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Best Response Polyhedron • Define the maximum expected payoff for a strategy x k for k ∈ N as: u = max { ( Ay ) k | k ∈ N } • A best response polyhedron of a player is the set of the player’s mixed strategies with the upper envelop of expected payoffs to the opponent . • E.g. For player 2, it is ( y 4 , y 5 , y 6 , u ) that fulfill the following: 0 y 4 + 3 y 5 + 0 y 6 ≤ u 0 y 4 + 0 y 5 + 3 y 6 ≤ u 2 y 4 + 2 y 5 + 2 y 6 ≤ u y 4 , y 5 , y 6 ≥ 0 y 4 + y 5 + y 6 = 1 Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Best Response Polyhedron In general, the set of mixed strategies are represented by the polyhedron: P = { ( x , u ) ∈ R N × R| x ≥ 0 , 1 T x = 1 , C T x ≤ 1 u } We can simplify this polyhedron, first by assuming: • C is nonnegative and has no zero column. • (We can do this by adding a constant to C ) Then, we will elimiate the payoff variable u . Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions From P to P .. • For P , we divide each inequaility � i ∈ N c ij x i ≤ u by u , which gives � i ∈ N c ij ( x i / u ) ≤ 1. • Treat each z i = x i / u as new variable, and call the resulting polyhedron P . We then have: P = { z ∈ R N | z ≥ 0 , C T z ≤ 1 } . • In effect: (1) the expected payoffs u are normalized to 1, and (2) the conditions 1 T x = 1 are dropped. • Non-zero vectors z ∈ P are converted back to probability 1 vectors by multiplying u = i z i , and this scaling factor u is � the expected payoff to the opponent. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions From P to P .. • The set P is in 1-1 correspondence with P − { 0 } with the map ( x , u ) �→ x · (1 / u ). (“projective transformations”) • Since binding inequality in P corresponds to a binding inequality in P , the transformation preserves face incidences. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Best Response Polytope • Because C is nonnegative & has no zero column, P is a bounded, fully dimensional polytope. • Because of nondegeneracy assumption, P is simple , i.e. every vertex lies on exactly N facets of the polytope. • A facet is obtained by making one of the inequalities binding , i.e. converting it to an equality. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Best Response Polytope We say a strategy i is represented at a vertex z , if either z i = 0, or C i z = 1, or both (i.e. At least one of the two inequalities for strategy i is tight at z .). Then: Theorem If a vertex z represents all strategies, then either z = 0 , or the corresponding ( x , x ) is a symmetric Nash. Proof. Assume z � = 0 . Then, the corresponding x = u · z is well defined, and x i ’s are nonnegative numbers adding to 1. To see ( x , x ) is a Nash, observe that x satisfies the Best Response Condition: for every positive x i ’s, C i z = 1. Thus, every support is a best response. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Lemke-Howson Algorithm • Finds a vertex z � = 0 , where every strategy is represented. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Lemke-Howson Algorithm • Finds a vertex z � = 0 , where every strategy is represented. • First, we label each facet of P by the strategy it represents: note that there are two facets (one for ( Cz ) i = 1 and the other for z i = 0). Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Lemke-Howson Algorithm • Finds a vertex z � = 0 , where every strategy is represented. • First, we label each facet of P by the strategy it represents: note that there are two facets (one for ( Cz ) i = 1 and the other for z i = 0). • Then, label each vertex by the labels of adjacent facets. Algorithms for finding Nash Equilibria

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Lemke-Howson Algorithm • Due to nondegeneracy, each vertex has precisely N adjacent facets, i.e. representing strategies. Algorithms for finding Nash Equilibria

Algorithms for finding Nash Equilibria Ethan Kim School of Computer - PowerPoint PPT Presentation

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Algorithms for finding Nash Equilibria Ethan Kim School of Computer Science McGill University Algorithms for finding Nash

Finding Nash Equilibria in Certain Classes of 2-Player Game Adrian Vetta McGill University

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI & University of

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI & University of

Sustainable Equilibria I Myerson (1996) argued informally for a new refinement concept that he

Computing Pure Nash Equilibria in Symmetric Action Graph Games Albert Xin Jiang Kevin

Uniqueness of Nash Equilibria in Atomic Splittable Congestion Games Veerle Timmermans Tobias

Oblivious AQM and Nash Equilibria Dutta, Goal and Heidmann In Proceedings of the IEEE Infocom,

Chemistry 2000 Slide Set 19b: Organic acids Acid dissociation equilibria Marc R. Roussel March

Finding Nash Equilibria in Dueling Games Dehghani, Gholami, Seddighin University of Maryland

The Weihrauch Degree of Finding Nash Equilibria in Multiplayer Games Tonicha Crook Joint work

COMPUTING A NASH EQUILIBRIUM Who cares?? If centralized, specially designed algorithms cannot

Concepts Game Theory MohammadAmin Fazli Algorithmic Game Theory 1 TOC Computing the Nash

Algorithmic Game Theory CoReLab (NTUA) Lecture 3: Tractability of Nash Equilibria PPAD

Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games H.L. Prasad

Harvard Medical School Editor in Chief Hepatology Up to Date Def Definition of of NASH NASH

Introduction to game theory Introduction to game theory Jie Gao Computer Science Department

Strategic Games. Famous because? Proving Nash. C D N players. n players. C (3,3) (0,5) Each

The Complexity of Pure Nash Equilibria Alex Fabrikant Christos Papadimitriou Kunal Talwar CS

Mini-course on Epistemic Game Theory Lecture 2: Nash Equilibrium Andrs Perea EPICENTER &

3.4. Bertrand Model Matilde Machado 3.4. Bertrand Model In Cournot, firms decide how

Game theory (Ch. 17.5) Find best strategy How do we formally find a Nash equilibrium? If it is

Multiagent Resource Allocation with Sharable Items: Simple Protocols and Nash Equilibria

Biased-Belief Equilibrium Yuval Heller (Bar Ilan) and Eyal Winter (Hebrew University) Bar Ilan,

Algorithms for finding Nash Equilibria Ethan Kim School of Computer - PowerPoint PPT Presentation

Introduction Simplifications Setting up polytopes Lemke-Howson Algorithm Lifting simplifications Conclusions Algorithms for finding Nash Equilibria Ethan Kim School of Computer Science McGill University Algorithms for finding Nash

Finding Nash Equilibria in Certain Classes of 2-Player Game Adrian Vetta McGill University

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI &amp; University of

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI &amp; University of

Sustainable Equilibria I Myerson (1996) argued informally for a new refinement concept that he

Computing Pure Nash Equilibria in Symmetric Action Graph Games Albert Xin Jiang Kevin

Uniqueness of Nash Equilibria in Atomic Splittable Congestion Games Veerle Timmermans Tobias

Oblivious AQM and Nash Equilibria Dutta, Goal and Heidmann In Proceedings of the IEEE Infocom,

Chemistry 2000 Slide Set 19b: Organic acids Acid dissociation equilibria Marc R. Roussel March

Finding Nash Equilibria in Dueling Games Dehghani, Gholami, Seddighin University of Maryland

The Weihrauch Degree of Finding Nash Equilibria in Multiplayer Games Tonicha Crook Joint work

COMPUTING A NASH EQUILIBRIUM Who cares?? If centralized, specially designed algorithms cannot

Concepts Game Theory MohammadAmin Fazli Algorithmic Game Theory 1 TOC Computing the Nash

Algorithmic Game Theory CoReLab (NTUA) Lecture 3: Tractability of Nash Equilibria PPAD

Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games H.L. Prasad

Harvard Medical School Editor in Chief Hepatology Up to Date Def Definition of of NASH NASH

Introduction to game theory Introduction to game theory Jie Gao Computer Science Department

Strategic Games. Famous because? Proving Nash. C D N players. n players. C (3,3) (0,5) Each

The Complexity of Pure Nash Equilibria Alex Fabrikant Christos Papadimitriou Kunal Talwar CS

Mini-course on Epistemic Game Theory Lecture 2: Nash Equilibrium Andrs Perea EPICENTER &amp;

3.4. Bertrand Model Matilde Machado 3.4. Bertrand Model In Cournot, firms decide how

Game theory (Ch. 17.5) Find best strategy How do we formally find a Nash equilibrium? If it is

Multiagent Resource Allocation with Sharable Items: Simple Protocols and Nash Equilibria

Biased-Belief Equilibrium Yuval Heller (Bar Ilan) and Eyal Winter (Hebrew University) Bar Ilan,

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI & University of

Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI & University of

Mini-course on Epistemic Game Theory Lecture 2: Nash Equilibrium Andrs Perea EPICENTER &