approximate nash equilibrium computation
play

Approximate Nash Equilibrium Computation Paul W. Goldberg 1 1 - PowerPoint PPT Presentation

Approximate Nash Equilibrium Computation Paul W. Goldberg 1 1 Department of Computer Science University of Oxford, U. K. Invited talk, WINE conference 12th Dec. 2015 Goldberg Approximate Nash Equilibrium Computation The computational challenge


  1. Approximate Nash Equilibrium Computation Paul W. Goldberg 1 1 Department of Computer Science University of Oxford, U. K. Invited talk, WINE conference 12th Dec. 2015 Goldberg Approximate Nash Equilibrium Computation

  2. The computational challenge Input: payoff matrices R , C of bimatrix game G Output: a Nash equilibrium of G Centralised. No “strategic data”. I don’t care about social welfare. Goldberg Approximate Nash Equilibrium Computation

  3. The computational challenge Input: payoff matrices R , C of bimatrix game G Output: a Nash equilibrium of G Centralised. No “strategic data”. I don’t care about social welfare. Two key (annoying?) features PPAD -complete pseudo-polytime (but not polytime) algorithm known for approximate version Goldberg Approximate Nash Equilibrium Computation

  4. The computational challenge Input: payoff matrices R , C of bimatrix game G Output: a Nash equilibrium of G Centralised. No “strategic data”. I don’t care about social welfare. Two key (annoying?) features PPAD -complete pseudo-polytime (but not polytime) algorithm known for approximate version Unusual answers for both exact and approx versions... Coincidence?? Goldberg Approximate Nash Equilibrium Computation

  5. The computational challenge Input: payoff matrices R , C of bimatrix game G Output: a Nash equilibrium of G Centralised. No “strategic data”. I don’t care about social welfare. Two key (annoying?) features PPAD -complete pseudo-polytime (but not polytime) algorithm known for approximate version Unusual answers for both exact and approx versions... Coincidence?? similar results for other classes of games. Goldberg Approximate Nash Equilibrium Computation

  6. PPAD -completeness Goldberg Approximate Nash Equilibrium Computation

  7. PPAD -completeness PPA ... what? — Papadimitriou Goldberg Approximate Nash Equilibrium Computation

  8. PPAD -completeness PPA ... what? — Papadimitriou Pathways to Equilibria: Pretty Pictures And Diagrams (PPAD) — von Stengel Goldberg Approximate Nash Equilibrium Computation

  9. PPAD -completeness PPA ... what? — Papadimitriou Pathways to Equilibria: Pretty Pictures And Diagrams (PPAD) — von Stengel My attempt at a talk title Goldberg Approximate Nash Equilibrium Computation

  10. PPAD -completeness PPA ... what? — Papadimitriou Pathways to Equilibria: Pretty Pictures And Diagrams (PPAD) — von Stengel My attempt at a talk title P rospects for P rogress in A pproximate Goldberg Approximate Nash Equilibrium Computation

  11. PPAD -completeness PPA ... what? — Papadimitriou Pathways to Equilibria: Pretty Pictures And Diagrams (PPAD) — von Stengel My attempt at a talk title P rospects for P rogress in A pproximate D uo.../ D ouble... (can’t think of a good word) Goldberg Approximate Nash Equilibrium Computation

  12. PPAD -completeness PPA ... what? — Papadimitriou Pathways to Equilibria: Pretty Pictures And Diagrams (PPAD) — von Stengel My attempt at a talk title P rospects for P rogress in A pproximate D uo.../ D ouble... (can’t think of a good word) In fact, something like “ P olynomial P arity A rgument on a graph, D irected version” Papadimitriou, C.H.: On the Complexity of the Parity Argument and Other Inefficient Proofs of Existence, J. Comput. Syst. Sci. (1994) Goldberg Approximate Nash Equilibrium Computation

  13. What’s wrong with NP -completeness? NP -complete problems have yes-instances and no-instances... Goldberg Approximate Nash Equilibrium Computation

  14. What’s wrong with NP -completeness? NP -complete problems have yes-instances and no-instances... In searching for a Nash equilibrium, every instance (game) is a yes-instance! Every game has one (Nash 1951), and a suggested NE is easy to check. Goldberg Approximate Nash Equilibrium Computation

  15. What’s wrong with NP -completeness? NP -complete problems have yes-instances and no-instances... In searching for a Nash equilibrium, every instance (game) is a yes-instance! Every game has one (Nash 1951), and a suggested NE is easy to check. reduce from (say) SAT to NASH: what should happen to the no-instances? It’s conceivable some other “NASH is as hard as NP ” proof could exist... Goldberg Approximate Nash Equilibrium Computation

  16. TFNP : total function computation in NP NASH ∈ TFNP : “TF”: every game has an outcome “NP”: a “transparent”, easily-checkable outcome Best-known “hard” TFNP problem: FACTORING — given a number, output its prime factorisation; hardness needed for much crypto Goldberg Approximate Nash Equilibrium Computation

  17. TFNP : total function computation in NP NASH ∈ TFNP : “TF”: every game has an outcome “NP”: a “transparent”, easily-checkable outcome Best-known “hard” TFNP problem: FACTORING — given a number, output its prime factorisation; hardness needed for much crypto Hard TFNP problems: an unhappy family Happy families are all alike; every unhappy family is unhappy in its own way. — Leo Tolstoy For our purposes: NP -complete problems are all alike; every hard TFNP problem is hard in its own way. — don’t quote me Work in progress on this... Goldberg Approximate Nash Equilibrium Computation

  18. PPAD : a happy subfamily of TFNP END OF THE LINE Circuits Succ and Pred; n inputs, n outputs; graph on 2 n vertices with arc from u to v iff Succ( u )= v , Pred( v )= u Given 0 has successor but no predecessor, find another vertex of degree 1. Goldberg Approximate Nash Equilibrium Computation

  19. PPAD : a happy subfamily of TFNP PPA : same sort of thing, but undirected graph As it happens, FACTORING belongs to PPA , related to PPAD ... Emil Jeˇ r´ abek: Integer factoring and modular square roots J. Comput. Syst. Sci. , to appear suggestive —but only suggestive— that PPAD is hard Digression: oracle model of PPAD assumes query access to functions Succ and Pred:2 n → 2 n . Query complexity of search for a solution is poly in the circuit model but not in the oracle model. There are oracle separation results for PPAD and other subclasses of TFNP Goldberg Approximate Nash Equilibrium Computation

  20. From NASH to ǫ -NASH: Bounded rationality fixes irrationality With 3 players, NE may have irrational values (Nash ’51); ....even for 3-player, 2-strategy anonymous games G and Turchetta: Query Complexity of Approximate Equilibria in Anonymous Games, these proceedings and in general, for any k > 2 players, n strategies, algebraic degree of values may be exponential in n ... also for graphical games ǫ -Nash equilibrium No incentive —————– ≤ ǫ incentive to deviate — solution can have values that are multiples of ǫ/ kn ∈ Q . To be meaningful, assume payoffs in some bounded range, usually [0 , 1]. Negative (hardness) results carry over to exact NE (useful for first PPAD -hardness results) Goldberg Approximate Nash Equilibrium Computation

  21. ǫ -NASH versus ǫ - Well-Supported NASH ǫ -NASH: average payoff is worse than best-response by at most ǫ — but player may do much worse, with low probability ǫ -WSNE ( stronger! ): anything a player does with positive probability, pays at most ǫ less than best-response. The support of a probability distribution is the set of events that get non-zero probability — for a mixed strategy, all the pure strategies that may get chosen. i.e. anything in the support of a player’s mixed strategy, is within ǫ of best Goldberg Approximate Nash Equilibrium Computation

  22. A good start: ǫ = 1 2 in poly time Daskalakis, Mehta, and Papadimitriou: A note on approximate Nash equilibria. WINE ’07; TCS 2009 0.2 0.9 0.2 1 0 0.1 0.2 2 0.2 0.1 0.2 0.3 0.4 0.5 0.2 0.2 0.8 0.6 0.7 0.8 1 Player 1 chooses arbitrary strategy i ; gives it probability 1 2 . Goldberg Approximate Nash Equilibrium Computation

  23. A good start: ǫ = 1 2 in poly time Daskalakis, Mehta, and Papadimitriou: A note on approximate Nash equilibria. WINE ’07; TCS 2009 1 0.2 0.9 0.2 1 0 0.1 0.2 2 0.2 0.1 0.2 0.3 0.4 0.5 0.2 0.2 0.8 0.6 0.7 0.8 1 Player 1 chooses arbitrary strategy i ; gives it probability 1 2 . 2 Player 2 chooses best response j ; gives it probability 1. Goldberg Approximate Nash Equilibrium Computation

  24. A good start: ǫ = 1 2 in poly time Daskalakis, Mehta, and Papadimitriou: A note on approximate Nash equilibria. WINE ’07; TCS 2009 1 0.2 0.9 0.2 1 0 0.1 0.2 2 0.2 0.1 0.2 0.3 0.4 0.5 0.2 0.2 0.8 1 0.6 0.7 0.8 2 1 Player 1 chooses arbitrary strategy i ; gives it probability 1 2 . 2 Player 2 chooses best response j ; gives it probability 1. 3 Player 1 finds best response k to j ; gives it probability 1 2 . They also find 5 6 -WSNE in poly-time... Goldberg Approximate Nash Equilibrium Computation

Recommend


More recommend