a game in normal form consists of
play

A game in normal form consists of: Set of players = {1, , } - PowerPoint PPT Presentation

T RUTH J USTICE A LGOS Game Theory I: Basic Concepts Teachers: Ariel Procaccia (this time) and Alex Psomas NORMAL-FORM GAME A game in normal form consists of: Set of players = {1, , } Strategy set For each


  1. T RUTH J USTICE A LGOS Game Theory I: Basic Concepts Teachers: Ariel Procaccia (this time) and Alex Psomas

  2. NORMAL-FORM GAME β€’ A game in normal form consists of: β—¦ Set of players 𝑂 = {1, … , π‘œ} β—¦ Strategy set 𝑇 β—¦ For each 𝑗 ∈ 𝑂 , utility function 𝑣 𝑗 : 𝑇 π‘œ β†’ ℝ : if each j ∈ 𝑂 plays the strategy 𝑑 π‘˜ ∈ 𝑇 , the utility of player 𝑗 is 𝑣 𝑗 (𝑑 1 , … , 𝑑 π‘œ )

  3. THE PRISONER’S DILEMMA β€’ Two men are charged with a crime β€’ They are told that: β—¦ If one rats out and the other does not, the rat will be freed, other jailed for nine years β—¦ If both rat out, both will be jailed for six years β€’ They also know that if neither rats out, both will be jailed for one year

  4. THE PRISONER’S DILEMMA Cooperate Defect -1,-1 -9,0 Cooperate Defect 0,-9 -6,-6 What would you do?

  5. ON TV http://youtu.be/S0qjK3TWZE8

  6. THE PROFESSOR’S DILEMMA Class Listen Sleep 10 6 ,10 6 -10,0 Professor Make effort Slack off 0,-10 0,0 Dominant strategies?

  7. NASH EQUILIBRIUM β€’ In a Nash equilibrium, no player wants to unilaterally deviate β€’ E ach player’s strategy is a best response to strategies of others β€’ Formally, a Nash equilibrium is a vector of strategies 𝒕 = 𝑑 1 … , 𝑑 π‘œ ∈ 𝑇 π‘œ such β€² ∈ 𝑇, that for all 𝑗 ∈ 𝑂, 𝑑 𝑗 β€² , 𝑑 𝑗+1 , … , 𝑑 π‘œ ) 𝑣 𝑗 𝒕 β‰₯ 𝑣 𝑗 (𝑑 1 , … , 𝑑 π‘—βˆ’1 , 𝑑 𝑗

  8. THE PROFESSOR’S DILEMMA Class Listen Sleep 10 6 ,10 6 -10,0 Professor Make effort Slack off 0,-10 0,0 Nash equilibria?

  9. ROCK-PAPER-SCISSORS R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0 Nash equilibria?

  10. MIXED STRATEGIES β€’ A mixed strategy is a probability distribution over (pure) strategies β€’ The mixed strategy of player 𝑗 ∈ 𝑂 is 𝑦 𝑗 , where 𝑦 𝑗 (𝑑 𝑗 ) = Pr[𝑗 plays 𝑑 𝑗 ] β€’ The utility of player 𝑗 ∈ 𝑂 is π‘œ 𝑣 𝑗 𝑦 1 , … , 𝑦 π‘œ = ෍ 𝑣 𝑗 𝑑 1 , … , 𝑑 π‘œ β‹… ΰ·‘ 𝑦 π‘˜ (𝑑 π‘˜ ) (𝑑 1 ,…,𝑑 π‘œ )βˆˆπ‘‡ π‘œ π‘˜=1

  11. EXERCISE: MIXED NE 1 1 β€’ Exercise: player 1 plays 2 , 2 , 0 , player 2 1 1 plays 0, 2 , 2 . What is 𝑣 1 ? 1 1 1 β€’ Exercise: Both players play 3 , 3 , 3 . What is 𝑣 1 ? R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

  12. EXERCISE: MIXED NE Poll 1 ? Which is a NE? 1 1 1 1 1 1 1 1 1 1 1. 2 , 2 , 0 , 2 , 2 , 0 3. 3 , 3 , 3 , 3 , 3 , 3 1 1 1 1 1 2 2 1 2. 2 , 2 , 0 , 2 , 0, 4. 3 , 3 , 0 , 3 , 0, 2 3 R P S R 0,0 -1,1 1,-1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0

  13. NASH’S THEOREM β€’ Theorem [Nash, 1950]: In any (finite) game there exists at least one (possibly mixed) Nash equilibrium β€’ What about computing a Nash equilibrium? Stay tuned…

  14. DOES NE MAKE SENSE? β€’ Two players, strategies are {2, … , 100} β€’ If both choose the same number, that is what they get β€’ If one chooses 𝑑 , the other 𝑒 , and 𝑑 < 𝑒 , the former player gets 𝑑 + 2 , and the latter gets 𝑑 βˆ’ 2 β€’ Poll 2: What would you choose? 95 96 97 98 99 100

  15. CORRELATED EQUILIBRIUM β€’ Let 𝑂 = {1,2} for simplicity β€’ A mediator chooses a pair of strategies (𝑑 1 , 𝑑 2 ) according to a distribution π‘ž over 𝑇 2 β€’ Reveals 𝑑 1 to player 1 and 𝑑 2 to player 2 β€’ When player 1 gets 𝑑 1 ∈ 𝑇 , he knows the distribution over strategies of 2 is Pr 𝑑 2 𝑑 1 = Pr 𝑑 1 ∧ 𝑑 2 = π‘ž 𝑑 1 , 𝑑 2 Pr 𝑑 1 Pr[𝑑 1 ]

  16. CORRELATED EQUILIBRIUM β€² ∈ 𝑇 β€’ Player 1 is best responding if for all 𝑑 1 β€² , 𝑑 2 ) ෍ Pr 𝑑 2 𝑑 1 𝑣 1 𝑑 1 , 𝑑 2 β‰₯ ෍ Pr 𝑑 2 𝑑 1 𝑣 1 (𝑑 1 𝑑 2 βˆˆπ‘‡ 𝑑 2 βˆˆπ‘‡ β€’ Equivalently, β€² , 𝑑 2 ) ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 𝑑 1 , 𝑑 2 β‰₯ ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 (𝑑 1 𝑑 2 βˆˆπ‘‡ 𝑑 2 βˆˆπ‘‡ β€’ π‘ž is a correlated equilibrium (CE) if both players are best responding β€’ Every Nash equilibrium is a correlated equilibrium, but not vice versa

  17. GAME OF CHICKEN http://youtu.be/u7hZ9jKrwvo

  18. GAME OF CHICKEN β€’ Social welfare is the sum of utilities Dare Chicken β€’ Pure NE: (C,D) and 0,0 4,1 Dare (D,C), social welfare = 5 β€’ Mixed NE: both 1,4 3,3 (1/2,1/2), social Chicken welfare = 4 β€’ Optimal social welfare = 6

  19. GAME OF CHICKEN β€’ Correlated equilibrium: β—¦ (D,D): 0 Dare Chicken 1 β—¦ (D,C): 0,0 4,1 Dare 3 1 β—¦ (C,D): 3 1,4 3,3 Chicken 1 β—¦ (C,C): 3 16 β€’ Social welfare of CE = 3

  20. IMPLEMENTATION OF CE β€’ Instead of a mediator, use a hat! β€’ Balls in hat are labeled with β€œchicken” or β€œdare”, each blindfolded player takes a ball Poll 3 ? Which balls implement the distribution of the previous slide? 1. 1 chicken, 1 dare 3. 2 chicken, 1 dare 2. 1 chicken, 2 dare 4. 2 chicken, 2 dare

  21. CE AS LP β€’ Can compute CE via linear programming in polynomial time! find βˆ€π‘‘ 1 , 𝑑 2 ∈ 𝑇, π‘ž 𝑑 1 , 𝑑 2 β€² ∈ 𝑇, s.t .t. βˆ€π‘‘ 1 , 𝑑 1 β€² , 𝑑 2 ) ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 𝑑 1 , 𝑑 2 β‰₯ ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 1 (𝑑 1 𝑑 2 βˆˆπ‘‡ 𝑑 2 βˆˆπ‘‡ β€² ∈ 𝑇, βˆ€π‘‘ 2 , 𝑑 2 β€² ) ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 2 𝑑 1 , 𝑑 2 β‰₯ ෍ π‘ž 𝑑 1 , 𝑑 2 𝑣 2 (𝑑 1 , 𝑑 2 𝑑 1 βˆˆπ‘‡ 𝑑 1 βˆˆπ‘‡ ෍ π‘ž 𝑑 1 , 𝑑 2 = 1 𝑑 1 ,𝑑 2 βˆˆπ‘‡ βˆ€π‘‘ 1 , 𝑑 2 ∈ 𝑇, π‘ž 𝑑 1 , 𝑑 2 ∈ [0,1]

Recommend


More recommend