Emergent systems Spring-13 The Prisoners Dilemma - PDF document

Umeå University Department of Computing Science Emergent systems Spring-13 The Prisoners ´ Dilemma http://www.cs.umu.se/kurser/5DV017 Last time ❒ Evolutionary computation ❍ Overview ❒ Genetic programming ❒ Aspects of evolution ❒ Classifier systems ❒ General on cooperation ❒ Game theory 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Today ❒ Prisoners’ Dilemma and other dilemmas ❍ Iterated Prisoners’ Dilemma ❍ Ecological models ❍ Spatial models ❒ Short conclusion of the course 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 1

The Prisoners’ Dilemma ❒ Melvin Dresher and Merrill Flood, RAND Corporation, 1950 ❒ Further developed by Albert W. Tucker ❒ Used in philosophy, ethics, biology, sociology, political science, economics, game theory, computer science, mathematics, ... ❒ ” The E. Coli of social psychology ” - Axelrod 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU The Prisoners’ Dilemma ❒ The story ❍ Two criminals were caught, they have committed a crime but the police have insufficient evidence ❍ They can not communicate with each other ❍ If both plead guilty, each get 10 years ❍ If one plead guilty and accuses the other • The one that plead guilty goes free • The accused get 20 years ❍ If no one plead guilty, each get 1 year 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU The Prisoners’ Dilemma ❒ Payoff matrix Bob Cooperate Defect Cooperate -1, -1 -20, 0 Alice Defect 0, -20 -10, -10 ❒ Alice rational analysis (dominant strategy) ❍ If Bob cooperate – Defect ❍ If Bob defect – Defect ❍ (Same analysis for Bob) ❒ Dominant strategy equilibrium � 10 year each ❒ Make an irrational decision (coop) � 1 year each 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 2

The Prisoners’ Dilemma ❒ Summary ❍ Individual rationality is not optimal ❍ An example of a dilemma in game theory 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Dilemmas in game theory ❒ ”A situation requiring a choice between alternatives that are equivalent” ❒ ” Damned if you do, damned if you don’t ” ❒ In game theory: Each player acts rationally, but the result is not desirable 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Dilemmas in game theory ❒ General payoff matrix Bob Cooperate Defect CC (R) CD (S) Cooperate Reward Sucker’s payoff Alice DC (T) DD (P) Defect Temptation to Punishment defect 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 3

Bob Dilemmas Coop Def Coop CC CD Alice Def DC DD ❒ General terms of a dilemma ❍ You will always win if the other cooperates • CC > CD and DC > DD ❍ Sometimes you win by defecting • DC > CC or DD > CD ❍ Mutual cooperation is preferable • CC > DD ❒ 24 permutations but only 3 are dilemmas 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Bob Dilemmas Coop Def Coop CC CD Alice Def DC DD ❒ Prisoners’ Dilemma ❍ DC > CC > DD > CD ❍ Better to defect no matter what the other do ❍ Nash equilibrium is DD ❒ Chicken ❍ DC > CC > CD > DD ❍ Mutual defect is worst ❍ Two Nash equilibrium, DC and CD ❒ Stag Hunt ❍ CC > DC > DD > CD ❍ Best to cooperate with cooperators ❍ Two Nash equilibrium, CC and DD 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Iterated Prisoners’ Dilemma ❒ Assumptions ❍ No agreements or threats ❍ A player's next move can not be predicted ❍ No way to eliminate players or avoid interaction ❍ No way of changing the payoff ❍ Communication only via direct interaction 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 4

Iterated Prisoners’ Dilemma ❒ Axelrod’s experiment (1980) ❍ Intuitive assumption that future interactions may affect the rationality of the decision ❍ Round-robin tournament for strategies • All are competing against everyone, including itself • 200 iterations of PD – Should be unknown! • Each program/strategy can remember earlier actions • 14 program were received • CC = 3, CD = 0, DC = 5, DD = 1 Bob • (Requirement : DC + CD < 2 * CC) Coop Def Coop 3, 3 0, 5 Alice Def 5, 0 1, 1 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Iterated Prisoners’ Dilemma ❒ Expected payoff for three simple strategies ALL-C RAND ALL-D Medel ALL-C 3.0 1.5 0.0 1.5 RAND 4.0 2.0 0.5 2.167 ALL-D 5.0 3.0 1.0 3.0 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Iterated Prisoners’ Dilemma ❒ Result ❍ Winner: Rapoport’s Tit-For-Tat (TFT) ❍ Cooperate at first interaction ❍ Then, do what your opponent did the previous move ❒ Second experiment ❍ 62 program ❍ Everyone knew that TFT won last time ❍ TFT won again ❒ Tit-for-Two-Tats ❍ More forgiving than TFT ❍ TFT is better in a noisy environment ❍ Would have won the first experiment, but performed poorly in the second 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 5

Iterated Prisoners’ Dilemma ❒ Characteristics for a successful strategy ❍ Don’t be envious ❍ Be nice ❍ Reciprocate ❍ Don’t be too clever ❍ Be a generalist ❍ Agree with yourself ❍ Be an evolutionarily stable strategy • John Maynard Smith • Resistant against invasion by other strategies 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Ecological models ❒ What happens if we allow more successful strategies to spread in the population at the expense of less successful? ❒ Success is measured as a percentage of population ❒ Percentage of population = The probability that a random program uses this strategy ❒ On the board ❒ Simulation ❍ Fig 17.3a 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Pavlov – new strategy ❒ Pavlov (PAV) ❍ ”Win-Stay, Lose-Shift” ❍ Start with cooperation ❍ If the other cooperate, continue with the current behavior ❍ If the other defect, switch behavior ❍ Fig 17.3b 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 6

Noise ❒ An action is misinterpreted for some reason ❒ TFT ❍ Leads to a cycle of alternating CD and DC ❍ Broken only by a further error ❒ PAV ❍ Can self-correct itself : DC � DD � CC ❍ Can exploit ALL-C in an environment where errors can occur ❍ Fig 17.3c 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU IPD and GA ❒ Let the population evolve using genetic algorithms ❒ Simulation facts : ❍ N = 100 ❍ 50 bouts/individual/generation with each bout consisting of 20 PD rounds ❍ Random strategies initially ❍ Coding: Strings with five letters, each C or D First CC CD DC DD 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU IPD and GA (cont.) ❒ Result 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 7

Stimulating cooperation in PD ❒ Know something about the other ❍ Iterative ❍ Rumor ❒ Spatial ❍ Interact with neighbors ❍ Example • MANET • Society 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 2 x 2 game ❒ 2 players, 2 strategies ❒ Payoff matrix Coop Def Coop R = 1 S Def T P = 0 ❒ 12 permutations of R, S, T, P ❍ Fig 1 in the article by Hauert 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 2 x 2 game Well mixed populations ❒ Fig 2 in the article by Hauert ❒ Defection: S < 0, T > 1 a) • Stable fix point = 0 (proportion of cooperators) • The area for Prisoners’ Dilemma b) Co-existence: S > 0, T > 1 • Stable fix point = S / (S + T – 1) Bi-stability: S < 0, T < 1 c) • Instable fix point = S / (S + T – 1) • Stable fix points = 0, 1 d) Cooperation: S > 0, T < 1 • Stable fix point = 1 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 8

Spatial Prisoners’ Dilemma ❒ The overall payoff determines an individual's success and is determined by interaction with neighbors ❒ Each individual has a place in a grid ❒ In each generation N individuals are selected ❒ N = number of locations in the grid ❒ Every individual play against its neighbors and then each individual get the opportunity to update its strategy 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Spatial PD - Variations ❒ Updating of the grid ❍ Synchronous ❍ Asynchronous ❍ Delayed asynchronous ❒ Updating of individuals ❍ Best takes over (deterministic) ❍ Proportional updating ❒ Number of neighbors ❍ von Neumann – 4 ❍ Moore – 8 ❒ Initial frequency of cooperators ❍ 0.2, 0.5 or 0.8 ❒ Simulation 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU Spatial PD - Discussion ❒ Hauert (2001) ❍ Spatial extensions has an effect ❍ Differences in initial frequency of cooperators will not affect ❍ Increased stochasticity in updating the grid has surprisingly little impact ❍ Increased stochasticity in updating of individuals decrease cooperation 21/2 - 13 Emergent Systems, Jonny Pettersson, UmU 9

Emergent systems Spring-13 The Prisoners Dilemma - PDF document

Ume University Department of Computing Science Emergent systems Spring-13 The Prisoners Dilemma http://www.cs.umu.se/kurser/5DV017 Last time Evolutionary computation Overview Genetic programming Aspects of evolution

Emergent behaviour in virtual agents Emergent behaviour in virtual agents Colin Chibaya Colin

Emergent Distributed Bio-Organization: A Framework for Achieving Emergent Properties in

Emergent Invasive Plant Program A CNPS Chapter model for early detection and effective response to

Emergent Trilateralism in Developing Asia Emergent Trilateralism in Developing Asia Long Term

Emergent Trilateralism in Developing Asia Emergent Trilateralism in Developing Asia Long Term

1 Fractals Coined by Benoit Mandelbrot To differentiate from pure geometric figures

functional and QoS interoperability at the middleware layer Emergent mobile systems in the IoT

On Emergent Misbehavior John Rushby With help from Hermann Kopetz Computer Science Laboratory

Emergent a syst em C-kur s, 5 pong, HT-04 J onny Pet t ersson j onny@cs.umu.se 2/11 - 04

RICE OPTICS Land, Water, and Markets in Emergent Rural-Urban Topographies of Vietnam and Nigeria

Data on Data Breaches: Past, Present and Future Adam Shostack and Chris Walsh Emergent Chaos

Riel Miller, UNESCO, 2012 Riel Miller Head of Foresight Riel Miller, UNESCO, 2012 The Emergent

EMERGENT RESPONDING: GETTING MORE BANG FOR YOUR BUCK WHEN TEACHING VERBAL BEHAVIOR S A R A

through Glo lobal Lib iberal Arts Responsive, Emergent Vision and Effective, Collaborative

GRAVITY AS AN EMERGENT FORCE Erik Verlinde University of Amsterdam Emergence Current Paradigm

The Emergent Church and Last Days Ecumenism By Dr. Andy Woods Adapted from Roger Oakland, Faith

Bookkeeper Rule # perms bo 1 o 2 k 1 k 2 e 1 e 2 pe 3 r = 10! map perm o 1 be 1 o 2 k 1 rk

09 - Introduction to Tensors Data Mining and Matrices Universitt des Saarlandes, Saarbrcken

Lab 5: 16 th April 2012 Exercises on Neural Networks 1. What are the values of weights w 0 , w 1 ,

CSC2542 SAT-Based Planning Sheila McIlraith Department of Computer Science University of

Introduction and Goals of the Meeting Elizabeth Sexton-Kennedy 2nd Meeting of the International

IM2010: Operations Research Nonlinear Programming (Chapter 11) Ling-Chieh Kung Department of

Hidden Lake Vista -Montana Tufa Towers Mono Lake California Sylvan Lake ,Black Hills-South Dakota

Break ak Relax And Enjoy The Music And Lakes es Auto Advanc nce Vermillion Lake