cumulated effects in learning
play

Cumulated Effects in Learning rik Martin-Dorel Sergei Soloviev - PowerPoint PPT Presentation

Cumulated Effects in Learning rik Martin-Dorel Sergei Soloviev ACADIE team IRIT laboratory Universit de Toulouse 30 March 2018 1 / 18 rik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning Random Boolean games Cumulative effects


  1. Cumulated Effects in Learning Érik Martin-Dorel Sergei Soloviev ACADIE team IRIT laboratory Université de Toulouse 30 March 2018 1 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  2. Random Boolean games Cumulative effects in learning Positioning of this talk Starting point: a game theory result, formally verified in Coq Methodology: probabilistic analysis of entire classes of games Objective: discuss implications of this result, possible generalizations and connection with learning 2 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  3. Random Boolean games Cumulative effects in learning Outline A Coq theory of bool. games with random formulas as payoff functions 1 Discussion on modeling cumulative effects in learning 2 3 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  4. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  5. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  6. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): Strategies of players are vectors of bits. In a 2-player setting: Alice controls k bits Bob controls n − k bits 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  7. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): Strategies of players are vectors of bits. In a 2-player setting: Alice controls k bits Bob controls n − k bits � Game represented by a Boolean function F : 2 k × 2 n − k → 2 � �� � ≃ 2 n 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  8. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): Strategies of players are vectors of bits. In a 2-player setting: Alice controls k bits Bob controls n − k bits � Game represented by a Boolean function F : 2 k × 2 n − k → 2 � �� � ≃ 2 n Alice wins (with strat. a ) against Bob (with strat. b ) iff F ( a, b ) = 1 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  9. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  10. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  11. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games Assume the type of a game is known, but not its parameters in advance Randomly pick a game in the considered class Estimate the probability of various situations (A has a winning strategy, B has a winning strategy, no player has a winning strategy. . . ) 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  12. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games Assume the type of a game is known, but not its parameters in advance Randomly pick a game in the considered class Estimate the probability of various situations (A has a winning strategy, B has a winning strategy, no player has a winning strategy. . . ) The size of the considered class (number of n -var. Boolean functions) grows very fast ( 2 2 n ) � exhaustive computation does not scale 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  13. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games Assume the type of a game is known, but not its parameters in advance Randomly pick a game in the considered class Estimate the probability of various situations (A has a winning strategy, B has a winning strategy, no player has a winning strategy. . . ) The size of the considered class (number of n -var. Boolean functions) grows very fast ( 2 2 n ) � exhaustive computation does not scale Derive symbolic results (that can then be numerically evaluated) 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  14. Random Boolean games Cumulative effects in learning Overview of the library Coq library “ RandBoolGames ” Based on SSReflect/MathComp ( fintype , finfun , finset , bigop ) as well as on the infotheo library [Affeldt et al.] 3.1k lines of Coq code – automation: introduce a new tactic “ under ” https://sourcesup.renater.fr/coq-bool-games/ 6 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  15. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  16. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  17. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? Prior choice: assign proba. to bool. functions or to bool. formulas? 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  18. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? Prior choice: assign proba. to bool. functions or to bool. formulas? 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  19. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? Prior choice: assign proba. to bool. functions or to bool. formulas? First step: consider any P : S → [0 , 1] 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  20. Random Boolean games Cumulative effects in learning First result Theorem ( Pr_ex_winA ) For any finite probability space (Ω , S , P ) , the probability that there exists some strategy a = ( a 1 , . . . , a k ) of A that is winning satisfies: � � � 2 k � � ( − 1) m − 1 P ( ∃ a. win A ( a )) = P W a , m =1 a ∈ J J ⊆ 2 k Card J = m denoting for any a ∈ 2 k , win A ( a ) := ∀ b ∈ 2 n − k . F ( a, b ) = 1 ω a := { v ∈ 2 n | v 1 = a 1 ∧ · · · ∧ v k = a k } ∈ Ω W a := { ω ∈ Ω | ω a ⊆ ω } ∈ S . 8 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  21. Random Boolean games Cumulative effects in learning Specialization of the probability setting We define P n ; p as a finite Bernoulli process: we construct bool. functions F from the truth-set F − 1 ( { 1 } ) as follows: 9 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  22. Random Boolean games Cumulative effects in learning Specialization of the probability setting We define P n ; p as a finite Bernoulli process: we construct bool. functions F from the truth-set F − 1 ( { 1 } ) as follows: we consider all vectors v ∈ 2 n for each v , we decide with proba. p if it belongs to the truth-set of F � 2 n independent Bernoulli trials with parameter p 9 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  23. Random Boolean games Cumulative effects in learning Specialization of the probability setting We define P n ; p as a finite Bernoulli process: we construct bool. functions F from the truth-set F − 1 ( { 1 } ) as follows: we consider all vectors v ∈ 2 n for each v , we decide with proba. p if it belongs to the truth-set of F � 2 n independent Bernoulli trials with parameter p Remark This setting subsumes the simpler case where all functions have the same elementary probability (just take p = 1 2 ) 9 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  24. Random Boolean games Cumulative effects in learning Results I Theorem ( Pr_ex_winA_Bern ) For all p ∈ [0 , 1] and for all integers n , k , the probability that player A has a winning strategy is: 1 − p 2 n − k � 2 k � P n ; p ( ∃ a ∈ 2 k . win A ( a )) = 1 − . 10 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

Recommend


More recommend