negative momentum for improved game dynamics
play

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , - PowerPoint PPT Presentation

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , Reyhane Askari Hemmat*, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas *equal contribution Simple Min-max smooth game: Gradient dynamic:


  1. Negative Momentum for Improved Game Dynamics Gauthier Gidel* , Reyhane Askari Hemmat*, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas *equal contribution

  2. Simple Min-max smooth game: Gradient dynamic: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  3. Simple Min-max smooth game: Gradient dynamic: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  4. Simple Min-max smooth game: Gradient dynamic: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  5. Way to optimize bilinear games Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  6. Way to optimize bilinear games Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  7. Way to optimize bilinear games Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  8. Way to optimize bilinear games Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  9. Way to optimize bilinear games (Improvements) > (Improvements) > This talk Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  10. General 2 player games: Two players aim to minimize their respective cost functions: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  11. General 2 player games: Two players aim to minimize their respective cost functions: Examples: Simple class of zero-sum games: ( ) ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  12. General 2 player games: Two players aim to minimize their respective cost functions: Examples: Simple class of zero-sum games: ( ) ● Generative Adversarial Networks: ● (non-saturating GAN from Goodfellow et al. 2014) Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  13. General 2 player games: Two players aim to minimize their respective cost functions: Dynamics of gradient based method depends on the gradient vector fields: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  14. General 2 player games: Two players aim to minimize their respective cost functions: Dynamics of gradient based method depends on the gradient vector fields: And its associated Jacobian, Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  15. Fixed point dynamics Gradient method is defined as the repetition of the operator: Thus, the sequence computed is Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  16. Fixed point dynamics Gradient method is defined as the repetition of the operator: Thus, the sequence computed is We aim to converge to a Nash Equilibrium : Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  17. Tuning the step size Jacobian of our fixed point operator: To have fixed point we need to be definite positive. ● Thus, small enough step-size Eigenvalues in the unit disk. ● Want to find optimal step-size. ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  18. Fixed point dynamics Jacobian of our fixed point operator: Local convergence. ● Stationary point may not be a Nash equilibrium. (See Adolphs et al. 2018) ● But any Nash equilibrium is an stationary point. ● In this talk: local results on stationary points. ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  19. Tuning the step size Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  20. Negative Momentum Recall Polyak’s momentum : Fixed point operator requires a state augmentation : (because need previous iterates) Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  21. Negative Momentum Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  22. Negative Momentum ● Fixed momentum. (- 0.25) ● Step-size is not fixed. Helps when the eigenvalue ● has large imaginary part. Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  23. What happens in practice ? Fashion MNIST: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  24. What happen in practice ? CIFAR-10: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  25. Negative Momentum To sum up: Negative momentum seems to improve the behaviour of the “bad” eigenvalues. ● If small enough seems to always help. ● It also allows larger step-size. ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  26. Thank you ! If you are interested in that topic: NIPS Workshop : Smooth Games Optimization and Machine Learning ● Co-organized with: Simon Lacoste-Julien · Ioannis Mitliagkas · Vasilis Syrgkanis · Eva Tardos · Leon Bottou · Sebastian Nowozin Soon : Call for contributions !!! Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Recommend


More recommend