Negative Momentum for Improved Game Dynamics Gauthier Gidel* , - PowerPoint PPT Presentation

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , Reyhane Askari Hemmat*, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas *equal contribution

Simple Min-max smooth game: Gradient dynamic: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Way to optimize bilinear games Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Way to optimize bilinear games (Improvements) > (Improvements) > This talk Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

General 2 player games: Two players aim to minimize their respective cost functions: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

General 2 player games: Two players aim to minimize their respective cost functions: Examples: Simple class of zero-sum games: ( ) ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

General 2 player games: Two players aim to minimize their respective cost functions: Examples: Simple class of zero-sum games: ( ) ● Generative Adversarial Networks: ● (non-saturating GAN from Goodfellow et al. 2014) Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

General 2 player games: Two players aim to minimize their respective cost functions: Dynamics of gradient based method depends on the gradient vector fields: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

General 2 player games: Two players aim to minimize their respective cost functions: Dynamics of gradient based method depends on the gradient vector fields: And its associated Jacobian, Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Fixed point dynamics Gradient method is defined as the repetition of the operator: Thus, the sequence computed is Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Fixed point dynamics Gradient method is defined as the repetition of the operator: Thus, the sequence computed is We aim to converge to a Nash Equilibrium : Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Tuning the step size Jacobian of our fixed point operator: To have fixed point we need to be definite positive. ● Thus, small enough step-size Eigenvalues in the unit disk. ● Want to find optimal step-size. ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Fixed point dynamics Jacobian of our fixed point operator: Local convergence. ● Stationary point may not be a Nash equilibrium. (See Adolphs et al. 2018) ● But any Nash equilibrium is an stationary point. ● In this talk: local results on stationary points. ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Tuning the step size Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Negative Momentum Recall Polyak’s momentum : Fixed point operator requires a state augmentation : (because need previous iterates) Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Negative Momentum Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Negative Momentum ● Fixed momentum. (- 0.25) ● Step-size is not fixed. Helps when the eigenvalue ● has large imaginary part. Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

What happens in practice ? Fashion MNIST: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

What happen in practice ? CIFAR-10: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Negative Momentum To sum up: Negative momentum seems to improve the behaviour of the “bad” eigenvalues. ● If small enough seems to always help. ● It also allows larger step-size. ● Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Thank you ! If you are interested in that topic: NIPS Workshop : Smooth Games Optimization and Machine Learning ● Co-organized with: Simon Lacoste-Julien · Ioannis Mitliagkas · Vasilis Syrgkanis · Eva Tardos · Leon Bottou · Sebastian Nowozin Soon : Call for contributions !!! Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , - PowerPoint PPT Presentation

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , Reyhane Askari Hemmat, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas equal contribution Simple Min-max smooth game: Gradient dynamic:

Momentum and Conservation of Momentum Momentum Conservation of Momentum

e-Bug Junior Game Junior Game Game Style Game Process Demo Game Mechanics and

e-Bug Senior Game Senior Game Game Style Game Process Demo Game Puzzles and

Angular Momentum Angular Momentum Newtons Second Law Conservation of Angular

Algebra Based Physics Momentum 20160120 www.njctl.org Momentum Click on the topic to go to

Algebra Based Physics Momentum 2015-12-02 www.njctl.org Slide 3 / 65 Slide 4 / 65 Momentum

Game interoperability with functors functor AgsFun (structure Game : GAME) :> sig structure

ANGULAR MOMENTUM ANGULAR MOMENTUM AND THE ROLLING CHAIN AND THE ROLLING CHAIN Anthony Toljanich

Momentum Conservation of Momentum Types of Collisions Collisions in Two Dimensions Return

Momentum Conservation of Momentum Types of Collisions Collisions in Two Dimensions Return

Slide 1 / 135 Momentum Problems Slide 2 / 135 Momentum of a Single Object Slide 3 / 135 1

Universality of transverse-momentum Universality of transverse-momentum Universality of

Supporting Your Business Contents Company Profjle Our Objective Momentum Products

The Negative Marker in Romanian Negative Concord Gianina Iord achioaia Seminar f ur

VIDEOGAMES ARE A MESS Ian Bogost WHAT IS A GAME? Is a game a system of rules, or is a game a

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Inductive general game playing Andrew Cropper, Richard Evans, and Mark Law Learning game rules

League of Legends: Scaling to Millions of Ninjas, Yordles,

Language Understanding for Text-based Games Using Deep Reinforcement Learning Karthik

Exam review, Game of Life work time Turn in your questions on material in preparation for

Game Loops CIS 580 - Fundamentals of Game Programming Hangman Game Phases Game Loop

COMP558 Network Games Martin Gairing University of Liverpool, Computer Science Dept 2nd

A Generalized Model for Games of Cops and Robbers with Randomness Franois Laviolette Jose

Game Playing Part 2 Alpha-Beta Pruning Yingyu Liang yliang@cs.wisc.edu Computer Sciences

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , - PowerPoint PPT Presentation

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , Reyhane Askari Hemmat*, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas *equal contribution Simple Min-max smooth game: Gradient dynamic:

Momentum and Conservation of Momentum Momentum Conservation of Momentum

e-Bug Junior Game Junior Game Game Style Game Process Demo Game Mechanics and

e-Bug Senior Game Senior Game Game Style Game Process Demo Game Puzzles and

Angular Momentum Angular Momentum Newtons Second Law Conservation of Angular

Algebra Based Physics Momentum 20160120 www.njctl.org Momentum Click on the topic to go to

Algebra Based Physics Momentum 2015-12-02 www.njctl.org Slide 3 / 65 Slide 4 / 65 Momentum

Game interoperability with functors functor AgsFun (structure Game : GAME) :&gt; sig structure

ANGULAR MOMENTUM ANGULAR MOMENTUM AND THE ROLLING CHAIN AND THE ROLLING CHAIN Anthony Toljanich

Momentum Conservation of Momentum Types of Collisions Collisions in Two Dimensions Return

Momentum Conservation of Momentum Types of Collisions Collisions in Two Dimensions Return

Slide 1 / 135 Momentum Problems Slide 2 / 135 Momentum of a Single Object Slide 3 / 135 1

Universality of transverse-momentum Universality of transverse-momentum Universality of

Supporting Your Business Contents Company Profjle Our Objective Momentum Products

The Negative Marker in Romanian Negative Concord Gianina Iord achioaia Seminar f ur

VIDEOGAMES ARE A MESS Ian Bogost WHAT IS A GAME? Is a game a system of rules, or is a game a

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Inductive general game playing Andrew Cropper, Richard Evans, and Mark Law Learning game rules

League of Legends: Scaling to Millions of Ninjas, Yordles,

Language Understanding for Text-based Games Using Deep Reinforcement Learning Karthik

Exam review, Game of Life work time Turn in your questions on material in preparation for

Game Loops CIS 580 - Fundamentals of Game Programming Hangman Game Phases Game Loop

COMP558 Network Games Martin Gairing University of Liverpool, Computer Science Dept 2nd

A Generalized Model for Games of Cops and Robbers with Randomness Franois Laviolette Jose

Game Playing Part 2 Alpha-Beta Pruning Yingyu Liang yliang@cs.wisc.edu Computer Sciences

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , Reyhane Askari Hemmat, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas equal contribution Simple Min-max smooth game: Gradient dynamic:

Game interoperability with functors functor AgsFun (structure Game : GAME) :> sig structure