Some results on GAN dynamics Ioannis Mitliagkas Game dynamics are - PowerPoint PPT Presentation

Some results on GAN dynamics Ioannis Mitliagkas

Game dynamics are weird fascinating

Start with optimization dynamics

Optimization Smooth, differentiable cost function, L → Looking for stationary (fixed) points (gradient is 0) → Gradient descent

Optimization Ferenc Huszar Conservative vector field → Straightforward dynamics

Gradient descent Fixed-point analysis Conservative vector field → Jacobian of operator Straightforward dynamics Hessian of objective, L

Local convergence Eigenvalues of op. Jacobian Jacobian of operator If ρ ( θ *)=max | λ ( θ *)|<1, then fast local convergence Hessian of objective, L Symmetric, real-eigenvalues

Implicit generative models ● Generative moment matching networks [Li et al. 2017] Other, domain-specific losses can be used ● ● Variational AutoEncoders [Kingma, Welling, 2014] Autoregressive models (PixelRNN [van den Oord, 2016]) ●

Generative Adversarial Networks Both differentiable Generator network, G Given latent code, z, produces sample G(z) Discriminator network, D Given sample x or G(z), estimates probability it is real

Generative Adversarial Networks

Games Smooth, differentiable L Nash Equilibrium → Looking for local Nash equil. → Gradient descent → Simultaneous → Alternating

Game dynamics Non-conservative vector field → Rotational dynamics

Game dynamics under gradient descent Jacobian is non-symmetric, with complex eigenvalues → Rotations in decision space Games demonstrate rotational dynamics.

The Numerics of GANs by Mescheder, Nowozin, Geiger

A word on notation and formulation Maximization vs minimization Step size Warning:

Eigen-analysis, gradient descent

The Numerics of GANs

Make vector field “more conservative” Idea 1: Minimize the norm of the gradient

Idea 1: Minimize vector field norm

Idea 2: use L as regularizer

Other ways to control these rotations?

Momentum (heavy ball, Polyak 1964) Jacobian of momentum operator Non-symmetric, with complex eigenvalues → Rotations in augmented state-space

Summary Positive momentum can be bad for adversarial games Practice that was very common when GANs were first invented. → Recent work reduced the momentum parameter. → Not an accident

Negative Momentum for Improved Game Dynamics Gidel, Askari Hemmat, Pezeshki, Huang, Lepriol, Lacoste-Julien, Mitliagkas AISTATS 2019

Our results Negative momentum is optimal on simple bilinear game Negative momentum values are locally preferrable near 0 on a more general class of games Negative momentum is empirically best for certain zero sum games like “saturating GANs’’

Momentum on games Recall Polyak’s momentum (on top of simultaneous grad. desc.): Fixed point operator requires a state augmentation : (because we need previous iterate)

Bilinear game

“Proof by picture” Gradient descent → Simultaneous → Alternating Momentum → Positive → Negative

General games

Eigen-analysis, 0 momentum

Zero vs negative momentum Momentum → Zero → Negative

Negative Momentum

Empirical results

What happens in practice ? Fashion MNIST:

What happens in practice ? CIFAR-10:

Negative Momentum To sum up: Negative momentum seems to improve the behaviour due to ● “bad” eigenvalues. Optimal for a class of games ● Empirically optimal on “saturating” GANs ●

Some results on GAN dynamics Ioannis Mitliagkas Game dynamics are - PowerPoint PPT Presentation

Some results on GAN dynamics Ioannis Mitliagkas Game dynamics are weird fascinating Start with optimization dynamics Optimization Smooth, differentiable cost function, L Looking for stationary (fixed) points (gradient is 0)

Bridging Theory and Practice of GANs MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google

GANs for Creativity and Design MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

GANs for Limited Labeled Data MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Generative Adversarial Networks MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Adversarial Machine Learning MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Adversarial Machine Learning MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Adversarial Machine Learning MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Generative Adversarial Networks MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Le GaN dans les systmes militaires GaN in military systems Francis Doukhan

Simulating GaN Based Devices Optical and Electrical GaN Device Simulations Contents

Introduction to GANs LSGAN SAGAN MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Introduction to GANs LSGAN SAGAN MIX+GAN Ian Goodfellow, Sta ff Research Scientist, Google Brain

Wasserstein GAN Martin Arjovsky, Soumith Chintala, Lon Bottou, ICML 2017 Presented by Yaochen

650V GaN Technology for a New Generation of OBCs, DC/DCs and Inverters in EVs Luc Van de Perre

A Macro Automates your Table 1 Geliang Gan Biostatistician YCAS geliang.gan@yale.edu Contents

Characterization of N-polar GaN/AlGaN/ GaN HEMTs on Sapphire by Metal Organic Chemical Vapor

Morse-Conley theory for combinatorial vector fields P Q L M N O G H I J K C D E F A

Last time: vector fields Let C be the line segment from (0 , 0) to (1 , 2). Consider the vector

Spatial Data: 3D Scalar Fields CSC444 Recap: 2D contouring

MA 105: Calculus Lecture 20 . Prof. B.V. Limaye IIT Bombay Friday, 17 March 2017 B.V. Limaye

Math 5490 11/10/2014 Dynamical Systems Math 5490 Nonlinear Systems November 10, 2014 dx dt

HAMILTONIAN VECTOR FIELDS, OBSERVABLES AND LIE SERIES Bronis law Jakubczyk Institute of

Homotopy Poisson actions Rajan Mehta November 8, 2010 Conventional perspectives Definition A

Effective Field Theory of Anisotroic Inflation Asieh Karami IPM - School of Astronomy GC2018