Generative Adversarial Networks
presented by Ian Goodfellow
presentation co-developed with Aaron Courville
Deep Learning Workshop, ICML 2015
In today’s talk …
• “Generative Adversarial Networks”, Goodfellow et al., NIPS 2014
• “Conditional Generative Adversarial Nets”, Mirza and Osindero, NIPS Deep Learning Workshop 2014
• “On Distinguishability Criteria for Estimating Generative Models”, Goodfellow, ICLR Workshop 2015
• “Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks”, Denton, Chintala, et al., arXiv 2015
Generative modeling
• Have training examples x ~ p_data(x)
• Want a model that can draw samples: x ~ p_model(x)
• Where p_model ≈ p_data
[Figure: example images drawn from p_data(x) alongside samples drawn from p_model(x)]
Why generative models?
• Conditional generative models
  - Speech synthesis: Text ⇒ Speech
  - Machine Translation: French ⇒ English
    French: Si mon tonton tond ton tonton, ton tonton sera tondu.
    English: If my uncle shaves your uncle, your uncle will be shaved.
  - Image ⇒ Image segmentation
• Environment simulator
  - Reinforcement learning
  - Planning
• Leverage unlabeled data?
Maximum likelihood: the dominant approach
• ML objective function
$$\theta^* = \arg\max_\theta \frac{1}{m} \sum_{i=1}^{m} \log p\!\left(x^{(i)}; \theta\right)$$
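A connection not spelled out on the slide but useful for the rest of the talk: as the number of samples m grows, maximizing the average log-likelihood is equivalent to minimizing the KL divergence from the data distribution to the model,
$$\lim_{m \to \infty} \frac{1}{m} \sum_{i=1}^{m} \log p\!\left(x^{(i)}; \theta\right) = \mathbb{E}_{x \sim p_{\text{data}}}\!\left[\log p(x; \theta)\right] = -H(p_{\text{data}}) - D_{\mathrm{KL}}\!\left(p_{\text{data}} \,\|\, p_\theta\right),$$
and since the entropy term does not depend on θ, the ML objective is minimized KL divergence in disguise.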
Undirected graphical models
• Flagship undirected graphical model: Deep Boltzmann machines
• Several “hidden layers” h (a stack h^(1), h^(2), h^(3) above the visible units x)
$$p(h, x) = \frac{1}{Z}\, \tilde{p}(h, x), \qquad \tilde{p}(h, x) = \exp\!\left(-E(h, x)\right), \qquad Z = \sum_{h, x} \tilde{p}(h, x)$$
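For context (not on the slide): the maximum-likelihood gradient of such an energy-based model splits into a data-driven "positive phase" and a "negative phase" that requires samples from the model itself,
$$\frac{\partial}{\partial \theta} \log p(x) = -\,\mathbb{E}_{h \sim p(h \mid x)}\!\left[\frac{\partial E(h, x)}{\partial \theta}\right] + \mathbb{E}_{(h, x) \sim p(h, x)}\!\left[\frac{\partial E(h, x)}{\partial \theta}\right],$$
which is why the speed of mixing in the model's Markov chain, discussed on the next slide, matters so much for learning.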
Boltzmann Machines: disadvantage
• Model is badly parameterized for learning high-quality samples: peaked distributions -> slow mixing
• Why poor mixing? Coordinated flipping of low-level features
[Figure: MNIST dataset and 1st-layer features of an RBM]
Directed graphical models
$$p(x, h) = p(x \mid h^{(1)})\, p(h^{(1)} \mid h^{(2)}) \cdots p(h^{(L-1)} \mid h^{(L)})\, p(h^{(L)})$$
$$\frac{d}{d\theta_i} \log p(x) = \frac{1}{p(x)} \frac{d}{d\theta_i} p(x), \qquad p(x) = \sum_h p(x \mid h)\, p(h)$$
• Two problems:
  1. Summation over exponentially many states in h
  2. Posterior inference, i.e. calculating p(h | x), is intractable.
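A one-line reminder (not on the slide) of why problem 2 follows from problem 1: the posterior needs the same intractable sum in its denominator,
$$p(h \mid x) = \frac{p(x \mid h)\, p(h)}{\sum_{h'} p(x \mid h')\, p(h')}.$$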
Variational Autoencoder
[Figure: x sampled from data, together with noise, is passed through a differentiable encoder to give a sample z from q(z); a differentiable decoder maps z to E[x | z]]
Maximize $\log p(x) - D_{\mathrm{KL}}\!\left(q(z) \,\|\, p(z \mid x)\right)$
(Kingma and Welling, 2014; Rezende et al., 2014)
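As a concrete illustration (not code from the talk; the layer sizes and Bernoulli likelihood are illustrative assumptions), here is a minimal PyTorch sketch of one VAE training step, using the reparameterization trick so that gradients flow through the sampling of z:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

x_dim, z_dim, h_dim = 784, 20, 200   # illustrative sizes for MNIST-like binary inputs

enc = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU(), nn.Linear(h_dim, 2 * z_dim))  # q(z|x): outputs mean and log-variance
dec = nn.Sequential(nn.Linear(z_dim, h_dim), nn.ReLU(), nn.Linear(h_dim, x_dim))      # p(x|z): Bernoulli logits

def elbo(x):
    mu, logvar = enc(x).chunk(2, dim=1)
    z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)   # reparameterization: z = mu + sigma * noise
    logits = dec(z)
    rec = -F.binary_cross_entropy_with_logits(logits, x, reduction="none").sum(1)  # E_q[log p(x|z)]
    kl = 0.5 * (mu.pow(2) + logvar.exp() - logvar - 1).sum(1)                       # D_KL(q(z|x) || N(0, I))
    return (rec - kl).mean()   # lower bound on log p(x); maximize it

opt = torch.optim.Adam(list(enc.parameters()) + list(dec.parameters()), lr=1e-3)
x = torch.rand(64, x_dim).round()      # stand-in batch of binary data
loss = -elbo(x)
opt.zero_grad(); loss.backward(); opt.step()
```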
Generative stochastic networks
• General strategy: Do not write a formula for p(x), just learn to sample incrementally.
• Main issue: Subject to some of the same constraints on mixing as undirected graphical models.
(Bengio et al., 2013)
Generative adversarial networks
• Don’t write a formula for p(x), just learn to sample directly.
• No Markov chain
• No variational bound
• How? By playing a game.
Game theory: the basics
• N > 1 players
• Clearly defined set of actions each player can take
• Clearly defined relationship between actions and outcomes
• Clearly defined value of each outcome
• Can’t control the other player’s actions
Two-player zero-sum game
• Your winnings + your opponent’s winnings = 0
• Minimax theorem: a rational strategy exists for all such finite games
Two-player zero-sum game
• Strategy: specification of which moves you make in which circumstances.
• Equilibrium: each player’s strategy is the best possible given their opponent’s strategy.
• Example: Rock-paper-scissors
  - Mixed strategy equilibrium
  - Choose your action at random
Your payoff (rows: your move, columns: your opponent’s move):
            Rock   Paper   Scissors
Rock          0     -1        1
Paper         1      0       -1
Scissors     -1      1        0
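A quick check of the equilibrium claim (not on the slide): if you play uniformly at random, every column of the payoff table averages to zero, e.g.
$$\mathbb{E}[\text{payoff} \mid \text{opponent plays Rock}] = \tfrac{1}{3}(0 + 1 - 1) = 0,$$
and likewise for Paper and Scissors, so no opponent strategy can do better than the game's value of 0 against uniform random play.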
Adversarial nets framework
• A game between two players:
  1. Discriminator D
  2. Generator G
• D tries to discriminate between:
  - A sample from the data distribution.
  - And a sample from the generator G.
• G tries to “trick” D by generating samples that are hard for D to distinguish from data.
Adversarial nets framework
[Figure: two pipelines feed the same differentiable function D. Left: x sampled from data goes into D, which tries to output 1. Right: input noise z goes through the differentiable function G to produce x sampled from the model, which goes into D, and D tries to output 0.]
Zero-sum game
• Minimax value function:
$$\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}\!\left[\log D(x)\right] + \mathbb{E}_{z \sim p_z(z)}\!\left[\log\!\left(1 - D(G(z))\right)\right]$$
The first term measures the discriminator’s ability to recognize data as being real; the second measures its ability to recognize generator samples as being fake. The discriminator pushes the value up, the generator pushes it down.
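A minimal sketch of how this game is trained with alternating SGD (not code from the talk; the toy 1-D data, network sizes, and optimizer settings are illustrative assumptions):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

G = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 1))                       # generator: z -> x
D = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 1), nn.Sigmoid())         # discriminator: x -> P(real)

opt_D = torch.optim.Adam(D.parameters(), lr=1e-3)
opt_G = torch.optim.Adam(G.parameters(), lr=1e-3)
bce = nn.BCELoss()

for step in range(2000):
    x_real = 3.0 + torch.randn(64, 1)     # toy "data" distribution: N(3, 1)
    z = torch.randn(64, 8)
    x_fake = G(z)

    # Discriminator step: ascend log D(x) + log(1 - D(G(z))).
    d_loss = bce(D(x_real), torch.ones(64, 1)) + bce(D(x_fake.detach()), torch.zeros(64, 1))
    opt_D.zero_grad(); d_loss.backward(); opt_D.step()

    # Generator step: the minimax game says minimize log(1 - D(G(z)));
    # in practice the NIPS 2014 paper suggests maximizing log D(G(z)) instead (non-saturating heuristic).
    g_loss = bce(D(G(z)), torch.ones(64, 1))
    opt_G.zero_grad(); g_loss.backward(); opt_G.step()
```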
Discriminator strategy
• Optimal strategy for any p_model(x) is always
$$D(x) = \frac{p_{\text{data}}(x)}{p_{\text{data}}(x) + p_{\text{model}}(x)}$$
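A short derivation, following the NIPS 2014 paper: for a fixed generator, the discriminator maximizes
$$V(D, G) = \int_x \Big[\, p_{\text{data}}(x) \log D(x) + p_{\text{model}}(x) \log\!\left(1 - D(x)\right) \Big]\, dx,$$
and for each x the integrand $a \log y + b \log(1 - y)$, with $a = p_{\text{data}}(x)$ and $b = p_{\text{model}}(x)$, is maximized over $y \in (0, 1)$ at $y = a / (a + b)$, giving the closed form above.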
Learning process
[Figure: 1-D illustration showing the data distribution, the model distribution, and the discriminator output D(x) at successive stages: a poorly fit model, after updating D, after updating G, and finally the mixed strategy equilibrium.]
Theoretical properties
$$\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}\!\left[\log D(x)\right] + \mathbb{E}_{z \sim p_z(z)}\!\left[\log\!\left(1 - D(G(z))\right)\right]$$
• Theoretical properties (assuming infinite data, infinite model capacity, direct updating of generator’s distribution):
  - Unique global optimum.
  - Optimum corresponds to data distribution.
  - Convergence to optimum guaranteed.
• In practice: no proof that SGD converges
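Where the "unique global optimum" claim comes from (as in the NIPS 2014 paper): substituting the optimal discriminator into the value function gives
$$C(G) = \max_D V(D, G) = -\log 4 + 2\, \mathrm{JSD}\!\left(p_{\text{data}} \,\|\, p_{\text{model}}\right).$$
The Jensen-Shannon divergence is nonnegative and zero only when its arguments are equal, so C(G) has a unique global minimum of $-\log 4$, attained exactly when $p_{\text{model}} = p_{\text{data}}$.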
Oscillation (Alec Radford)
Visualization of model samples
[Figure: samples from MNIST, TFD, CIFAR-10 (fully connected model), and CIFAR-10 (convolutional model)]
Learned 2-D manifold of MNIST
Visualizing trajectories
1. Draw sample A
2. Draw sample B
3. Simulate samples along the path between A and B
4. Repeat steps 1-3 as desired.
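A sketch of how such trajectories can be produced (illustrative only; the generator here is a stand-in, not the network from the paper): interpolate between the two noise vectors and decode each intermediate point.

```python
import torch
import torch.nn as nn

z_dim = 100
G = nn.Sequential(nn.Linear(z_dim, 256), nn.ReLU(), nn.Linear(256, 784), nn.Sigmoid())  # stand-in generator

z_a = torch.randn(z_dim)            # step 1: latent code of sample A
z_b = torch.randn(z_dim)            # step 2: latent code of sample B
alphas = torch.linspace(0.0, 1.0, steps=9)
with torch.no_grad():
    # step 3: decode points along the straight line between z_a and z_b
    frames = torch.stack([G((1 - a) * z_a + a * z_b) for a in alphas])
print(frames.shape)  # (9, 784): nine images tracing a path from A to B
```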
Visualization of model trajectories
[Figure: trajectories on the MNIST digit dataset and the Toronto Face Dataset (TFD)]
Visualization of model trajectories
[Figure: trajectories on CIFAR-10 (convolutional model)]
GANs vs VAEs
• Both use backprop through continuous random number generation
• VAE:
  - generator gets direct output target
  - need REINFORCE to do discrete latent variables
  - possible underfitting due to variational approximation
  - gets global image composition right but blurs details
• GAN:
  - generator never sees the data
  - need REINFORCE to do discrete visible variables
  - possible underfitting due to non-convergence
  - gets local image features right but not global structure
VAE + GAN
[Figure: samples from a VAE alongside samples from a combined VAE+GAN]
- Reduce VAE blurriness
- Reduce GAN oscillation
(Alec Radford, 2015)
MMD-based generator nets
(Li et al., 2015; Dziugaite et al., 2015)
Supervised Generator Nets
Generator nets are powerful; it is our ability to infer a mapping from an unobserved space that is limited.
(Dosovitskiy et al., 2014)
General game
Extensions
• Inference net:
  - Learn a network to model p(z | x)
  - Wake/Sleep style approach:
    - Sample z from prior
    - Sample x from p(x | z)
    - Learn mapping from x to z
    - Infinite training set!
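A sketch of this wake/sleep-style procedure (illustrative; the encoder architecture and the squared-error regression loss are assumptions, not details from the talk): because we can draw (z, x = G(z)) pairs forever, the inference net effectively has an infinite training set.

```python
import torch
import torch.nn as nn

z_dim, x_dim = 100, 784
G = nn.Sequential(nn.Linear(z_dim, 256), nn.ReLU(), nn.Linear(256, x_dim))   # trained generator (stand-in)
E = nn.Sequential(nn.Linear(x_dim, 256), nn.ReLU(), nn.Linear(256, z_dim))   # inference net: x -> z

opt = torch.optim.Adam(E.parameters(), lr=1e-3)
for step in range(1000):
    z = torch.randn(64, z_dim)                 # sample z from the prior
    with torch.no_grad():
        x = G(z)                               # sample x from the generator, i.e. from p(x | z)
    loss = ((E(x) - z) ** 2).mean()            # learn the mapping from x back to z
    opt.zero_grad(); loss.backward(); opt.step()
```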
Extensions
• Conditional model:
  - Learn p(x | y)
  - Discriminator is trained on (x, y) pairs
  - Generator net gets y and z as input
  - Useful for: translation, speech synthesis, image segmentation.
(Mirza and Osindero, 2014)
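A sketch of the conditioning mechanism (illustrative; it follows the common recipe of concatenating a one-hot label, not necessarily the exact architecture of Mirza and Osindero): both networks simply receive y as an extra input.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

z_dim, x_dim, n_classes = 100, 784, 10

# Generator gets (z, y); discriminator gets (x, y).
G = nn.Sequential(nn.Linear(z_dim + n_classes, 256), nn.ReLU(), nn.Linear(256, x_dim))
D = nn.Sequential(nn.Linear(x_dim + n_classes, 256), nn.ReLU(), nn.Linear(256, 1), nn.Sigmoid())

y = F.one_hot(torch.randint(0, n_classes, (64,)), n_classes).float()
z = torch.randn(64, z_dim)
x_fake = G(torch.cat([z, y], dim=1))          # sample from p(x | y)
score = D(torch.cat([x_fake, y], dim=1))      # discriminator judges (x, y) pairs
print(x_fake.shape, score.shape)              # (64, 784), (64, 1)
```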