Generative Adversarial Networks
Stefano Ermon, Aditya Grover
Stanford University
Lecture 10
Selected GANs
The GAN Zoo, a list of all named GANs: https://github.com/hindupuravinash/the-gan-zoo

Today
- Rich class of likelihood-free objectives via f-GANs
- Inferring latent representations via BiGAN
- Application: image-to-image translation via CycleGANs
Beyond KL and Jensen-Shannon Divergence
What choices do we have for d(·)?
- KL divergence: autoregressive models, flow models
- (scaled and shifted) Jensen-Shannon divergence: original GAN objective
f-divergences
Given two densities p and q, the f-divergence is given by
    D_f(p, q) = E_{x∼q}[ f(p(x)/q(x)) ]
where f is any convex, lower-semicontinuous function with f(1) = 0.
- Convex: the line joining any two points on the graph lies above the function
- Lower-semicontinuous: the function value at any point x_0 is close to f(x_0) or greater than f(x_0)
- Example: KL divergence, with f(u) = u log u
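As a concrete illustration (not part of the slides), the definition above can be estimated by Monte Carlo whenever the density ratio p(x)/q(x) is available. The sketch below uses two 1-D Gaussians and f(u) = u log u, so the estimate should match the closed-form KL divergence; the specific distributions and sample size are arbitrary choices.

```python
# Monte Carlo estimate of D_f(p, q) = E_{x~q}[ f(p(x)/q(x)) ] for f(u) = u log u,
# which recovers KL(p || q). p and q are 1-D Gaussians chosen only for illustration.
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
p = norm(loc=0.0, scale=1.0)   # density p(x)
q = norm(loc=1.0, scale=1.5)   # density q(x)

def f_kl(u):
    return u * np.log(u)       # convex, lower-semicontinuous, f(1) = 0

x = q.rvs(size=200_000, random_state=rng)   # samples from q
ratio = p.pdf(x) / q.pdf(x)                 # density ratio p(x)/q(x)
d_f_estimate = np.mean(f_kl(ratio))         # E_{x~q}[ f(p/q) ]

# Closed-form KL between the two Gaussians, for comparison.
kl_exact = np.log(1.5 / 1.0) + (1.0**2 + (0.0 - 1.0)**2) / (2 * 1.5**2) - 0.5
print(d_f_estimate, kl_exact)               # the two values should be close
```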
f-divergences
Many more f-divergences!
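For reference, a few standard generator functions f(u) can be written down directly; this is a hedged, partial list (not the full table from the f-GAN paper), and each choice satisfies f(1) = 0.

```python
# A handful of standard f-divergence generators, written as plain Python callables.
import numpy as np

f_generators = {
    "KL":                lambda u: u * np.log(u),
    "Reverse KL":        lambda u: -np.log(u),
    "Pearson chi^2":     lambda u: (u - 1.0) ** 2,
    "Total variation":   lambda u: 0.5 * np.abs(u - 1.0),
    "Squared Hellinger": lambda u: (np.sqrt(u) - 1.0) ** 2,
    # Scaled so that D_f equals the usual Jensen-Shannon divergence
    "Jensen-Shannon":    lambda u: 0.5 * (u * np.log(u) - (u + 1.0) * np.log((u + 1.0) / 2.0)),
}

# Each can be plugged into the Monte Carlo estimator above:
# D_f(p, q) ~ mean of f(p(x)/q(x)) over samples x ~ q.
```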
f-GAN: Variational Divergence Minimization
To use f-divergences as a two-sample test objective for likelihood-free learning, we need to be able to estimate them using only samples.
- Fenchel conjugate: for any function f(·), its convex conjugate is defined as
      f*(t) = sup_{u ∈ dom f} ( ut − f(u) )
- Duality: f** = f. When f(·) is convex and lower-semicontinuous, so is f*(·), and
      f(u) = sup_{t ∈ dom f*} ( tu − f*(t) )
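A small numeric check (an added illustration, not from the slides): for the KL generator f(u) = u log u the conjugate has the closed form f*(t) = exp(t − 1), and maximizing ut − f(u) over a grid of u values recovers it; the grid ranges below are arbitrary.

```python
# Numerically verify the Fenchel conjugate f*(t) = sup_u ( u*t - f(u) )
# and the duality f**(u) = f(u) for f(u) = u log u.
import numpy as np

def f(u):
    return u * np.log(u)

u_grid = np.linspace(1e-6, 50.0, 500_000)
for t in [-1.0, 0.0, 0.5, 1.0, 2.0]:
    approx = np.max(u_grid * t - f(u_grid))   # grid approximation of the sup over u
    exact = np.exp(t - 1.0)                   # closed-form conjugate
    print(t, approx, exact)                   # the two should match closely

# Duality check: f**(u) = sup_t ( t*u - f*(t) ) should recover f(u).
t_grid = np.linspace(-10.0, 10.0, 500_000)
u = 3.0
f_double_star = np.max(t_grid * u - np.exp(t_grid - 1.0))
print(f_double_star, f(u))                    # both should be close to 3 * log(3)
```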
f-GAN: Variational Divergence Minimization
We can obtain a lower bound on any f-divergence via its Fenchel conjugate:
    D_f(p, q) = E_{x∼q}[ f(p(x)/q(x)) ]
              = E_{x∼q}[ sup_{t ∈ dom f*} ( t · p(x)/q(x) − f*(t) ) ]
              = ∫_X sup_{t ∈ dom f*} ( t · p(x) − f*(t) · q(x) ) dx
              ≥ sup_{T ∈ 𝒯} ∫_X ( T(x) p(x) − f*(T(x)) q(x) ) dx
              = sup_{T ∈ 𝒯} ( E_{x∼p}[T(x)] − E_{x∼q}[f*(T(x))] )
where 𝒯 is an arbitrary class of functions T : X → ℝ.
Note: the lower bound is likelihood-free w.r.t. p and q.
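The sketch below (an added illustration, with p, q, and the test functions chosen arbitrarily) checks the bound numerically for the KL generator: any fixed T gives a lower bound on D_f(p, q), and the bound is tight at T(x) = f'(p(x)/q(x)) = 1 + log(p(x)/q(x)).

```python
# Numeric illustration of D_f(p, q) >= E_{x~p}[T(x)] - E_{x~q}[f*(T(x))]
# for f(u) = u log u, whose conjugate is f*(t) = exp(t - 1).
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
p, q = norm(0.0, 1.0), norm(1.0, 1.5)
xp = p.rvs(size=200_000, random_state=rng)   # samples from p
xq = q.rvs(size=200_000, random_state=rng)   # samples from q

def f_star(t):
    return np.exp(t - 1.0)

def bound(T):
    # Likelihood-free: uses only samples from p and q, plus the test function T.
    return np.mean(T(xp)) - np.mean(f_star(T(xq)))

T_opt = lambda x: 1.0 + np.log(p.pdf(x) / q.pdf(x))   # optimal T (uses densities, for checking only)
T_bad = lambda x: 0.2 * x                             # an arbitrary suboptimal T

kl_exact = np.log(1.5) + (1.0 + 1.0) / (2 * 1.5**2) - 0.5
print(bound(T_opt), bound(T_bad), kl_exact)
# bound(T_opt) is close to kl_exact; bound(T_bad) is a looser (smaller) lower bound.
```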
f-GAN: Variational Divergence Minimization
Variational lower bound:
    D_f(p, q) ≥ sup_{T ∈ 𝒯} ( E_{x∼p}[T(x)] − E_{x∼q}[f*(T(x))] )
- Choose any f-divergence
- Let p = p_data and q = p_G
- Parameterize T by φ and G by θ
- Consider the following f-GAN objective:
      min_θ max_φ  F(θ, φ) = E_{x∼p_data}[T_φ(x)] − E_{x∼p_{G_θ}}[f*(T_φ(x))]
- The generator G_θ tries to minimize the divergence estimate, and the discriminator T_φ tries to tighten the lower bound
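A hedged PyTorch sketch of this training loop is given below; the network architectures, optimizer settings, the toy data distribution, and the choice f(u) = u log u (so f*(t) = exp(t − 1)) are all illustrative assumptions rather than details from the lecture.

```python
import torch
import torch.nn as nn

def f_star(t):                      # conjugate of f(u) = u log u
    return torch.exp(t - 1.0)

data_dim, latent_dim, batch = 2, 16, 128
G = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(), nn.Linear(64, data_dim))   # generator G_theta
T = nn.Sequential(nn.Linear(data_dim, 64), nn.ReLU(), nn.Linear(64, 1))            # discriminator T_phi

opt_G = torch.optim.Adam(G.parameters(), lr=1e-4)
opt_T = torch.optim.Adam(T.parameters(), lr=1e-4)

def sample_data(n):                 # placeholder for p_data: a shifted Gaussian
    return torch.randn(n, data_dim) + 2.0

for step in range(5000):
    # Discriminator step: maximize E_data[T(x)] - E_G[f*(T(x))] (tighten the bound).
    x_real = sample_data(batch)
    x_fake = G(torch.randn(batch, latent_dim)).detach()
    obj = T(x_real).mean() - f_star(T(x_fake)).mean()
    opt_T.zero_grad()
    (-obj).backward()
    opt_T.step()

    # Generator step: minimize F(theta, phi); only the second term depends on theta,
    # so this amounts to maximizing E_G[f*(T(x))].
    x_fake = G(torch.randn(batch, latent_dim))
    gen_loss = -f_star(T(x_fake)).mean()
    opt_G.zero_grad()
    gen_loss.backward()
    opt_G.step()
```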
Inferring latent representations in GANs
- The generator of a GAN is typically a directed, latent-variable model with latent variables z and observed variables x. How can we infer the latent feature representations in a GAN?
- Unlike a normalizing flow model, the mapping G : z → x need not be invertible
- Unlike a variational autoencoder, there is no inference network q(·) that can learn a variational posterior over the latent variables
- Solution 1: for any point x, use the activations of the penultimate layer of the discriminator as a feature representation
- Intuition: as in supervised deep neural networks, the discriminator has to learn useful representations of x in order to distinguish real from fake x
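A minimal PyTorch sketch of Solution 1: expose the penultimate-layer activations of a discriminator and use them as features. The architecture and dimensions below are hypothetical.

```python
import torch
import torch.nn as nn

class Discriminator(nn.Module):
    def __init__(self, data_dim=784, hidden=256):
        super().__init__()
        self.features = nn.Sequential(          # everything up to the penultimate layer
            nn.Linear(data_dim, hidden), nn.LeakyReLU(0.2),
            nn.Linear(hidden, hidden), nn.LeakyReLU(0.2),
        )
        self.classifier = nn.Linear(hidden, 1)  # real / fake score

    def forward(self, x, return_features=False):
        h = self.features(x)
        return h if return_features else self.classifier(h)

D = Discriminator()
x = torch.randn(32, 784)                        # a batch of (flattened) inputs
feats = D(x, return_features=True)              # 32 x 256 feature representations
```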
Inferring latent representations in GANs
- If we want to directly infer the latent variables z of the generator, we need a different learning algorithm
- A regular GAN optimizes a two-sample test objective that compares samples of x from the generator and the data distribution
- Solution 2: to infer latent representations, we compare samples of (x, z) from the joint distributions of observed and latent variables under the model and under the data distribution
- For any x generated via the model, we have access to z (sampled from a simple prior p(z))
- For any x from the data distribution, however, z is unobserved (latent)
Bidirectional Generative Adversarial Networks (BiGAN)
- In a BiGAN, we have an encoder network E in addition to the generator network G
- The encoder network only observes x ∼ p_data(x) during training, to learn a mapping E : x → z
- As before, the generator network only observes samples from the prior z ∼ p(z) during training, to learn a mapping G : z → x
Bidirectional Generative Adversarial Networks (BiGAN)
- The discriminator D observes samples from the generative model, (z, G(z)), and from the encoding distribution, (E(x), x)
- The goal of the discriminator is to maximize the two-sample test objective between (z, G(z)) and (E(x), x)
- After training is complete, new samples are generated via G and latent representations are inferred via E
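A hedged PyTorch sketch of the BiGAN setup: the discriminator scores joint pairs (z, G(z)) and (E(x), x), trained here with the standard non-saturating GAN loss as the two-sample test objective. Architectures and dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

data_dim, latent_dim = 784, 64
G = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(), nn.Linear(256, data_dim))      # G: z -> x
E = nn.Sequential(nn.Linear(data_dim, 256), nn.ReLU(), nn.Linear(256, latent_dim))      # E: x -> z
D = nn.Sequential(nn.Linear(data_dim + latent_dim, 256), nn.ReLU(), nn.Linear(256, 1))  # D: (x, z) -> score

def d_logits(x, z):
    return D(torch.cat([x, z], dim=1))

def bigan_losses(x_real):
    z_prior = torch.randn(x_real.size(0), latent_dim)   # z ~ p(z)
    x_fake = G(z_prior)                                  # generator pair (z, G(z))
    z_enc = E(x_real)                                    # encoder pair (E(x), x)

    real_logits = d_logits(x_real, z_enc)
    fake_logits = d_logits(x_fake, z_prior)

    # Discriminator maximizes the two-sample test objective between the two joint distributions.
    d_loss = F.binary_cross_entropy_with_logits(real_logits, torch.ones_like(real_logits)) + \
             F.binary_cross_entropy_with_logits(fake_logits, torch.zeros_like(fake_logits))
    # Generator and encoder are trained adversarially (labels flipped); in a real loop,
    # separate optimizers would update (G, E) with ge_loss and D with d_loss.
    ge_loss = F.binary_cross_entropy_with_logits(fake_logits, torch.ones_like(fake_logits)) + \
              F.binary_cross_entropy_with_logits(real_logits, torch.zeros_like(real_logits))
    return d_loss, ge_loss

# After training: sample via x_new = G(torch.randn(1, latent_dim)),
# and infer a latent code via z_hat = E(x_new).
```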
Translating across domains
- Image-to-image translation: we are given images from two domains, X and Y
- Paired vs. unpaired examples
- Paired examples can be expensive to obtain. Can we translate between X and Y in an unsupervised manner?
CycleGAN: Adversarial training across two domains
- To match the two distributions, we learn two parameterized conditional generative models G : X → Y and F : Y → X
- G maps an element of X to an element of Y. A discriminator D_Y compares the observed dataset Y and the generated samples Ŷ = G(X)
- Similarly, F maps an element of Y to an element of X. A discriminator D_X compares the observed dataset X and the generated samples X̂ = F(Y)
CycleGAN: Cycle consistency across domains
- Cycle consistency: if we can go from X to Ŷ via G, then it should also be possible to go from Ŷ back to X via F, i.e., F(G(X)) ≈ X
- Similarly, in the other direction: G(F(Y)) ≈ Y
- Overall loss function:
      min_{F, G, D_X, D_Y}  L_GAN(G, D_Y, X, Y) + L_GAN(F, D_X, X, Y) + λ ( E_X[ ‖F(G(X)) − X‖_1 ] + E_Y[ ‖G(F(Y)) − Y‖_1 ] )
  where the last (λ-weighted) term is the cycle-consistency loss
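A minimal PyTorch sketch of this objective (with hypothetical stand-in networks operating on flattened images): two generators, two discriminators, and an L1 cycle-consistency term weighted by λ.

```python
import torch
import torch.nn as nn

dim = 3 * 64 * 64                                                            # flattened images, for illustration only
G = nn.Sequential(nn.Linear(dim, 512), nn.ReLU(), nn.Linear(512, dim))       # G: X -> Y
F_net = nn.Sequential(nn.Linear(dim, 512), nn.ReLU(), nn.Linear(512, dim))   # F: Y -> X
D_X = nn.Sequential(nn.Linear(dim, 512), nn.ReLU(), nn.Linear(512, 1))
D_Y = nn.Sequential(nn.Linear(dim, 512), nn.ReLU(), nn.Linear(512, 1))

bce = nn.BCEWithLogitsLoss()
l1 = nn.L1Loss()
lam = 10.0                                                                   # cycle-consistency weight

def generator_loss(x, y):
    # In a real training loop, only G and F_net would be updated with this loss.
    y_hat, x_hat = G(x), F_net(y)
    # Adversarial terms: fool D_Y on G(x) and D_X on F(y).
    adv = bce(D_Y(y_hat), torch.ones(x.size(0), 1)) + \
          bce(D_X(x_hat), torch.ones(y.size(0), 1))
    # Cycle consistency: F(G(x)) should reconstruct x, and G(F(y)) should reconstruct y.
    cycle = l1(F_net(y_hat), x) + l1(G(x_hat), y)
    return adv + lam * cycle

def discriminator_loss(x, y):
    return bce(D_X(x), torch.ones(x.size(0), 1)) + bce(D_X(F_net(y).detach()), torch.zeros(y.size(0), 1)) + \
           bce(D_Y(y), torch.ones(y.size(0), 1)) + bce(D_Y(G(x).detach()), torch.zeros(x.size(0), 1))
```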
CycleGAN in practice
Summary of Generative Adversarial Networks
- Key observation: samples and likelihoods are not correlated in practice
- Two-sample test objectives allow for learning generative models using only samples (likelihood-free)
- Wide range of two-sample test objectives covering f-divergences (and more)
- Latent representations can be inferred via BiGAN
- Interesting applications such as CycleGAN