CS3750: ADVANCED MACHINE LEARNING
GENERATIVE ADVERSARIAL NETWORKS
Adapted from slides made by Khushboo Thaker
Presented by Tristan Maidment

GROWTH (AND DECLINE) IN GAN PAPERS
Overview
◦ Why Generative Modelling?
◦ Existing Generative Models
◦ GAN Framework
◦ MiniMax game theory for GANs
◦ Properties of GANs
◦ Why GAN training is HARD
◦ Common Tricks for training GANs
◦ Extensions to GANs
◦ Conclusion

Generative Modelling
◦ Input: training examples. Output: some representation of a probability distribution, which defines this example space
◦ Unsupervised: Data: X. Goal: learn the hidden underlying structure of the data
◦ Supervised: Data: X, y. Goal: learn the hidden mapping from X -> y
Why Generative Modelling?
◦ Features representative of the data
◦ Noisy input
◦ Simulated data
◦ Prediction of future state
◦ Missing data
◦ Semi-supervised learning

MAXIMUM LIKELIHOOD BASED MODELS
$\theta^* = \arg\max_{\theta} \, \mathbb{E}_{x \sim p_{\text{data}}} \log p_{\text{model}}(x \mid \theta)$
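A minimal sketch of this objective in its simplest instance: fitting a 1-D Gaussian by maximum likelihood, where the argmax has a closed form. All names here are illustrative, not from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(loc=2.0, scale=0.5, size=10_000)  # stand-in for p_data

# For a Gaussian p_model, argmax_theta E[log p_model(x | theta)] is attained
# at the sample mean and sample standard deviation.
mu_hat = data.mean()      # MLE of the mean
sigma_hat = data.std()    # MLE of the std (the 1/n estimator)
print(f"mu={mu_hat:.3f}, sigma={sigma_hat:.3f}")    # approx. (2.0, 0.5)
```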
PixelRNN / PixelCNN / WaveNet
◦ Generate image pixels sequentially, starting from a corner
◦ Stable and fast training
◦ Slow generation (sequential)
◦ Cannot generate samples based on a latent code
◦ Tractable likelihood
◦ Maximum likelihood based training
◦ Chain rule factorization (see the sampling sketch below): $p(x) = \prod_{i=1}^{n} p(x_i \mid x_1, x_2, \ldots, x_{i-1})$
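A minimal sketch of sampling under this factorization, assuming a hypothetical `model` that maps the pixels generated so far to logits over the next pixel's intensity (no such model is defined in the slides). It makes the "slow generation" point concrete: one network call per pixel.

```python
import torch

def sample_autoregressive(model, n_pixels=784, n_levels=256):
    x = torch.zeros(n_pixels, dtype=torch.long)
    for i in range(n_pixels):                   # strictly sequential, hence slow
        logits = model(x[:i])                   # condition on x_1 .. x_{i-1}
        probs = torch.softmax(logits, dim=-1)   # p(x_i | x_<i) over n_levels values
        x[i] = torch.multinomial(probs, num_samples=1)
    return x
```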
Variational Auto-Encoder
◦ Able to achieve high likelihood
◦ Not asymptotically consistent unless q is perfect
◦ Lower quality (blurry) samples
◦ Exact likelihood is intractable; training maximizes a variational lower bound (see the ELBO sketch below):
  $\log p(x) \ge \log p(x) - D_{KL}\left(q(z) \,\|\, p(z \mid x)\right) = \mathbb{E}_{z \sim q} \log p(x, z) + H(q)$

Boltzmann Machine
◦ Energy-function based model: $p(x, h) = \exp(-E(x, h)) / Z$
◦ Partition function $Z = \sum_{x, h} \exp(-E(x, h))$ is intractable
◦ Markov chains don't work for long sequences
◦ Hard to scale to large datasets
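A minimal sketch of the VAE bound above, assuming a diagonal-Gaussian q(z|x) and a Bernoulli decoder. `encoder` and `decoder` are hypothetical modules, not from the slides: encoder(x) returns (mu, logvar), decoder(z) returns pixel means in (0, 1).

```python
import torch
import torch.nn.functional as F

def elbo(x, encoder, decoder):
    mu, logvar = encoder(x)
    std = (0.5 * logvar).exp()
    z = mu + std * torch.randn_like(std)              # reparameterization trick
    x_hat = decoder(z)                                # Bernoulli means in (0, 1)
    log_px_z = -F.binary_cross_entropy(x_hat, x, reduction="sum")
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())  # KL(q || N(0, I))
    return log_px_z - kl                              # maximize; -ELBO is the loss
```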
What are some properties of GANs?
◦ Can use latent information
◦ Asymptotically consistent
◦ No Markov chain assumption
◦ Samples produced are high quality
NEXT FRAME VIDEO GENERATION
Generative Adversarial Networks
[Diagram: a real sample x feeds the discriminator, producing D(x); noise z feeds the generator G, and the generated sample G(z) feeds the discriminator, producing D(G(z))]
https://www.slideshare.net/xavigiro/deep-learning-for-computer-vision-generative-models-and-adversarial-training-upc-2016
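A minimal sketch of one training step for the game in the diagram. D, G, and the optimizers are hypothetical stand-ins; D is assumed to end in a sigmoid, with target 1 for real and 0 for fake.

```python
import torch
import torch.nn.functional as F

def gan_step(D, G, x_real, d_opt, g_opt, z_dim=100):
    n = x_real.size(0)
    ones, zeros = torch.ones(n, 1), torch.zeros(n, 1)
    x_fake = G(torch.randn(n, z_dim))

    # Discriminator: push D(x) toward 1 and D(G(z)) toward 0.
    d_loss = (F.binary_cross_entropy(D(x_real), ones)
              + F.binary_cross_entropy(D(x_fake.detach()), zeros))
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # Generator (non-saturating form, discussed below): push D(G(z)) toward 1.
    g_loss = F.binary_cross_entropy(D(x_fake), ones)
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()
```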
Generative Adversarial Networks
"The generative model can be thought of as analogous to a team of counterfeiters, trying to produce fake currency and use it without detection, while the discriminative model is analogous to the police, trying to detect the counterfeit currency. Competition in this game drives both teams to improve their methods until the counterfeits are indistinguishable from the genuine articles."
- Goodfellow et al., "Generative Adversarial Nets" (2014)

Minimax Game Approach
◦ Generator minimizes the log-probability of the discriminator being correct
◦ Resembles Jensen-Shannon divergence
◦ Saddle point of the discriminator's loss
Vanishing Gradient Problem
◦ The generator's gradient disappears when D is confident, i.e. D(G(z)) -> 0
◦ Whenever the discriminator becomes very confident, the minimax generator loss log(1 - D(G(z))) flattens toward zero
◦ Nothing left for the generator to improve on
Heuristic Non-Saturating Game
◦ Generator maximizes the log-probability of the discriminator's mistake, i.e. log D(G(z))
◦ Does not saturate when the discriminator is successful

COMPARISON OF GENERATOR LOSSES
[Plot: the minimax loss log(1 - D(G(z))) flattens as D(G(z)) -> 0, while the non-saturating loss -log D(G(z)) keeps a strong gradient there; a code sketch follows]
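A minimal sketch contrasting the two generator losses plotted above, as a function of d_fake = D(G(z)). Names are illustrative.

```python
import torch

def minimax_g_loss(d_fake):
    # log(1 - D(G(z))): flat (near-zero gradient) as D(G(z)) -> 0, i.e. when D wins.
    return torch.log1p(-d_fake).mean()

def non_saturating_g_loss(d_fake):
    # -log D(G(z)): steep exactly where the minimax loss saturates.
    return -torch.log(d_fake + 1e-8).mean()

d_fake = torch.tensor([0.01, 0.5, 0.99])
print(minimax_g_loss(d_fake), non_saturating_g_loss(d_fake))
```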
MODE COLLAPSE
$\min_G \max_D V(G, D) \ne \max_D \min_G V(G, D)$
◦ If the generator effectively plays the inner minimization, it maps many z values to the single output the current discriminator rates most realistic, covering only a few modes of the data
Why are GANs hard to train?
◦ A trade-off between generating more accurate samples and high-coverage samples must be maintained
◦ The two learning tasks need to be balanced to achieve stability
◦ The generator keeps generating similar images, so there is nothing new to learn
◦ If the discriminator is not sufficiently trained, the generator receives poor feedback and performs poorly
◦ If the discriminator is over-trained, the vanishing gradient problem appears
Tricks to Train GANs
◦ One-sided label smoothing
◦ Historically generated batches
◦ Feature matching
◦ Batch normalization
◦ Regularizing discriminator gradients in the region around real data (DRAGAN)

One-Sided Label Smoothing
◦ The generator is VERY sensitive to the discriminator's output
◦ Regulates discriminator gradients
◦ Does not reduce classification accuracy, only reduces confidence
◦ Only smooth the positive (real) targets, never the fake ones (a sketch follows)
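A minimal sketch of one-sided label smoothing, assuming the common real-sample target of 0.9 (the exact value is a convention, not from the slides). Only the positive targets are softened; fake targets stay at exactly 0.

```python
import torch
import torch.nn.functional as F

def d_loss_smoothed(d_real, d_fake, smooth=0.9):
    real_targets = torch.full_like(d_real, smooth)   # 1.0 -> 0.9, positives only
    fake_targets = torch.zeros_like(d_fake)          # negatives untouched
    return (F.binary_cross_entropy(d_real, real_targets)
            + F.binary_cross_entropy(d_fake, fake_targets))
```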
Feature Matching
◦ Generated images must match statistics of real images
◦ The discriminator defines those statistics
◦ The generator is trained so that the expected value of its statistics matches the expected value of the real statistics
◦ Concretely, the generator minimizes the L2 distance between expected feature values in some arbitrary space, and the discriminator defines that space (see the sketch below)

Batch Normalization
◦ Construct different mini-batches for real and fake data
◦ Each mini-batch contains only all-real or only all-generated images
◦ Makes samples within a batch less dependent on each other
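A minimal sketch of the feature-matching objective, assuming `features` is a hypothetical function exposing an intermediate activation of the discriminator (that activation is the "arbitrary space" mentioned above).

```python
import torch

def feature_matching_loss(features, x_real, x_fake):
    f_real = features(x_real).mean(dim=0)            # E[f(x)] over the real batch
    f_fake = features(x_fake).mean(dim=0)            # E[f(G(z))] over the fake batch
    return ((f_real.detach() - f_fake) ** 2).sum()   # L2 distance of expectations
```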
DRAGAN
◦ Failed GANs typically have extreme gradients / sharp peaks in the discriminator around real data
◦ Regularize GANs to reduce the gradient of the discriminator in a region around the real data (a penalty sketch follows)

GAN Variations
◦ Conditional GAN
◦ LapGAN
◦ DCGAN
◦ CatGAN
◦ InfoGAN
◦ AAE
◦ DRAGAN
◦ IRGAN
◦ ProGAN
◦ and more!
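A minimal sketch of a DRAGAN-style penalty: keep the norm of the discriminator's gradient near 1 in a noisy neighborhood of the real data. The perturbation scale and the constants `lam` and `k` follow common defaults and should be treated as assumptions, not values from the slides.

```python
import torch

def dragan_penalty(D, x_real, lam=10.0, k=1.0):
    noise = 0.5 * x_real.std() * torch.rand_like(x_real)
    x_hat = (x_real + noise).requires_grad_(True)        # points near real data
    grad = torch.autograd.grad(D(x_hat).sum(), x_hat, create_graph=True)[0]
    return lam * ((grad.flatten(1).norm(2, dim=1) - k) ** 2).mean()
```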
DCGAN
◦ Multiple convolutional layers
◦ Batch normalization
◦ Strided convolutions
◦ Leaky ReLUs
Conditional GANs
◦ Model P(X|Y)
◦ Generator learns P(X|Z,Y)
◦ Discriminator learns P(L|X,Y) (a conditioning sketch follows)
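A minimal sketch of the conditioning above: the label y is one-hot encoded and concatenated to the inputs of both networks. Shapes and the number of classes are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def conditional_inputs(z, x, y, n_classes=10):
    y_onehot = F.one_hot(y, n_classes).float()
    g_in = torch.cat([z, y_onehot], dim=1)             # generator models P(X|Z,Y)
    d_in = torch.cat([x.flatten(1), y_onehot], dim=1)  # discriminator models P(L|X,Y)
    return g_in, d_in
```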
InfoGAN
◦ Rewards disentanglement (individual dimensions capturing key attributes of images)
◦ The latent input is partitioned into two parts:
  ◦ z: captures slight variation in the images
  ◦ y: captures the main attributes of the images
◦ Trained by maximizing the mutual information between the code and the generator output (a sketch of the auxiliary loss follows)
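A minimal sketch of InfoGAN's auxiliary objective, assuming a categorical code (y in these slides, c in the InfoGAN paper) and a hypothetical recognition head Q that shares most layers with the discriminator. Minimizing this cross-entropy maximizes a lower bound on the mutual information between the code and the generator output.

```python
import torch.nn.functional as F

def info_loss(Q, x_fake, code):
    logits = Q(x_fake)                   # Q's prediction of the code from G(z, code)
    return F.cross_entropy(logits, code) # lower-bounds I(code; G(z, code))
```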
InfoGAN
[figure]

BiGAN
◦ Encoder
◦ Decoder
◦ Discriminator
LapGAN
◦ Scales GANs to large images
◦ A Laplacian pyramid is used to generate the image at different scales
PROGAN

ADVERSARIAL AUTOENCODER (GAN + VAE)
Conclusion
◦ The GAN framework is flexible enough to support a variety of learning problems
◦ GANs are not guaranteed to converge
◦ GANs are still an active area of research
◦ GANs can capture perceptual similarity and generate better images than VAEs
◦ The theoretical foundations of GANs still need a lot of work (Theis et al.)
◦ Evaluation of GANs is still an open research problem
Software
◦ https://github.com/eriklindernoren/Keras-GAN
◦ https://github.com/eriklindernoren/PyTorch-GAN
◦ https://github.com/znxlwm/tensorflow-MNIST-cGAN-cDCGAN

References
◦ Deep Learning Book
◦ GAN tutorial paper: https://arxiv.org/abs/1701.00160
◦ GAN slides: http://slazebni.cs.illinois.edu/spring17/lec11_gan.pdf
◦ GAN tutorial video: https://www.youtube.com/watch?v=HGYYEUSm-0Q
THANK YOU!