Introduction to Generative Adversarial Network (GAN)
Hongsheng Li
Department of Electronic Engineering, Chinese University of Hong Kong
Adversarial – adj. opposing; antagonistic
Generative Models
• Density estimation
  – Discriminative model: $p(y \mid x)$
    • y = 0 for elephant, y = 1 for horse
  – Generative model: $p(x \mid y)$
    • $p(x \mid y = 0)$ for elephant, $p(x \mid y = 1)$ for horse
Generative Models
• Sample generation: given training samples, train a model that generates new samples from the same distribution
Generative Models
• Generative model: learn $p_{model}$ from training data so that it approximates $p_{data}$, and draw samples from $p_{model}$
• GAN is a generative model
  – Mainly focuses on sample generation
  – Possible to do both (density estimation and sample generation)
Why Worth Studying?
• An excellent test of our ability to use high-dimensional, complicated probability distributions: generating samples from $p_{model}$
• Missing data
  – Semi-supervised learning
Why Worth Studying?
• Multi-modal outputs
  – Example: next-frame prediction
Lotter et al. 2015
Why Worth Studying?
• Image generation tasks
  – Example: single-image super-resolution
Ledig et al. 2016
Why Worth Studying?
• Image generation tasks
  – Example: Image-to-Image Translation
  – https://affinelayer.com/pixsrv/
Isola et al. 2016
Why Worth Studying?
• Image generation tasks
  – Example: Text-to-Image Generation
Zhang et al. 2016
How does GAN Work?
• Adversarial – adj. opposing; antagonistic
• Two networks that compete with each other:
  – Generator G: creates (fake) samples that the discriminator cannot distinguish from real ones
  – Discriminator D: determines whether samples are fake or real
The Generator
• G: a differentiable function, modeled as a neural network
• Input:
  – z: a random noise vector drawn from some simple prior distribution
• Output:
  – x = G(z): the generated samples
The Generator
• $z \sim p_z$, and the generator maps it to a sample $G(z) = x$, so that samples from $p_{model}$ match $p_{data}$
• The dimension of z should be at least as large as that of x
The Discriminator
• D: modeled as a neural network
• Input:
  – Real samples
  – Generated samples x = G(z)
• Output:
  – 1 for real samples
  – 0 for fake samples
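Below is a minimal PyTorch sketch of the two networks; the layer sizes, noise dimension, and image size are illustrative assumptions, not from the slides:

```python
import torch
import torch.nn as nn

Z_DIM, IMG_DIM = 100, 28 * 28  # illustrative sizes

# Generator G: a differentiable network mapping noise z to a fake sample x = G(z)
G = nn.Sequential(
    nn.Linear(Z_DIM, 256), nn.ReLU(),
    nn.Linear(256, IMG_DIM), nn.Tanh(),   # outputs scaled to [-1, 1]
)

# Discriminator D: maps a sample x to the probability that x is real
D = nn.Sequential(
    nn.Linear(IMG_DIM, 256), nn.LeakyReLU(0.2),
    nn.Linear(256, 1), nn.Sigmoid(),      # 1 = real, 0 = fake
)

z = torch.randn(16, Z_DIM)   # z drawn from a simple prior (standard normal)
x_fake = G(z)                # generated samples, shape (16, IMG_DIM)
p_real = D(x_fake)           # D's belief that each sample is real, in (0, 1)
```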
Generative Adversarial Networks
Cost Functions
• The discriminator outputs a value D(x) indicating the chance that x is a real image
• For real images, the ground-truth label is 1; for generated images, the label is 0
• The objective is to maximize the chance of recognizing real images as real and generated images as fake
• This objective can be defined as
  $\max_D \; \mathbb{E}_{x \sim p_{data}}[\log D(x)] + \mathbb{E}_{z \sim p_z}[\log(1 - D(G(z)))]$
Cost Functions
• For the generator G, the objective is to generate images with the highest possible value of D(x) to fool the discriminator
• Its cost function is
  $\min_G \; \mathbb{E}_{z \sim p_z}[\log(1 - D(G(z)))]$
• The overall GAN training is therefore a min-max game:
  $\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{data}}[\log D(x)] + \mathbb{E}_{z \sim p_z}[\log(1 - D(G(z)))]$
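A minimal sketch of these two cost functions in PyTorch, assuming D outputs probabilities in (0, 1):

```python
import torch
import torch.nn.functional as F

# d_real, d_fake are D's outputs in (0, 1) for real and generated batches
def d_loss(d_real, d_fake):
    # Discriminator: maximize log D(x) + log(1 - D(G(z))),
    # i.e. minimize the binary cross-entropy against labels 1 (real) and 0 (fake)
    return (F.binary_cross_entropy(d_real, torch.ones_like(d_real)) +
            F.binary_cross_entropy(d_fake, torch.zeros_like(d_fake)))

def g_loss_minimax(d_fake):
    # Generator (min-max form): minimize log(1 - D(G(z)))
    return torch.log(1.0 - d_fake + 1e-8).mean()  # small eps for stability
```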
Training Procedure
• The generator and the discriminator are learned jointly by alternating gradient descent
  – Fix the generator's parameters and perform a single iteration of gradient descent on the discriminator using the real and the generated images
  – Fix the discriminator and train the generator for another single iteration
The Algorithm
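A sketch of the alternating training loop, reusing the networks and losses sketched above; `loader` is a hypothetical iterator over batches of real images already flattened to shape (batch, IMG_DIM):

```python
import torch

# Assumes G, D, Z_DIM and the losses d_loss / g_loss_minimax defined earlier
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)

for x_real in loader:
    # 1) Discriminator step: the generator is fixed (its output is detached)
    z = torch.randn(x_real.size(0), Z_DIM)
    loss_d = d_loss(D(x_real), D(G(z).detach()))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # 2) Generator step: only G's parameters are updated
    z = torch.randn(x_real.size(0), Z_DIM)
    loss_g = g_loss_minimax(D(G(z)))
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```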
Illustration of the Learning
• Generative adversarial learning aims to learn a model distribution that matches the actual data distribution
• (Figure: the data distribution, the model distribution, and the discriminator's output evolving during training)
Diminished Gradient for the Generator
• However, we encounter a diminishing-gradient problem for the generator: the discriminator usually wins early against the generator
• It is always easier to distinguish the generated images from real images in early training, which makes the generator's cost approach 0, i.e., $\log(1 - D(G(z))) \to 0$
• The gradient for the generator then also vanishes, which makes gradient-descent optimization very slow
• To improve this, GAN training uses an alternative cost function to backpropagate gradients to the generator: instead of minimizing $\log(1 - D(G(z)))$, maximize $\log D(G(z))$
Comparison between Two Losses
• (Figure: the two generator losses plotted as functions of D(G(z)); $\log(1 - D(G(z)))$ saturates near $D(G(z)) = 0$, while $-\log D(G(z))$ retains a strong gradient there)
Non-Saturating Game
• Discriminator's cross-entropy cost:
  $J^{(D)} = -\tfrac{1}{2}\mathbb{E}_{x \sim p_{data}}[\log D(x)] - \tfrac{1}{2}\mathbb{E}_{z}[\log(1 - D(G(z)))]$
• In the min-max game, the generator maximizes the same cross-entropy: $J^{(G)} = -J^{(D)}$
• In the non-saturating game, the generator instead maximizes the log-probability of the discriminator being mistaken:
  $J^{(G)} = -\tfrac{1}{2}\mathbb{E}_{z}[\log D(G(z))]$
• Heuristically motivated; the generator can still learn even when the discriminator successfully rejects all generator samples
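The non-saturating generator loss is a one-liner in PyTorch; this sketch assumes `d_fake` holds D's probabilities for a batch of generated samples:

```python
import torch
import torch.nn.functional as F

def g_loss_nonsaturating(d_fake):
    # Non-saturating generator loss: maximize log D(G(z)),
    # implemented as binary cross-entropy against the "real" label (1).
    # Its gradient stays strong even when D confidently rejects the fakes.
    return F.binary_cross_entropy(d_fake, torch.ones_like(d_fake))
```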
Deep Convolutional Generative Adversarial Networks (DCGAN)
• All-convolutional nets
• No global average pooling
• Batch normalization
• ReLU
Radford et al. 2016
Deep Convolutional Generative Adversarial Networks (DCGAN)
• LSUN bedrooms (about 3 million training images)
Radford et al. 2016
Manipulating Learned z
Image Super-resolution with GAN
Ledig et al. 2016
Image Super-resolution with GAN
• (Figure: comparison of bicubic, SRResNet, SRGAN, and the original image)
Context-Encoder for Image Inpainting
• For a pre-defined region, synthesize the image contents
Pathak et al. 2016
Context-Encoder for Image Inpainting
• Overall framework (Figure: the synthetic region is generated to be consistent with the surrounding original region)
Context-Encoder for Image Inpainting
• The objective combines a masked reconstruction loss with an adversarial loss:
  $\mathcal{L} = \lambda_{rec}\mathcal{L}_{rec} + \lambda_{adv}\mathcal{L}_{adv}$, where $\mathcal{L}_{rec} = \lVert \hat{M} \odot (x - F((1 - \hat{M}) \odot x)) \rVert_2^2$ and $\hat{M}$ is the binary mask of the missing region
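A sketch of the masked reconstruction term, assuming a binary mask that is 1 inside the region to synthesize; the weighting names are illustrative:

```python
import torch

def inpainting_rec_loss(x, x_pred, mask):
    # Masked L2 reconstruction loss: penalize only the missing region.
    # mask is assumed to be 1 inside the region to synthesize, 0 elsewhere.
    return ((mask * (x - x_pred)) ** 2).mean()

# The full objective would combine this with an adversarial term, e.g.
# loss = lambda_rec * inpainting_rec_loss(...) + lambda_adv * adv_loss
```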
Context-Encoder for Image Inpainting
Image Inpainting with Partial Convolution
• Partial convolution for handling missing data
• L1 loss: minimizes the pixel-wise differences between the generated images and their ground-truth images
• Perceptual loss: minimizes the distance between the VGG features of the generated images and those of their ground-truth images
• Style loss (Gram matrix): minimizes the distance between the Gram matrices of the generated images and those of their ground-truth images
Liu et al. 2018
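A sketch of the Gram-matrix style loss on feature maps; the L1 distance used here is one common choice, and the paper's exact weighting may differ:

```python
import torch

def gram_matrix(feat):
    # feat: (B, C, H, W) feature maps, e.g. taken from a VGG layer
    b, c, h, w = feat.shape
    f = feat.view(b, c, h * w)
    return torch.bmm(f, f.transpose(1, 2)) / (c * h * w)  # (B, C, C)

def style_loss(feat_gen, feat_gt):
    # Match the Gram matrices of generated and ground-truth features
    return torch.abs(gram_matrix(feat_gen) - gram_matrix(feat_gt)).mean()
```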
Image Inpainting with Partial Convolution: Results
Liu et al. 2018
Texture Synthesis with Patch-based GAN
• Synthesize textures for input images
Li and Wand 2016
Texture Synthesis with Patch-based GAN
• (Figure: the framework is trained with an adversarial loss and an MSE loss)
Li and Wand 2016
Texture Synthesis with Patch-based GAN
Li and Wand 2016
Conditional GAN
• GAN is too free. How can we add constraints?
• Add conditional variables y into both G and D
Mirza and Osindero 2014
Conditional GAN
• (Figure: the conditional objective $\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{data}}[\log D(x \mid y)] + \mathbb{E}_{z \sim p_z}[\log(1 - D(G(z \mid y)))]$)
Mirza and Osindero 2014
Conditional GAN
• (Figure: the condition y encoded as a one-hot vector, e.g., [0 1 0 0 0 0 0 0 0 0])
Mirza and Osindero 2014
Conditional GAN
• Positive samples for D
  – True data + corresponding conditioning variable
• Negative samples for D
  – Synthetic data + corresponding conditioning variable
  – True data + non-corresponding conditioning variable
Mirza and Osindero 2014
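A minimal sketch of conditioning by concatenation, with the one-hot label y appended to the inputs of both networks; the sizes are illustrative assumptions:

```python
import torch
import torch.nn as nn

Z_DIM, Y_DIM, IMG_DIM = 100, 10, 28 * 28   # illustrative sizes

# Condition both networks by concatenating a one-hot label y to their inputs
G_cond = nn.Sequential(nn.Linear(Z_DIM + Y_DIM, 256), nn.ReLU(),
                       nn.Linear(256, IMG_DIM), nn.Tanh())
D_cond = nn.Sequential(nn.Linear(IMG_DIM + Y_DIM, 256), nn.LeakyReLU(0.2),
                       nn.Linear(256, 1), nn.Sigmoid())

z = torch.randn(16, Z_DIM)
y = torch.eye(Y_DIM)[torch.randint(0, Y_DIM, (16,))]   # one-hot conditions
x_fake = G_cond(torch.cat([z, y], dim=1))              # G(z | y)
p_real = D_cond(torch.cat([x_fake, y], dim=1))         # D(x | y)
```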
Text-to-Image Synthesis
Reed et al. 2016
StackGAN: Text to Photo-realistic Images
• How do humans draw a figure?
  – In a coarse-to-fine manner
Zhang et al. 2016
StackGAN: Text to Photo-realistic Images
• Use a stacked GAN structure for text-to-image synthesis
Zhang et al. 2016
StackGAN: Text to Photo-realistic Images
• Conditioning augmentation
• No random noise vector z for Stage-2
• Conditioning both stages on the text helps achieve better results
• Spatial replication for the text conditioning variable
• Negative samples for D
  – True images + non-corresponding texts
  – Synthetic images + corresponding texts
Conditioning Augmentation
• How to train the parameters of a Gaussian distribution $\mathcal{N}(\mu_0, \Sigma_0)$, i.e., its mean and variance?
• Sample $\epsilon$ from the standard normal distribution $\mathcal{N}(0, I)$
• Multiply by $\sigma_0$ and then add $\mu_0$: $c = \mu_0 + \sigma_0 \odot \epsilon$
• This is the re-parameterization trick: sampling becomes a differentiable function of the predicted parameters
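A sketch of the trick, assuming the network predicts a mean and a log-variance:

```python
import torch

def conditioning_augmentation(mu, logvar):
    # Re-parameterization trick: c = mu + sigma * eps with eps ~ N(0, I).
    # Sampling becomes a deterministic, differentiable function of the
    # predicted (mu, logvar), so gradients flow back into the network
    # that predicts them.
    eps = torch.randn_like(mu)
    return mu + torch.exp(0.5 * logvar) * eps
```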
More StackGAN Results on Flower
More StackGAN Results on COCO
StackGAN-v2: Architecture
• Approximate multi-scale image distributions jointly
• Approximate conditional and unconditional image distributions jointly
StackGAN-v2: Results
Progressive Growing of GAN
• Shares a similar spirit with StackGAN-v1/-v2 but uses a different training strategy
Progressive Growing of GAN
• Impressively realistic face images
Image-to-Image Translation with Conditional GAN
Isola et al. 2016
Image-to-Image Translation with Conditional GAN
• Incorporate an L1 loss into the objective function
• Adopt the U-Net structure, an encoder-decoder with skip connections, for the generator
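A sketch of the combined generator objective, assuming probability outputs from the discriminator; lam = 100 follows the weighting reported in the pix2pix paper:

```python
import torch
import torch.nn.functional as F

def pix2pix_g_loss(d_fake, y_pred, y_true, lam=100.0):
    # Generator objective: adversarial term + lambda * L1 reconstruction term
    adv = F.binary_cross_entropy(d_fake, torch.ones_like(d_fake))
    l1 = torch.abs(y_pred - y_true).mean()
    return adv + lam * l1
```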
Patch-based Discriminator
• Separate each image into N x N patches
• Instead of distinguishing whether the whole image is real or fake, train a patch-based discriminator
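A minimal PatchGAN-style discriminator sketch; the channel counts and depth are assumptions, not the paper's exact architecture:

```python
import torch
import torch.nn as nn

# Outputs a grid of real/fake scores instead of a single scalar; each score
# has a receptive field covering one patch of the input image.
patch_D = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(64, 128, kernel_size=4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(128, 1, kernel_size=4, stride=1, padding=1), nn.Sigmoid(),
)

x = torch.randn(1, 3, 256, 256)
scores = patch_D(x)   # shape (1, 1, 63, 63): one probability per patch
```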
More Results
CycleGAN
• All previous methods require paired training data, i.e., exact input-output pairs, which can be extremely difficult to obtain in practice