Applications of GANs
● Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
● Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks
● Generative Adversarial Text to Image Synthesis
Using GANs for Single Image Super-Resolution
Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi
Problem
How do we get a high-resolution (HR) image from just one low-resolution (LR) image?
Answer: We use super-resolution (SR) techniques.
http://www.extremetech.com/wp-content/uploads/2012/07/super-resolution-freckles.jpg
Previous Attempts
SRGAN
SRGAN - Generator
● G: generator that takes a low-res image I^LR and outputs its high-res counterpart I^SR
● θ_G: parameters of G, {W_{1:L}, b_{1:L}} (the weights and biases of an L-layer network)
● l^SR: loss function that measures the difference between the two high-res images
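As in the paper, the generator is trained by minimizing the SR loss averaged over the N training pairs:

```latex
\hat{\theta}_G = \arg\min_{\theta_G} \frac{1}{N} \sum_{n=1}^{N} l^{SR}\!\left( G_{\theta_G}(I^{LR}_n),\, I^{HR}_n \right)
```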
SRGAN - Discriminator
● D: discriminator that classifies whether a high-res image is I^HR or I^SR
● θ_D: parameters of D
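Following the original GAN formulation, the paper trains G and D jointly via the adversarial min-max problem:

```latex
\min_{\theta_G} \max_{\theta_D} \;
\mathbb{E}_{I^{HR} \sim p_{\text{train}}(I^{HR})}\!\left[ \log D_{\theta_D}(I^{HR}) \right]
+ \mathbb{E}_{I^{LR} \sim p_G(I^{LR})}\!\left[ \log\!\left( 1 - D_{\theta_D}\!\left( G_{\theta_G}(I^{LR}) \right) \right) \right]
```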
SRGAN - Perceptual Loss Function
Loss is calculated as a weighted combination of:
➔ Content loss
➔ Adversarial loss
➔ Regularization loss
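Written out, the combination is (the weights 10^{-3} and 2·10^{-8} are those reported in the paper):

```latex
l^{SR} = \underbrace{l^{SR}_{X}}_{\text{content loss}}
       + \underbrace{10^{-3}\, l^{SR}_{Gen}}_{\text{adversarial loss}}
       + \underbrace{2 \cdot 10^{-8}\, l^{SR}_{TV}}_{\text{regularization loss}}
```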
SRGAN - Content Loss
Instead of pixel-wise MSE, use a loss function based on the ReLU activation layers of a pre-trained VGG network. This ensures similarity of content.
● φ_{i,j}: feature map of the j-th convolution before the i-th maxpooling layer
● W_{i,j} and H_{i,j}: dimensions of the feature maps in the VGG network
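The VGG content loss is the Euclidean distance between the feature maps of the reconstructed image and the reference image:

```latex
l^{SR}_{VGG/i,j} = \frac{1}{W_{i,j} H_{i,j}} \sum_{x=1}^{W_{i,j}} \sum_{y=1}^{H_{i,j}}
\left( \phi_{i,j}(I^{HR})_{x,y} - \phi_{i,j}\!\left( G_{\theta_G}(I^{LR}) \right)_{x,y} \right)^2
```

A minimal PyTorch sketch of the same idea (the layer index is an illustrative choice: 35 truncates torchvision's VGG19 at relu5_4, roughly the paper's deepest VGG loss):

```python
import torch.nn as nn
from torchvision.models import vgg19

class VGGContentLoss(nn.Module):
    """MSE between VGG feature maps instead of raw pixels."""
    def __init__(self, last_layer=35):  # index 35 = relu5_4 in torchvision's VGG19
        super().__init__()
        self.features = vgg19(pretrained=True).features[:last_layer + 1].eval()
        for p in self.features.parameters():
            p.requires_grad = False  # VGG stays fixed; only the generator is trained
        self.mse = nn.MSELoss()

    def forward(self, sr, hr):
        # Compare images in feature space to measure similarity of content
        return self.mse(self.features(sr), self.features(hr))
```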
SRGAN - Adversarial Loss
Encourages the network to favour images that reside on the manifold of natural images.
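Concretely, the paper's generative (adversarial) loss component is based on the discriminator's probabilities over all training samples:

```latex
l^{SR}_{Gen} = \sum_{n=1}^{N} -\log D_{\theta_D}\!\left( G_{\theta_G}(I^{LR}_n) \right)
```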
SRGAN - Regularization Loss
Encourages spatially coherent solutions based on total variation.
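A minimal sketch of a total variation penalty (assumes PyTorch and a batch of images shaped (N, C, H, W); this is the standard TV loss, not code from the paper):

```python
import torch

def total_variation_loss(img: torch.Tensor) -> torch.Tensor:
    """Sum of absolute differences between neighbouring pixels;
    penalizing it encourages spatially smooth, coherent solutions."""
    diff_h = (img[..., 1:, :] - img[..., :-1, :]).abs().sum()
    diff_w = (img[..., :, 1:] - img[..., :, :-1]).abs().sum()
    return diff_h + diff_w
```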
SRGAN - Examples
Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks
Work by Emily Denton, Soumith Chintala, Arthur Szlam, Rob Fergus
Short Background
Conditional Generative Adversarial Nets (CGAN)
Mirza and Osindero (2014)
(Figure: side-by-side GAN and CGAN architecture diagrams)
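Mirza and Osindero condition both players on the side information y, so the GAN value function becomes:

```latex
\min_G \max_D V(D, G) =
\mathbb{E}_{x \sim p_{\text{data}}(x)}\!\left[ \log D(x \mid y) \right]
+ \mathbb{E}_{z \sim p_z(z)}\!\left[ \log\!\left( 1 - D\!\left( G(z \mid y) \right) \right) \right]
```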
Laplacian pyramid
Burt and Adelson (1983)
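A minimal sketch of the build-and-reconstruct recursion (assumes OpenCV, float images to avoid uint8 clipping, and power-of-two sizes; helper names are illustrative):

```python
import cv2

def laplacian_pyramid(img, levels=3):
    """Each level stores the detail removed by downsampling (a band-pass
    residual); the final entry is the remaining low-frequency image."""
    pyramid, current = [], img
    for _ in range(levels):
        down = cv2.pyrDown(current)
        up = cv2.pyrUp(down, dstsize=(current.shape[1], current.shape[0]))
        pyramid.append(cv2.subtract(current, up))
        current = down
    pyramid.append(current)
    return pyramid

def reconstruct(pyramid):
    """Invert the pyramid: upsample and add residuals back, coarse to fine."""
    img = pyramid[-1]
    for level in reversed(pyramid[:-1]):
        img = cv2.add(cv2.pyrUp(img, dstsize=(level.shape[1], level.shape[0])), level)
    return img
```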
Laplacian Pyramid Generative Adversarial Network (LAPGAN)
Image Generation
Training
Generation: Coarse to fine
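A minimal sketch of LAPGAN's coarse-to-fine sampling loop (the names `generators`, `upsample`, and `noises` are illustrative, not from the authors' code):

```python
def lapgan_sample(generators, upsample, noises):
    """generators[0] maps noise to the coarsest image (e.g. 4x4); each later
    generator is a conditional GAN that predicts a residual, conditioned on
    the upsampled image from the previous (coarser) level."""
    img = generators[0](noises[0])
    for G, z in zip(generators[1:], noises[1:]):
        coarse = upsample(img)     # e.g. 4x4 -> 8x8
        residual = G(z, coarse)    # high-frequency detail for this level
        img = coarse + residual    # add the band back in
    return img
```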
Different draws, starting from the same initial 4x4 image
Some thoughts on the method
● The Laplacian pyramid framework is independent of the generative model: it is possible to use a completely different model, such as PixelRNN.
Some thoughts on the method
● The generative models at each step can be totally different!
(Figure: distinct high-resolution and low-resolution generator architectures)
Generative Adversarial Text to Image Synthesis
Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee
Authors' code available at: https://github.com/reedscot/icml2016
Motivation
Current deep learning models enable us to...
➢ Learn feature representations of images & text
➢ Generate realistic images & text
Examples: retrieving images based on captions, generating descriptions based on images, answering questions about image content.
Problem - Multimodal Distribution
• Many plausible images can be associated with one single text description.
• A previous attempt used variational recurrent autoencoders to generate images from text captions, but the images were not realistic enough (Mansimov et al., 2016).
What GANs can do
• CGAN: use side information (e.g. class labels) to guide the learning process
• Minimax game: an adaptive loss function
➢ Multi-modality is a property that GANs are very well suited to learn.
The Model - Basic CGAN
A pre-trained char-CNN-RNN learns a compatibility function of images and text -> joint embedding
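A minimal PyTorch sketch of how the paper feeds text into G: the char-CNN-RNN embedding is compressed by a fully connected layer with leaky ReLU and concatenated with the noise vector (dimensions follow the paper's description; the class name is illustrative):

```python
import torch
import torch.nn as nn

class TextConditioning(nn.Module):
    """Compress the 1024-d text embedding to 128-d, then concatenate it
    with the noise vector z to form the generator's input."""
    def __init__(self, embed_dim=1024, compressed_dim=128):
        super().__init__()
        self.compress = nn.Sequential(
            nn.Linear(embed_dim, compressed_dim),
            nn.LeakyReLU(0.2),
        )

    def forward(self, z, text_embedding):
        return torch.cat([z, self.compress(text_embedding)], dim=1)
```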
The Model - Variations
GAN-CLS Algorithm
In order to distinguish different error sources, present the discriminator network with three different types of input (instead of two): {real image, matching text}, {real image, mismatching text}, and {fake image, matching text}.
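A sketch of the resulting discriminator loss, following Algorithm 1 of the paper (assumes D(image, text) returns a matching probability; names are illustrative):

```python
import torch

def gan_cls_d_loss(D, fake_img, real_img, right_txt, wrong_txt):
    s_r = D(real_img, right_txt)   # {real image, matching text}   -> should score high
    s_w = D(real_img, wrong_txt)   # {real image, mismatched text} -> should score low
    s_f = D(fake_img, right_txt)   # {fake image, matching text}   -> should score low
    # The two error sources (wrong text, fake image) split the "fake" half
    loss = torch.log(s_r) + (torch.log(1 - s_w) + torch.log(1 - s_f)) / 2
    return -loss.mean()  # minimize the negative of the objective
```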
The Model - Variations cont.
GAN-INT (updated equation below)
In order to generalize the output of G: interpolate between training-set text embeddings to generate new text, and hence fill the gaps on the image data manifold with {fake image, fake text} pairs.
GAN-INT-CLS: combination of both previous variations
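The updated term in the generator objective: G is also trained on interpolated embeddings βt₁ + (1−β)t₂, which need not correspond to any human-written caption (the paper fixes β = 0.5):

```latex
\mathbb{E}_{t_1, t_2 \sim p_{\text{data}}}\!\left[ \log\!\left( 1 - D\!\left( G\!\left( z,\; \beta t_1 + (1 - \beta)\, t_2 \right) \right) \right) \right]
```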
Disentangling
❖ Style: background, position & orientation of the object, etc.
❖ Content: shape, size & colour of the object, etc.
● Introduce S(x), a style encoder trained with a squared loss function (below).
● Useful for generalization: encoding style and content separately allows for new combinations.
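The style encoder is trained to recover the noise (style) vector from a generated image with a squared loss, as in the paper:

```latex
\mathcal{L}_{\text{style}} = \mathbb{E}_{t,\, z \sim \mathcal{N}(0,1)} \left\| z - S\!\left( G\!\left( z, \varphi(t) \right) \right) \right\|_2^2
```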
Training - Data (separated into class-disjoint train and test sets)
● Caltech-UCSD Birds
● MS COCO
● Oxford Flowers
Training – Results: Flower & Bird
Training – Results: MS COCO (comparison with Mansimov et al.)
Training – Results: Style disentangling
Thoughts on the paper
• Image quality
• Generalization
• Future work