Works will be presented Deep Convolutional Generative Adversarial - PowerPoint PPT Presentation

Michael James Smith’s rhyperealistic paintings Part 3 – Image Editing with GANs Levent Karacan Computer Vision Lab, Hacettepe University

Works will be presented • Deep Convolutional Generative Adversarial Networks(DCGAN) • Image Editing on Learned Manifold(iGAN) • Conditional Generative Adversarial Networks(cGAN) − Image Generation from Text (Text2Im) − Stacked Generative Adversarial Networks(StackGAN) − Location and Description Conditioned Image Generation(GAWWN) − Image to Image Translation(pix2pix) − Image Generation from Semantic Segments and Attributes(AL-CGAN) (Our work) − Unpaired Image to Image Translation(CycleGAN) • Neural Face Editing 2

Generative Adversarial Networks(GAN) Goodfellow vd. 2014(GAN); Radford vd. 2015(DCGAN) • G tries to generate fake images that fool D. • D tries to identify fake images. ℒ "#$ %, ' = ) *~, -./. (*) log ' 5 + ) *~, -./. * ,7~, 8 7 [log(1 − '(5, %(<)))] % ∗ = min " max ℒ "#$ %, ' D 3

DCGAN • Cats Source: https://github.com/a leju/cat-generator 4

DCGAN • Animes Source: https://github.com/jaylei cn/animeGAN 5

DCGAN • Album covers Source: https://github.com/jaylei cn/animeGAN 6

DCGAN • Flowers 7

DCGAN • Faces 8

Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 • An image editing method that aims to find projection E of input image F . (a) original photo (e) different degree of image manipulation Project Edit Transfer (c) Editing UI (b) projection on manifold (d) smooth transition between the original and edited projection 9

Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 • Find E that generates the input image x using generator network . < H − < I I G % < H , % < I ≈ K(E) D (F) 10

Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 • Images generated from DCGAN trained on shirt image dataset. < H − < I I G % < H , % < I ≈ (a) random samples (b) random jittering (c) linear interpolation 11

� Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 • Projection via optimization. L-BFGS-B method. < ∗ = MNOmin < H − < I I R ℒ(%(<), 5 S ) G % < H , % < I ≈ 7∈ℤ • Projection via feedforward network. ∗ = MNOmin S ; T U , 5 Z S )) T U V W X ℒ(%(Y 5 Z I ℒ 5 H , 5 I = L 5 H − L 5 I Z • Hybrid method. 12

Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 Original photos Reconstruction via Optimization 0.165 0.164 0.370 0.279 0.350 0.249 0.437 0.255 0.178 0.227 Reconstruction via Network 0.198 0.190 0.382 0.302 0.251 0.339 0.482 0.270 0.248 0.263 Reconstruction via Hybrid Method 0.133 0.141 0.298 0.218 0.160 0.204 0.318 0.185 0.183 0.190 13

� Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 < H − < I I G % < H , % < I ≈ g: Color, shape and warping constraints for image editing. I + a b . < − < d I < ∗ = min 7∈ℤ {X ^ _ %(<) − ` _ _ (a) User constraints � at different update steps (b) Updated images according to user edits � � (c) Linear interpolation between � and � 14

Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 Edit Transfer • A dense correspondence algorithm to estimate both the geometric and color changes induced by the editing process. 15

Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 16

Image Editing on Learned Manifold(iGAN) Zhu vd. 2016 17

Conditional Generative Adversarial Networks(cGAN) Mirza vd. 2014 • Concatenate condition information F to noise vector E and introduce to discriminator. ℒ e"#$ %, ' = ) *,f~, -./. (*,f) log ' 5, g + ) *~, -./. * ,7~, 8 7 [log(1 − '(5, %(5, <)))] % ∗ = min " max ℒ e"#$ %, ' D 18

Image Generation from Text(Text2Im) Reed vd. 2016 • Discriminator network tries to classify real image and wrong text as well as real/fake image with right text. • Condition: Text description embedding. • CUB bird dataset(11788 images from 200 categories), Oxford-102 flower dataset(8189 images from 102 categories). Figure 2. Our text-conditional convolutional GAN architecture. Text encoding is used by both generator and discriminator. It is 19

Image Generation from Text(Text2Im) Reed vd. 2016 Images Text descriptions this small bird has a pink this magnificent fellow is (style) (content) breast and crown, and black almost all black with a red primaries and secondaries. crest, and white cheek patch. The bird has a yellow breast with grey features and a small beak. This is a large white bird with black wings and a red head . A small bird with a black head and wings and features grey wings. This bird has a white breast , brown and white coloring on its head and wings, and a thin pointy beak. the flower has petals that this white and yellow flower have thin white petals and a are bright pinkish purple A small bird with white base and black stripes throughout its belly, head, and with white stigma round yellow stamen feathers. A small sized bird that has a cream belly and a short pointed bill. This bird is completely red . This bird is completely white . Figure 1. Examples of generated images from text descriptions. This is a yellow bird. The wings are bright blue . Figure 6. Transfering style from the top row (real) images to the 20

Image Generation from Text(Text2Im) Reed vd. 2016 ‘Red bird with black beak’ “Blue bird with black beak “This bird is completely red with black wings” ‘Small blue bird with black wings’ → 21

Image Generation from Text(Text2Im) Reed vd. 2016 “Small blue bird with black wings. ” “Small yellow bird with black wings” ‘This bird is bright.’ → ‘This bird is dark.’ ‘This is a yellow bird. The wings are bright blue’ 22

Image Generation from Text(Text2Im) Reed vd. 2016 “This bird is bright. ” “This bird is dark” 23

Stacked Generative Adversarial Networks(StackGAN) Han vd. 2016 • There are 2 stages. • Stage-I GAN: Generates low resolution images. • Conditioning Augmentation • Regularization term is added to generator. ' hi (j(k(l m ) ∥ j(0, p)) • Stage-II GAN: Generates high resolution detailed images. • Noise vector is not used. 24

Stacked Generative Adversarial Networks(StackGAN) Han vd. 2016 25

Location and Description Conditioned Image Reed vd. 2016 Generation(GAWWN) This bird is completely black. Beak Belly Right leg This bird is bright blue. Head a man in an orange jacket, black pants and a black cap wearing sunglasses skiing 29

Location and Description Conditioned Image Reed vd. 2016 Generation(GAWWN) • Keypoint conditioned architecture. 30

Location and Description Conditioned Image Reed vd. 2016 Generation(GAWWN) • Keypoint conditioned architecture. Shrinking Translation Stretching Caption GT This bird has a black head, a long orange beak and yellow body This large black bird has a pointy beak and black eyes This small blue bird has a short pointy beak and brown patches on its wings Figure 6: Controlling the bird’s position using keypoint coordinates. Here we only interpolated the 31

Location and Description Conditioned Image Reed vd. 2016 Generation(GAWWN) • Bounding box conditioned architecture. 32

Location and Description Conditioned Image Reed vd. 2016 Generation(GAWWN) • Bounding box conditioned architecture. Caption Shrinking GT Translation Stretching This bird has a black head, a long orange beak and yellow body This large black bird has a pointy beak and black eyes This small blue bird has a short pointy beak and brown patches on its wings Figure 4: Controlling the bird’s position using bounding box coordinates. and previously-unseen text. 33

Image to Image Translation(pix2pix) Isola vd. 2017 34

Image to Image Translation(pix2pix) Isola vd. 2017 % ∗ = MNOmin " max ℒ e"#$ %, ' + aℒ iH % Adversarial Loss D ℒ e"#$ %, ' = ) *,f~, -./. (*,f) log ' 5, g + Real Fake ) *~, -./. * [log(1 − '(5, %(5)))] L1 Loss ℒ iH % = ) *,f~, -./. *,f ,7~, 8 7 g − % 5, < H Encoder-decoder U-Net • Noise vector is removed,.Instead dropout is used to provide stochasticity. • Skip connections on Generative model • G tries to generate fake images that fool D. • PatchGAN is proposed for dicriminator instead of • D tries to identify fake images. pixel GAN. 35

Image to Image Translation(pix2pix) Isola vd. 2017 L1+cGAN L1 Encoder-decoder • U-Net provides to include low-level features to be used yo generate more realistic images. • PatchGAN provides to generate U-Net sharper images. . Input Ground truth L1 cGAN L1 + cGAN 36

Image to Image Translation(pix2pix) Isola vd. 2017 Input L1 cGAN cGAN + L1 Real Isola, P., Zhu, J.Y., Zhou, T. and Efros, A.A. “Image-to-image translation with conditional adversarial networks.”. In CVPR 2017 . . 37

Image to Image Translation(pix2pix) Isola vd. 2017 Input Ground truth Output Input Ground truth Output 38

Image to Image Translation(pix2pix) Isola vd. 2017 Input Ground truth Output Input Ground truth Output 39

Works will be presented Deep Convolutional Generative Adversarial - PowerPoint PPT Presentation

Michael James Smiths rhyperealistic paintings Part 3 Image Editing with GANs Levent Karacan Computer Vision Lab, Hacettepe University Works will be presented Deep Convolutional Generative Adversarial Networks(DCGAN) Image Editing

Public Works Department Public Works Department Public Works Department 2012-2017 Capital Works

I want my MVP UX in the City - 20th April 2017 PILOT WORKS 1 Hello, I am Alastair from PILOT

ArcelorMittal Newcastle Works June 2012 Contents About Newcastle Works Newcastle Works

PUBLIC WORKS DEPARTMENT PUBLIC WORKS DEPARTMENT The City of Rossland Public Works staff consists

How Colitag works: How Colitag works: Colitag is a dehydrated test culture medium Colitag is a

Ontario Works Program Ontario Works To be eligible for Ontario Works, an applicant must be:

Maas Energy Works, Inc. Renewable Energy that Works Daryl Maas Chief Executive Officer

Rookery South ERF CLP Meeting LLRS Works Update 12 th June 2017 LLRS Works Low Level

Technology & the Local Government Works Officer TECHNOLOGY & THE LOCAL GOVERNMENT WORKS

public works public works City of Buellton * February 13, 2013 OVERVIEW Public works covers the

DESIGN-BUILD Projects July 16, 2020 LA COUNTY PUBLIC WORKS Los Angeles County Public Works

PRESENTATION OF SCIENTIFIC WORKS 1. The scientific works for the CLASA-Uruguay Congress will be

Josephine County Public Works 2016 17 Budget Presentation Robert Brandes, Director Public Works

Presentation Contents 1. Whos Who 2. Where are the works to be undertaken 3. Preparatory works

Emilio Velis CC El Salvador / OSHWA El Salvador How to identify open works in general How can

General Managers Recommended Budget FY 14/15 Public Works Department 1 Public Works

30/03/2016 Safety Moment Ben Green Planning Forum Ian Fletcher 22 March 2016 2 Safety Moment

Statistical methods Toma Podobnik Oddelek za fiziko, FMF, UNI-LJ , , Odsek za

Strong fields in heavy ion collisions at VLHC P. Lvai (Wigner RCP, Budapest, Hungary) V.V.

Topic 8 graphics "What makes the situation worse is that the highest level CS course I've

Laser Interferometer Space Antenna (LISA) Time-Delay Interferometry with Moving Spacecraft

A brief history of gravitational-wave research and the gravitational-wave spectrum Wei-Tou Ni

GW1Introduction to Gravitational Waves Michele Vallisneri ICTP Summer School on Cosmology

Digital Democracy: Interactive Medium-Fi Prototype Sharewaves Garrick F. | Gen S. | Grace W.

Works will be presented Deep Convolutional Generative Adversarial - PowerPoint PPT Presentation

Michael James Smiths rhyperealistic paintings Part 3 Image Editing with GANs Levent Karacan Computer Vision Lab, Hacettepe University Works will be presented Deep Convolutional Generative Adversarial Networks(DCGAN) Image Editing

Public Works Department Public Works Department Public Works Department 2012-2017 Capital Works

I want my MVP UX in the City - 20th April 2017 PILOT WORKS 1 Hello, I am Alastair from PILOT

ArcelorMittal Newcastle Works June 2012 Contents About Newcastle Works Newcastle Works

PUBLIC WORKS DEPARTMENT PUBLIC WORKS DEPARTMENT The City of Rossland Public Works staff consists

How Colitag works: How Colitag works: Colitag is a dehydrated test culture medium Colitag is a

Ontario Works Program Ontario Works To be eligible for Ontario Works, an applicant must be:

Maas Energy Works, Inc. Renewable Energy that Works Daryl Maas Chief Executive Officer

Rookery South ERF CLP Meeting LLRS Works Update 12 th June 2017 LLRS Works Low Level

Technology &amp; the Local Government Works Officer TECHNOLOGY &amp; THE LOCAL GOVERNMENT WORKS

public works public works City of Buellton * February 13, 2013 OVERVIEW Public works covers the

DESIGN-BUILD Projects July 16, 2020 LA COUNTY PUBLIC WORKS Los Angeles County Public Works

PRESENTATION OF SCIENTIFIC WORKS 1. The scientific works for the CLASA-Uruguay Congress will be

Josephine County Public Works 2016 17 Budget Presentation Robert Brandes, Director Public Works

Presentation Contents 1. Whos Who 2. Where are the works to be undertaken 3. Preparatory works

Emilio Velis CC El Salvador / OSHWA El Salvador How to identify open works in general How can

General Managers Recommended Budget FY 14/15 Public Works Department 1 Public Works

30/03/2016 Safety Moment Ben Green Planning Forum Ian Fletcher 22 March 2016 2 Safety Moment

Statistical methods Toma Podobnik Oddelek za fiziko, FMF, UNI-LJ , , Odsek za

Strong fields in heavy ion collisions at VLHC P. Lvai (Wigner RCP, Budapest, Hungary) V.V.

Topic 8 graphics &quot;What makes the situation worse is that the highest level CS course I've

Laser Interferometer Space Antenna (LISA) Time-Delay Interferometry with Moving Spacecraft

A brief history of gravitational-wave research and the gravitational-wave spectrum Wei-Tou Ni

GW1Introduction to Gravitational Waves Michele Vallisneri ICTP Summer School on Cosmology

Digital Democracy: Interactive Medium-Fi Prototype Sharewaves Garrick F. | Gen S. | Grace W.

Technology & the Local Government Works Officer TECHNOLOGY & THE LOCAL GOVERNMENT WORKS

Topic 8 graphics "What makes the situation worse is that the highest level CS course I've