pixelgan autoencoders
play

PixelGAN Autoencoders Alireza Makhzani, Brendan Frey Machine - PowerPoint PPT Presentation

PixelGAN Autoencoders Alireza Makhzani, Brendan Frey Machine learning Group University of Toronto CIFAR Deep Learning Summer School Montreal, Canada June 29 th , 2017 Alireza Makhzani PixelGAN Autoencoders 1 / 27 Outline 1. Background


  1. PixelGAN Autoencoders Alireza Makhzani, Brendan Frey Machine learning Group University of Toronto CIFAR Deep Learning Summer School Montreal, Canada June 29 th , 2017 Alireza Makhzani PixelGAN Autoencoders 1 / 27

  2. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 2 / 27

  3. PixelCNNs ✦ Learn the image statistics directly at the pixel level. ✦ Good at modelling low-level pixel statistics. ✦ Samples lack global structure. ✦ Lacking latent representation. ✦ Conditional PixelCNNs can learn conditional densities. van den Oord et al., 2016 Alireza Makhzani PixelGAN Autoencoders 3 / 27

  4. Variational Autoencoders log p ( x ) > � E q ( z | x ) [ � log( p ( x | z )] � KL( q ( z | x ) k p ( z )) ✦ Good at capturing the global structure, but samples are blurry. ✦ Learn hierarchical representations useful for down-stream tasks. ✦ Attempts at combining PixelCNN with VAEs: • PixelVAE ( Gulrajani et al., 2016 ) • VLAE ( Chen et al., 2017 ) Kingma et al., 2013 Alireza Makhzani PixelGAN Autoencoders 4 4 / 27

  5. Adversarial Autoencoders B D Code Space of MNIST: Gaussian Prior Mixture of Gaussians Makhzani et al., 2015 Alireza Makhzani PixelGAN Autoencoders 5 / 27

  6. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 6 / 27

  7. Limitations of Variational/Adversarial Autoencoders ✦ All the image statistics are captured by the single latent vector. VAE label, style p ( z ) Latent Variable z global and local Deterministic p ( x | z ) None (factorized Gaussians) x Alireza Makhzani PixelGAN Autoencoders 7 / 27

  8. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. p ( z ) Latent Variable z p ( x | z ) PixelCNN x Alireza Makhzani PixelGAN Autoencoders 8 / 27

  9. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. p ( z ) Latent Variable z GAN p ( x | z ) PixelCNN x Alireza Makhzani PixelGAN Autoencoders 9 / 27

  10. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. PixelGAN PixelGAN (Gaussian) (Categorical) Global Discrete p ( z ) Latent Variable z (low-frequency) (label) GAN Local Continuous p ( x | z ) PixelCNN (high-frequency) (Style) x Alireza Makhzani PixelGAN Autoencoders 10 / 27

  11. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. PixelGAN PixelGAN (Gaussian) (Categorical) Global Discrete p ( z ) Latent Variable z (low-frequency) (label) GAN Local Continuous p ( x | z ) PixelCNN (high-frequency) (Style) x Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 11 / 27

  12. PixelGAN Autoencoders Cost function of PixelGAN = Reconstruction + Adversarial Cost Alireza Makhzani PixelGAN Autoencoders 12 / 27

  13. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors Clustering Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 13 / 27

  14. Global vs. Local Decomposition Alireza Makhzani PixelGAN Autoencoders 14 / 27

  15. Code Space Code Space of MNIST: Alireza Makhzani PixelGAN Autoencoders 15 / 27

  16. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 16 / 27

  17. PixelGAN Autoencoders with Categorical Priors Alireza Makhzani PixelGAN Autoencoders 17 / 27

  18. Discrete vs. Continuous Decomposition (Clustering) Alireza Makhzani PixelGAN Autoencoders 18 / 27

  19. Discrete vs. Continuous Decomposition (Clustering) 0.3% Error rate Alireza Makhzani PixelGAN Autoencoders 19 / 27

  20. Unsupervised Clustering Alireza Makhzani PixelGAN Autoencoders 20 / 27

  21. Unsupervised Clustering Alireza Makhzani PixelGAN Autoencoders 21 / 27

  22. Unsupervised Clustering 5% Error rate Alireza Makhzani PixelGAN Autoencoders 22 / 27

  23. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 23 / 27

  24. Semi-supervised Learning Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 24 / 27

  25. Semi-supervised Learning Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 25 / 27

  26. Semi-supervised Classification Alireza Makhzani PixelGAN Autoencoders 26 / 27

  27. Thank you! Alireza Makhzani PixelGAN Autoencoders 27 / 27

Recommend


More recommend