Paper ID: 4410 Self-Supervised Model Training and Selection for Disentangling GANs Previous title: InfoGAN-CR: Disentangling Generative Adversarial Networks with Contrastive Regularizers Zinan Lin Kiran Thekumparampil Giulia Fanti Sewoong Oh CMU UIUC CMU UW
Generative Adversarial Networks (GANs) π factors π input noise π¨ ! β’ Hair color π¨ " β’ Rotation Generator β¦ β’ Background π¨ # β’ Bangs β’ How do π¨ ! s control the factors? Vanilla GANs Disentangled GANs π¨ ! Factor ! π ! Factor ! π¨ " Factor " π " Factor " β¦ β¦ β¦ β¦ π¨ # Factor $ π $ Factor $ π¨ % s ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 2
Examples of Disentanglement π ! π " π & π ' π ( Changing only: Latent (CelebA dataset) codes π ! π " hair color rotation lighting background bangs Generator β¦ π # π¨ % s The remaining (dSprites dataset) noise dimensions y-position shape scale rotation x-position * CelebA example is generated by InfoGAN-CR. Dsprites example is synthetic for illustration. ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 3
Challenges in Learning Disentangled GANs 1. How to train disentangled GANs? GANs are good at generating high fidelity images, β’ but are reported to be bad at disentanglement (e.g. compared with VAEs) 2. How to do unsupervised model selection? In practice, we donβt have ground-truth factor labels β’ for selecting the best disentangled models. ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 4
Our Solution: Self-Supervision Assessing the Pairs of images disentanglement with 1. How to train disentangled GANs? quality self-supervised labels InfoGAN-CR β’ Contrastive Generator Regularizer β¦ (CR) 2. How to do unsupervised model selection? ModelCentrality β’ Self-supervised All trained A pair of distance metric models: trained models: Pick the most central model ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 5
Benchmark dSprites Dataset Model FactorVAE Metric β VAE 0.63 πΎ -TCVAE 0.62 HFVAE 0.63 VAE Supervised πΎ -VAE 0.63 hyper-parameter CHyVAE 0.77 selection FactorVAE 0.82 InfoGAN 0.59 InfoGAN (modified) 0.83 GAN IB-GAN 0.80 InfoGAN-CR 0.90 Β±π. ππ Unsupervised InfoGAN-CR model 0.92 Β±π. ππ model selection selected with ModelCentrality Code & Paper & More results: https://github.com/fjxmlzn/InfoGAN-CR ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 6
Details 1. InfoGAN-CR (model training) 2. ModelCentrality (model selection) ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 7
Intuition of Contrastive Regularizer (CR) $ ) to generate a pair of images $ ,β¦, π # $ ,β¦, π ! β’ Use two latent codes ( π " ,β¦, π ! ,β¦, π # ), ( π " Equal Same i-th latent code Same shape π = 1 Generator Same x-position π = 2 ( π» ) π = 3 Same y-position ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 8
Intuition of Contrastive Regularizer (CR) $ ) to generate a pair of images $ ,β¦, π # $ ,β¦, π ! β’ Use two latent codes ( π " ,β¦, π ! ,β¦, π # ), ( π " Equal Same i-th latent code 1 π = 1 Contrastive Generator 2 π = 2 Regularizer ( π» ) (CR) π = 3 3 Classification task! ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 9
Μ InfoGAN-CR GAN Discriminator β % β β # # Input Noise " InfoGAN Encoder ! β β ! ! $ β β " %β² β β # Latent Factors $ β β " %β²β² β β # CR πΌ β " InfoGAN-CR loss: %,',( max min π *+, π», πΈ β ππ½ π», π β π½π - (π», πΌ) ) Mutual info GANβs classification accuracy of CR loss adversarial loss InfoGAN [1] [1] InfoGAN. Chen, X., Duan, Y., Houthooft, R., Schulman, J., Sutskever, I., & Abbeel, P. (NeurIPS 2016) ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 10
CR Achieves a Further Gain that Cannot be Gotten with InfoGAN Alone Disentanglement Split training #training iterations (dSprites dataset) ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 11
Details 1. InfoGAN-CR (model training) 2. ModelCentrality (model selection) ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 12
Intuition of ModelCentrality β’ Well-disentangled models are close to the (unknown) ground-truth model Model space Ground-truth model (unknown) Bad Good model model β’ Idea: pick the model at the βcenterβ ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 13
ModelCentrality β’ How to find the model at the βcenterβ ? Two models are close Γ³ Their latent codes control similar factors (1) Treat as the Model π ground-truth factors Any supervised Asymmetric similarity π !" disentanglement metric Similarity π !" ? Similarity π !" = π "! = (π !" + π "! )/2 (e.g. FactorVAE metric [1]) Model π (The higher, the more similar) ! (2) ./! β 01% π %0 ModelCentrality score π % = High similarity Low similarity (3) Selection the model with the highest π % [1] Disentangling by Factorising. Kim, H., & Mnih, A. (ICML 2018) ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 14
Results of ModelCentrality β’ Works well for GANs. Take InfoGAN-CR for example: Method FactorVAE Metric β Supervised hyper-parameter selection 0.90 Unsupervised model selection with UDR Lasso [1] 0.86 Unsupervised model selection with UDR Spearman [1] 0.84 Unsupervised model selection with ModelCentrality 0.92 β’ Also works well for VAEs! Take FactorVAE for example: Method FactorVAE Metric β Supervised hyper-parameter selection 0.83 Unsupervised model selection with UDR Lasso [1] 0.81 Unsupervised model selection with UDR Spearman [1] 0.79 Unsupervised model selection with ModelCentrality 0.84 [1] Unsupervised Model Selection for Variational Disentangled Representation Learning. Duan, S., Matthey, L., Saraiva, A., Watters, N., Burgess, C. P., Lerchner, A., & Higgins, I. (ICLR 2020). ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 15
Conclusion β’ InfoGAN-CR β’ New disentangled GANs, achieving state-of-the-art results β’ ModelCentrality β’ Unsupervised model selection approach for disentangled GANs and VAEs β’ InfoGAN-CR + ModelCentrality β’ Unsupervised disentangled generative model training and selection package, achieving state-of-the-art results Code & Paper & More results: https://github.com/fjxmlzn/InfoGAN-CR Including: More theoretical & empirical analysis of InfoGAN & InfoGAN-CR β’ Analysis of total correlation for GANs and CR for VAEs β’ New challenging disentanglement datasets β’ ID 4410: Self-supervised model training and selection for disentangling GAN Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR 16
Recommend
More recommend