PixelCNN Models with Auxiliary Variables for Natural Image Modeling
Alexander Kolesnikov*, Christoph H. Lampert* (*IST Austria)
ICML 2017
PixelCNN Models with Auxiliary Variables
1. What is the task?
2. PixelCNN model (recap of last coffee talk)
3. Proposed models
   a) Grayscale PixelCNN
   b) Pyramid PixelCNN
4. Conclusion
What is the task? Density estimation
• Task:
  • Input: training set of images
  • Output: model estimating p(x)
  • Evaluation: measure p(x) on a test set; higher p(x) is better
  • Note: p(x) should be normalized
• Why learn p(x)?
  • Representation learning
  • Image reconstruction
  • Deblurring
  • Super-resolution
  • Image compression
  • …
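Density-estimation papers on CIFAR-10 usually report the (normalized) likelihood as bits per dimension rather than raw p(x). A minimal sketch of that conversion, with the example numbers chosen purely for illustration:

```python
import numpy as np

def bits_per_dim(log_likelihood_nats, num_pixels, num_channels=3):
    """Convert a model's total log-likelihood for one image (in nats)
    into the bits-per-dimension score used on CIFAR-10.
    Lower bits/dim corresponds to higher p(x)."""
    num_dims = num_pixels * num_channels
    return -log_likelihood_nats / (num_dims * np.log(2.0))

# Hypothetical example: a 32x32 RGB image with total log p(x) = -6500 nats
print(bits_per_dim(-6500.0, 32 * 32))
```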
Recap of PixelCNN
• PixelCNN is an autoregressive generative model (convolutional, not recurrent; PixelRNN is the recurrent variant)
• Input: previously generated pixels
• Output: pdf (prediction) for the next pixel
• Pros:
  • Can compute p(x) (unlike GANs)
  • Train by maximum likelihood
  • Stable training (unlike GANs)
  • Generates sharp images (unlike VAE)
• Cons:
  • No latent variables
  • Generation of images is very slow because pixels are produced one at a time
  • Incoherent global image structure
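The autoregressive structure above is enforced with masked convolutions: each pixel's prediction may only depend on pixels above it and to its left. A minimal sketch of the mask (the rest of the architecture is omitted):

```python
import numpy as np

def causal_mask(kernel_size, mask_type="A"):
    """Mask applied to a PixelCNN convolution kernel: keep positions
    strictly above the center, and to the left of it in the center row.
    Type 'A' (first layer) also hides the center pixel itself;
    type 'B' (later layers) keeps it."""
    k = kernel_size
    mask = np.zeros((k, k), dtype=np.float32)
    mask[: k // 2, :] = 1.0           # all rows above the center
    mask[k // 2, : k // 2] = 1.0      # left of center in the center row
    if mask_type == "B":
        mask[k // 2, k // 2] = 1.0    # later layers may see the center
    return mask

print(causal_mask(3, "A"))
```

Multiplying each convolution kernel by this mask before applying it guarantees that p(x) factorizes over pixels in raster-scan order.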
First proposed model: Grayscale PixelCNN
• PixelCNN models low-level features well, but not global structure: the likelihood is dominated by low-level details.
• Idea: split the model into two PixelCNNs.
  • First PixelCNN models global structure. Output: a grayscale version of the image with 4 bits per pixel (the auxiliary variable).
  • Second PixelCNN models low-level details. Input: the output of the first model, fed through a deep CNN feature extractor. Output: the 24-bit color image.
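At training time the auxiliary variable is computed deterministically from each training image. A minimal sketch of that quantization step (the exact grayscale conversion used in the paper may differ; plain channel averaging is an assumption here):

```python
import numpy as np

def grayscale_4bit(rgb):
    """Auxiliary variable for a Grayscale-PixelCNN-style model (sketch):
    convert an RGB image (uint8, HxWx3) to grayscale and quantize it
    to 4 bits per pixel, i.e. 16 intensity levels."""
    gray = rgb.astype(np.float32).mean(axis=-1)          # simple luminance proxy
    return np.floor(gray / 256.0 * 16).astype(np.uint8)  # values in 0..15
```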
Grayscale PixelCNN: Results
• State-of-the-art performance on the CIFAR-10 test set
• Samples are highly diverse and have coherent global structure
• No overfitting (train loss ≈ test loss)
• Decomposing the likelihood into its two parts confirms that low-level details indeed dominate the likelihood objective.
• Because Grayscale PixelCNN uses two separate models, the two objectives do not interfere.
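The two-part decomposition mentioned above can be written out explicitly. Since the auxiliary variable is a deterministic function of the image, the joint likelihood splits into the two factors that the two PixelCNNs model:

```latex
% \hat{x} is the 4-bit grayscale auxiliary variable derived from x.
\log p(x, \hat{x})
  = \underbrace{\log p(\hat{x})}_{\text{global structure (model 1)}}
  + \underbrace{\log p(x \mid \hat{x})}_{\text{low-level detail (model 2)}}
```

Comparing the magnitudes of the two terms is what shows that low-level detail carries most of the objective.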
Pyramid PixelCNN
• Motivations: (1) asymmetry: in raster-scan order, the lower-right pixel has access to far more information than the upper-left one; (2) speed up the model.
• Idea: a multiscale pyramid. Each scale is a PixelCNN conditioned, through a very deep CNN, on the previous, coarser scale (P1 → P2 → P3 → P4 → P5 in the figure).
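The pyramid idea can be sketched as a coarse-to-fine sampling loop. Everything here is schematic: `models` is a hypothetical mapping from resolution to a sampler, standing in for the per-scale PixelCNNs and conditioning CNNs:

```python
import numpy as np

def sample_pyramid(models, base_size=4, final_size=32):
    """Multiscale sampling sketch for a Pyramid-PixelCNN-style model.
    `models[s]` is a hypothetical sampler sample(size, conditioning) -> image.
    The output of each scale conditions the next, so every pixel sees
    information from the whole (coarser) image, and most autoregressive
    computation runs at low resolution -- the source of the speed-up."""
    image = models[base_size](base_size, conditioning=None)  # coarsest scale
    size = base_size
    while size < final_size:
        size *= 2
        image = models[size](size, conditioning=image)  # refine at 2x resolution
    return image
```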
Pyramid PixelCNN: Results (1/2)
• Close to state of the art on CIFAR-10
• Speed-up factor of at least 10x
• Evaluation on CelebA (figure: MAP reconstructions)
Pyramid PixelCNN: Results (2/2)
Conclusions
• Low-level details distract models from learning high-level structure
• Use two models (a low-level model and a high-level model)
• A multiscale architecture can model high-resolution faces
• Next coffee talk: “Neural Discrete Representation Learning”
Grayscale results
Grayscale samples, colored