Poster #24 1 Applied AI Lab, Oxford Robotics Institute 2 Department - PowerPoint PPT Presentation

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Adam R. Kosiorek 1,2 , Hyunjik Kim 2 , Ingmar Posner 1 , Yee Whye Teh 2 Poster #24 1 Applied AI Lab, Oxford Robotics Institute 2 Department of Statistics, University of Oxford NeurIPS 2018

Attend, Infer, Repeat 1 1 Eslami et. al., “Attend, Infer, Repeat”, NIPS 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Attend, Infer, Repeat Attend, Infer, Repeat 1 (AIR): 1 Eslami et. al., “Attend, Infer, Repeat”, NIPS 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Attend, Infer, Repeat Attend, Infer, Repeat 1 (AIR): • Variational Autoencoder (VAE) 1 Eslami et. al., “Attend, Infer, Repeat”, NIPS 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Attend, Infer, Repeat Attend, Infer, Repeat 1 (AIR): • Variational Autoencoder (VAE) • Decomposes an image into objects 1 Eslami et. al., “Attend, Infer, Repeat”, NIPS 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Attend, Infer, Repeat Attend, Infer, Repeat 1 (AIR): • Variational Autoencoder (VAE) • Decomposes an image into objects • Explains each object with a separate latent variable 1 Eslami et. al., “Attend, Infer, Repeat”, NIPS 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Attend, Infer, Repeat Attend, Infer, Repeat 1 (AIR): • Variational Autoencoder (VAE) • Decomposes an image into objects • Explains each object with a separate latent variable Here, we have two objects with superscripts 1 and 4 1 Eslami et. al., “Attend, Infer, Repeat”, NIPS 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects AIR: Latent Variables Objects are explained by separate latent variables

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects AIR: Latent Variables Objects are explained by separate latent variables what : Gaussian, how does it look like?

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects AIR: Latent Variables Objects are explained by separate latent variables what : Gaussian, how does it look like? where : Gaussian, where and how big is it?

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects AIR: Latent Variables Objects are explained by separate latent variables what : Gaussian, how does it look like? where : Gaussian, where and how big is it? presence : Bernoulli, does it exist?

Sequential Attend, Infer, Repeat

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects SQAIR: Generative Model Sequential Attend, Infer Repeat (SQAIR) extends AIR to image sequences

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects SQAIR: Generative Model Sequential Attend, Infer Repeat (SQAIR) extends AIR to image sequences Like AIR: model objects with separate latent variables

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects SQAIR: Generative Model Sequential Attend, Infer Repeat (SQAIR) extends AIR to image sequences Like AIR: model objects with separate latent variables Objects can appear and disappear in every frame

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects SQAIR: Generative Model Sequential Attend, Infer Repeat (SQAIR) extends AIR to image sequences Like AIR: model objects with separate latent variables Objects can appear and disappear in every frame Here, object 4 appeared and object 3 disappeared in frame t

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Reconstructions SQAIR can model sequences of moving objects

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Reconstructions SQAIR can model sequences of moving objects like this one

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Reconstructions SQAIR can model sequences of moving objects like this one any VAE could reconstruct it

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Reconstructions SQAIR can model sequences of moving objects like this one any VAE could reconstruct it one latent variable per object SQAIR: knows their location maintains identity (unlike AIR)

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Samples Once trained, we can sample from SQAIR Check what the model learned

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Samples Once trained, we can sample from SQAIR Check what the model learned Object appearance does not change between frames

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Samples Once trained, we can sample from SQAIR Check what the model learned Object appearance does not change between frames Motion is consistent with motion patterns in the training set

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Conditional Generation Condition the model on three frames Predict the next 97 frames by sampling from the prior

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects MNIST: Conditional Generation Condition the model on three frames Predict the next 97 frames by sampling from the prior For every conditioning sequence, we can imagine different rollouts

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects SQAIR vs AIR Reconstruction from partial observations SQAIR AIR

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects SQAIR vs AIR Reconstruction from partial Disentangling overlapping observations objects SQAIR AIR SQAIR AIR

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects SQAIR vs AIR Reconstruction from partial Disentangling overlapping observations objects SQAIR AIR SQAIR AIR missing objects!

Real World Data: Unsupervised Detection & Tracking of Pedestrians

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects DukeMTMC: Reconstructions DukeMTMC dataset 2 contains videos from static CCTV cameras 2 Ristani et. al., “Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking”, ECCV workshop , 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects DukeMTMC: Reconstructions DukeMTMC dataset 2 contains videos from static CCTV cameras Pre-process by removing backgrounds and inverting colours 2 Ristani et. al., “Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking”, ECCV workshop , 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects DukeMTMC: Reconstructions DukeMTMC dataset 2 contains videos from static CCTV cameras Pre-process by removing backgrounds and inverting colours SQAIR learns to detect & track pedestrians without human supervision! 2 Ristani et. al., “Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking”, ECCV workshop , 2016.

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects DukeMTMC: Conditional Generation SQAIR trained on sequences of five frames

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects DukeMTMC: Conditional Generation SQAIR trained on sequences of five frames • Condition the model on five frames • Predict the next 15 frames by sampling from the prior

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects DukeMTMC: Conditional Generation SQAIR trained on sequences of five frames • Condition the model on five frames • Predict the next 15 frames by sampling from the prior Each row contains five different predictions for the same sequence

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Code: Poster #24 /akosiorek/SQAIR

Poster #24 1 Applied AI Lab, Oxford Robotics Institute 2 Department - PowerPoint PPT Presentation

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Adam R. Kosiorek 1,2 , Hyunjik Kim 2 , Ingmar Posner 1 , Yee Whye Teh 2 Poster #24 1 Applied AI Lab, Oxford Robotics Institute 2 Department of Statistics, University of

Poster A1 Poster A2 Poster A3 Poster A4 Poster A5 Poster A6 Investigation of a passive

Poster Presentations Poster No. 01-40: Poster session-1 (Monday, 09 October, 2017) Poster No.

POSTER PRESENTATION & POSTER DESIGN GUIDELINES PRESENTING YOUR POSTER The poster sessions

OMT OMTAT 201 AT 2013 Poster Presenta Poster Pres entations tions Poster Session 1 in Hall -3

Poster Presentation Guidelines Version 1 .0 - 28 October 20 13 POSTER SUBMI SSI ON Poster

Poster Presentation Information for CRYO2016 Poster Boards This year poster boards are being

OMT OMTAT 201 AT 2013 Poster Pres Poster Presenta entations tions Poster Session 2 in Hall -3

OMT OMTAT 201 AT 2013 Poster Pres Poster Presenta entations tions Poster Session 3 in Hall -

Siggraph Asia 2012 Paul Bourke Singapore Expo Centre Poster Poster Poster + emerging

Poster Presentation Schedule *Final Poster No. and Presentation Schedule can be changed. Please

POSTER PRESENTATIONS Poster Presenter Poster Title Country A.A. Nayl Recovery and Separation

WCPCCS 2017 Poster Presentation Guidelines Timing of Poster Presentations JULY 17 - MONDAY POSTER

SISC 2017 Research Poster Presentation SISC 2017 Poster Abstracts 2 SISC 2017 Bioinformatics

Important Notice for Poster Presentation 1. Basic information [ Date ] Poster session 1 : 24 th ,

Poster Presentation Poster Session (I), L008-Lobby, 14:30-15:30 Poster Session Chairs: Dr.

Poster Discussion and Poster Presentation Guidelines Poster

Variational Sequential Labelers for Semi-Supervised Learning Mingda Chen, Qingming Tang, Karen

Probabilistic & Unsupervised Learning Latent Variable Models Maneesh Sahani

Introduction to Information Retrieval http://informationretrieval.org IIR 18: Latent Semantic

Maximum Reconstruction Estimation for Generative Latent-Variable Models Yong Cheng joint work

A Discriminative Latent Variable Model for Online Clustering Rajhans Samdani, Kai-Wei Chang , Dan

Learning Latent Dynamics for Planning from Pixels Danijar Hafner, Timothy Lillicrap, Ian Fischer,

Finding Latent Code Errors via Machine Learning over Program Executions Yuriy Brun Michael D.

Case Study: Approximate Bayesian Inference for Latent Gaussian Models by Using Integrated Nested

Poster #24 1 Applied AI Lab, Oxford Robotics Institute 2 Department - PowerPoint PPT Presentation

Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects Adam R. Kosiorek 1,2 , Hyunjik Kim 2 , Ingmar Posner 1 , Yee Whye Teh 2 Poster #24 1 Applied AI Lab, Oxford Robotics Institute 2 Department of Statistics, University of

Poster A1 Poster A2 Poster A3 Poster A4 Poster A5 Poster A6 Investigation of a passive

Poster Presentations Poster No. 01-40: Poster session-1 (Monday, 09 October, 2017) Poster No.

POSTER PRESENTATION &amp; POSTER DESIGN GUIDELINES PRESENTING YOUR POSTER The poster sessions

OMT OMTAT 201 AT 2013 Poster Presenta Poster Pres entations tions Poster Session 1 in Hall -3

Poster Presentation Guidelines Version 1 .0 - 28 October 20 13 POSTER SUBMI SSI ON Poster

Poster Presentation Information for CRYO2016 Poster Boards This year poster boards are being

OMT OMTAT 201 AT 2013 Poster Pres Poster Presenta entations tions Poster Session 2 in Hall -3

OMT OMTAT 201 AT 2013 Poster Pres Poster Presenta entations tions Poster Session 3 in Hall -

Siggraph Asia 2012 Paul Bourke Singapore Expo Centre Poster Poster Poster + emerging

Poster Presentation Schedule *Final Poster No. and Presentation Schedule can be changed. Please

POSTER PRESENTATIONS Poster Presenter Poster Title Country A.A. Nayl Recovery and Separation

WCPCCS 2017 Poster Presentation Guidelines Timing of Poster Presentations JULY 17 - MONDAY POSTER

SISC 2017 Research Poster Presentation SISC 2017 Poster Abstracts 2 SISC 2017 Bioinformatics

Important Notice for Poster Presentation 1. Basic information [ Date ] Poster session 1 : 24 th ,

Poster Presentation Poster Session (I), L008-Lobby, 14:30-15:30 Poster Session Chairs: Dr.

Poster Discussion and Poster Presentation Guidelines Poster

Variational Sequential Labelers for Semi-Supervised Learning Mingda Chen, Qingming Tang, Karen

Probabilistic &amp; Unsupervised Learning Latent Variable Models Maneesh Sahani

Introduction to Information Retrieval http://informationretrieval.org IIR 18: Latent Semantic

Maximum Reconstruction Estimation for Generative Latent-Variable Models Yong Cheng joint work

A Discriminative Latent Variable Model for Online Clustering Rajhans Samdani, Kai-Wei Chang , Dan

Learning Latent Dynamics for Planning from Pixels Danijar Hafner, Timothy Lillicrap, Ian Fischer,

Finding Latent Code Errors via Machine Learning over Program Executions Yuriy Brun Michael D.

Case Study: Approximate Bayesian Inference for Latent Gaussian Models by Using Integrated Nested

POSTER PRESENTATION & POSTER DESIGN GUIDELINES PRESENTING YOUR POSTER The poster sessions

Probabilistic & Unsupervised Learning Latent Variable Models Maneesh Sahani