Learning 3D object models from 2D images Cropped Input Image - PowerPoint PPT Presentation

Learning 3D object models from 2D images Cropped Input Image Predicted Mesh Generated Ground Truth Predicted Landmarks Mesh Loss Latent Spatial Mesh Convolutional ResNet-50 Vector Decoder Iterative Model Fitting Learning from Imperfect Data Workshop Iasonas Kokkinos

Ariel AI S. Zafeiriou E. Schmitt H. Wang D. Kulon G. Papandreou R. A. Guler B. Fulkerson P. Koutras E. Skordos A. Kakolyris H. Tam A. Lazarou S. Galanakis D. Stoddard UCL, Imperial College, FAIR, INRIA, Stony Brook Natalia Neverova M. Bronstein Z. Shu M. Sahasrabudhe E. Bartrum N. Paragios D. Samaras Imperial College FAIR Stony Brook INRIA UCL INRIA Stony Brook

Human analysis: from coarse to fine DensePose (our work) DensePose (our work) Pose Estimation Pose Estimation Image Classification Object Detection Image Classification Part Segmentation Object Detection Part Segmentation Image Classification Is there a person in this image? Is there a person in this Is there a person in this Find correspondence between Find correspondence between Input Image Localize persons in the Localize persons in the Segment semantically Segment semantically Localize joints of the Localize joints of the Yes? No? image? image? all pixels and a 3D model. all pixels and a 3D model. image. image. meaningful body parts. meaningful body parts. persons in the images. persons in the images. Yes? No? Yes? No? Image Classification

Human analysis: from coarse to fine DensePose (our work) DensePose (our work) Pose Estimation Pose Estimation Image Classification Object Detection Image Classification Part Segmentation Object Detection Part Segmentation Person Detection Localize persons in the image. Is there a person in this Is there a person in this Find correspondence between Find correspondence between Input Image Localize persons in the Localize persons in the Segment semantically Segment semantically Localize joints of the Localize joints of the image? image? all pixels and a 3D model. all pixels and a 3D model. image. image. meaningful body parts. meaningful body parts. persons in the images. persons in the images. Yes? No? Yes? No? Image Classification Person Detection

Human analysis: from coarse to fine DensePose (our work) DensePose (our work) Pose Estimation Pose Estimation Image Classification Object Detection Image Classification Part Segmentation Object Detection Part Segmentation Part Segmentation Segment semantically meaningful Is there a person in this Is there a person in this Find correspondence between Find correspondence between Input Image Localize persons in the Localize persons in the Segment semantically Segment semantically Localize joints of the Localize joints of the body parts. image? image? all pixels and a 3D model. all pixels and a 3D model. image. image. meaningful body parts. meaningful body parts. persons in the images. persons in the images. Yes? No? Yes? No? DensePose (our work) Image Classification Person Detection Pose Estimation Image Classification Part Segmentation Object Detection Part Segmentation Is there a person in this Find correspondence between Localize persons in the Segment semantically Localize joints of the image? all pixels and a 3D model. image. meaningful body parts. persons in the images. Yes? No?

Human analysis: from coarse to fine DensePose (our work) Pose Estimation Image Classification Object Detection Part Segmentation Pose Estimation Localize joints of the persons in the Is there a person in this Find correspondence between Input Image Localize persons in the Segment semantically Localize joints of the images. image? all pixels and a 3D model. image. meaningful body parts. persons in the images. Yes? No? DensePose (our work) Image Classification Person Detection Pose Estimation Image Classification Part Segmentation Pose Estimation Object Detection Part Segmentation Is there a person in this Find correspondence between Localize persons in the Segment semantically Localize joints of the image? all pixels and a 3D model. image. meaningful body parts. persons in the images. Yes? No?

Human analysis: from coarse to fine DensePose (our work) Pose Estimation Image Classification Object Detection Part Segmentation Dense Pose Estimation Find correspondence between all Is there a person in this Find correspondence between Input Image Localize persons in the Segment semantically Localize joints of the pixels and a 3D model. image? all pixels and a 3D model. image. meaningful body parts. persons in the images. Yes? No? DensePose (our work) Image Classification Person Detection Pose Estimation Image Classification Part Segmentation Pose Estimation Object Detection DensePose Part Segmentation Is there a person in this Find correspondence between Localize persons in the Segment semantically Localize joints of the image? all pixels and a 3D model. image. meaningful body parts. persons in the images. Yes? No?

Holy grail: 3D human reconstruction “W “Wide Open” ” (T (The Mill, 2015) 8

Ariel AI: 3D human reconstruction on mobile 9

Ariel AI: 3D human reconstruction on mobile Seamless augmented reality Immersive gaming Holographic telepresence Kinetic learning Universal motion capture Personalised, experiential retail 10 10 10 10

Challenges Depth/height ambiguity 3D from 2D: fundamentally ill-posed problem Scarce 3D supervision – almost impossible in-the-wild 11 11 11 11

From imperfect vision to imperfect data Computer Vision before deep learning: - Your `local evidence’ is imperfect (classifier scores, unary terms, ..) - Compensate for it by model-based prior during inference (AAMs, MRFs,..) Computer Vision after deep learning: - Your `local evidence’ can become perfect - Your training data is imperfect - Compensate for it by some model-based prior, prior or during training

Imperfect Data for Semantic Segmentation Bounding boxes + occupancy priors “Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation” George Papandreou, Liang-Chieh Chen, Kevin P. Murphy, Alan L. Yuille, ICCV 2015

Imperfect Data for Instance Segmentation 4 points + segmentation system Deep Extreme Cut: From Extreme Points to Object Segmentation, Kevis-Kokitsi Maninis, Sergi Caelles, Jordi Pont-Tuset, Luc Van Gool

Imperfect Data for Pose Estimation Keypoints + temporal correspondence Learning Temporal Pose Estimation from Sparsely Labeled Videos, Bertasius, Gedas and Feichtenhofer, Christoph, and Tran, Du and Shi, Jianbo, and Torresani, Lorenzo(NeurIPS 2019)

Part 1: Weakly- and semi- supervised learning for 3D HoloPose: Holistic 3D Human Reconstruction In-the-Wild, A. Guler and I. Kokkinos, CVPR 2019 Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild, D. Kulon et al CVPR 2020

Part 2: Fully unsupervised learning for 3D Unstructured face dataset 3D model comes out deep magic happens Includes all previous tasks as special cases Lifting AutoEncoders: Unsupervised Learning of 3D Morphable Models Using Deep Non-Rigid Structure from Motion, M. Sahasrabudhe, Z. Shu, E. Bartrum, A. Guler, D. Samaras and I. Kokkinos, ICCV GMDL 2019

DenseReg: From Image to Template to Task R. A. Guler, G. Trigeorgis, E. Antonakos, P. Snape, S. Zafeiriou, I. Kokkinos, DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild, CVPR 2017

DenseReg, Frame-by-Frame

Supervision: from parametric model fitting to 2D keypoints 2D canonical coordinates Annotation effort: a few 2D landmarks per image Density: morphable model prior

DensePose: dense image-to-body correspondence DensePose-RCNN: ~25 FPS http://densepose.org/ R. A. Guler, N. Neverova, I. Kokkinos “DensePose: Dense Human Pose Estimation In The Wild”, CVPR’18

An Annot otation on pi pipe peline ne-II II segmented parts sampled points rendered images for the specific part segmented parts sampled points rendered images for the specific part input image input image ... ... ... ... Surface Correspondence Surface Correspondence TASK 1: Part Segmentation TASK 2: Marking Correspondences TASK 1: Part Segmentation TASK 2: Marking Correspondences

DensePose-COCO dataset Quantization replaced by part assignment. densepose.org U coordinates V coordinates Image

DensePose-RCNN in action De DensePose-RC RCNN Re Results Visualization Quantization replaced by part assignment.

HoloPose: multi-person 3D reconstruction results R. A. Guler, I. Kokkinos “HoloPose: Holistic 3D Human Reconstruction In The Wild”, CVPR’19

Surface-level human understanding, CVPR 2018 Dense UV coordinate regression SMPL parameter regression En End-to to-en end Rec ecover ery of Hu Human Shape e and Pose, e, CVPR 2018 A. Kanazawa M. J Black D. W. Jacobs J. Malik Learning Lea g to Estimate e 3D Hu Human Pose e and Shape e from a Singl gle e Im Image , , CVPR 2018 De DensePose: : Dense Human an Pos ose Estim imation ion In The Wild ild, , CVPR 2018 G. Pavlakos, L. Zhu, X. Zhou, K. Daniilidis R. A. Güler, N. Neverova, I. Kokkinos, Monocu cular 3D Pose and Shape Estimation of Multiple People, , CVPR 2018, Andrei Zanfir, Elisabeta Marinoiu, Cristian Sminchisescu Robust & accurate, “in-the-wild” Parametric and 3D Not 3D Alignment

Bottom-up human body reconstruction

Learning 3D object models from 2D images Cropped Input Image - PowerPoint PPT Presentation

Learning 3D object models from 2D images Cropped Input Image Predicted Mesh Generated Ground Truth Predicted Landmarks Mesh Loss Latent Spatial Mesh Convolutional ResNet-50 Vector Decoder Iterative Model Fitting Learning from Imperfect

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

CS4495/6495 Introduction to Computer Vision 2A-L1 Images as functions Images as functions Images

Object-Oriented Databases Object Oriented Databases ODMG Standard Object Model, Object

Object oriented Object oriented Object oriented Object oriented approach and UML approach and

Object Space Volume Rendering Object Space Volume Rendering Ronald Peikert SciVis 2010 - Object

Bitmap (Raster) Images CO2016 Multimedia and Computer Graphics Roy Crole: Bitmap Images (CO2016,

HAAR-like features for images Images digit images are scanned hand written digits Digit

https://images-na.ssl-images-amazon.com/images/I/A1w4iP5ov-L._SY879_.jpg Translate this table to a

Java Object Persistence Rakesh Vidyadharan rakesh@sptci.com 2008-05-20 Rakesh Vidyadharan Java

The Object Factory Object-Oriented Programming in R: S3 & R6 pour refill Object-Oriented

Chapter 8, Object Design: Object design is the process of adding details to the requirements

Outline Object-orientation and databases CS 235: Object-oriented model: ODL Object

Specifying Interfaces Object Design: Chapter 9, Object Design ! Object design is the process of

Multi-Object Tracking Challenge CV3DST Lecture Exercises Multi-Object Tracking Multi-Object

CS6501: Deep Learning for Visual Recognition Object Detection: RCNN, Fast-RCNN, Faster-RCNN

CS 10: Problem solving via Object Oriented Programming Animated Images Agenda 1. Multiple

CS573 Data privacy and security in the cloud in the cloud Slide credits: Ragib Hasan, Johns

MOBILE COMPUTING CSE 40814/60814 Fall 2015 Course Overview Instructor: Christian Poellabauer

Constraint Programming Connections with HPC Jean-Guillaume FAGES, PhD COSLING CEO Outline 1

IPv6 Potential Routing Table Size IPv6 Potential Routing Table Size IPv6 Potential Routing Table

Ana6: an IPv6 ad hoc addressing architecture Guillaume Chelius <guillaume.chelius@

Internet Lab (iLab1) Introduction to Cryptography Lars Wstrich ilab1@net.in.tum.de Chair of

Introduc)on to Cloud Compu)ng Dr. Zhenlin Wang Michigan Tech Very Short Bio BS, MS, Peking

Building NREN service for a WireFree world Maurice van den Akker Team Manager WireFree SURFnet

Learning 3D object models from 2D images Cropped Input Image - PowerPoint PPT Presentation

Learning 3D object models from 2D images Cropped Input Image Predicted Mesh Generated Ground Truth Predicted Landmarks Mesh Loss Latent Spatial Mesh Convolutional ResNet-50 Vector Decoder Iterative Model Fitting Learning from Imperfect

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

CS4495/6495 Introduction to Computer Vision 2A-L1 Images as functions Images as functions Images

Object-Oriented Databases Object Oriented Databases ODMG Standard Object Model, Object

Object oriented Object oriented Object oriented Object oriented approach and UML approach and

Object Space Volume Rendering Object Space Volume Rendering Ronald Peikert SciVis 2010 - Object

Bitmap (Raster) Images CO2016 Multimedia and Computer Graphics Roy Crole: Bitmap Images (CO2016,

HAAR-like features for images Images digit images are scanned hand written digits Digit

https://images-na.ssl-images-amazon.com/images/I/A1w4iP5ov-L._SY879_.jpg Translate this table to a

Java Object Persistence Rakesh Vidyadharan rakesh@sptci.com 2008-05-20 Rakesh Vidyadharan Java

The Object Factory Object-Oriented Programming in R: S3 &amp; R6 pour refill Object-Oriented

Chapter 8, Object Design: Object design is the process of adding details to the requirements

Outline Object-orientation and databases CS 235: Object-oriented model: ODL Object

Specifying Interfaces Object Design: Chapter 9, Object Design ! Object design is the process of

Multi-Object Tracking Challenge CV3DST Lecture Exercises Multi-Object Tracking Multi-Object

CS6501: Deep Learning for Visual Recognition Object Detection: RCNN, Fast-RCNN, Faster-RCNN

CS 10: Problem solving via Object Oriented Programming Animated Images Agenda 1. Multiple

CS573 Data privacy and security in the cloud in the cloud Slide credits: Ragib Hasan, Johns

MOBILE COMPUTING CSE 40814/60814 Fall 2015 Course Overview Instructor: Christian Poellabauer

Constraint Programming Connections with HPC Jean-Guillaume FAGES, PhD COSLING CEO Outline 1

IPv6 Potential Routing Table Size IPv6 Potential Routing Table Size IPv6 Potential Routing Table

Ana6: an IPv6 ad hoc addressing architecture Guillaume Chelius &lt;guillaume.chelius@

Internet Lab (iLab1) Introduction to Cryptography Lars Wstrich ilab1@net.in.tum.de Chair of

Introduc)on to Cloud Compu)ng Dr. Zhenlin Wang Michigan Tech Very Short Bio BS, MS, Peking

Building NREN service for a WireFree world Maurice van den Akker Team Manager WireFree SURFnet

The Object Factory Object-Oriented Programming in R: S3 & R6 pour refill Object-Oriented

Ana6: an IPv6 ad hoc addressing architecture Guillaume Chelius <guillaume.chelius@