End-to-end Probabilistic Deep Networks for Large-scale Semantic - PowerPoint PPT Presentation

From Pixels to Buildings : End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping Kaiyu Zheng 1* , Andrzej Pronobis 2,3 1 Brown University 2 University of Washington 3 KTH Royal Institute of Technology *work done while studying at 2 UW IROS 2019

Motivation: Semantic Mapping Output Planning in Large-Scale Probabilistic Partially Observable Representation of Uncertain Environments Spatial Knowledge (i.e. POMDP planning) (Semantic Maps) Semantic Mapping Input Constructed from local sensor observations + prior knowledge of semantic information From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 2

Semantic Mapping: Challenges • Spatial knowledge exists at Building/Floor • Different spatial scales Places Objects YCB dataset [Calli et al, 2015] From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 3

Semantic Mapping: Challenges office • Spatial knowledge exists at • Different spatial scales corridor • Multiple levels of abstraction Semantics doorway Topology Place appearance Sensory data From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 4

Semantic Mapping: Challenges • Spatial knowledge exists at • Different spatial scales • Multiple levels of abstraction • Sensory observations are • Local, Partial, Noisy Local , Partial laser-range observations with Noisy occupancy Credit of Images: Kousuke Ariga From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 5

Prior work Semantic Mapping: Challenges • Spatial knowledge exists at • Different spatial scales • Multiple levels of abstraction • Sensory observations are Ours • Local, Partial, Noisy • Relationships in human world are Complex, Noisy • Complex: Large number of connections From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 6

Prior work Semantic Mapping: Challenges • Spatial knowledge exists at • Different spatial scales • Multiple levels of abstraction • Sensory observations are Ours • Local, Partial, Noisy • Relationships in human world are Complex, Noisy • Complex: Large number of connections Noisy: Variability across floors/runs Topological graph constructed on the same floor in two runs. From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 7

Semantic Mapping: Challenges • Spatial knowledge exists at • Different spatial scales • Multiple levels of abstraction • Sensory observations are • Local, Partial, Noisy • Relationships in human world are Complex, Noisy • • Agent operates in new environments • Vary in scale and structure • Reason about unexplored places From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 8

Semantic Mapping: Desired Properties A. Captures spatial scales and abstractions B. Is probabilistic , captures uncertainty C. Allows real-time , efficient inference D. Leverages relationships between spatial concepts to • Improve robustness • resolve ambiguities • predict latent information (e.g. about unexplored places) Output Probabilistic Representation of Spatial Knowledge (Semantic Maps) Semantic Mapping Input Constructed from local sensor observ. + prior knowledge of Structured Prediction semantic information From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 9

Existing Work: Robotics Structured prediction in semantic mapping • Assembly of independent components (e.g. Conditional Random Field + CNN) • Bottleneck in communication between components • Cannot be learned end-to-end • Approximate inference for graphical models • Convergence issues • Unable to reason about unexplored space Our method doesn’t require segmentation, or room/door detection [Friedman et al. 2007] [Mozos et al. 2007] [Sünderhauf et al 2015] [Pronobis et al. 2012] 10 [Brucker et al. 2018] From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping

Existing Work: Computer Vision Deep structured prediction approaches (e.g. image generation, semantic segmentation) • Fixed number of variables [Wu et al. ‘16][Mahmood et al. ‘19] [Chen et al.’18][Schwing & Urtasun,’15] • Static global structure [Belanger & McCallum,’16] • Some not probabilistic [Shelhamer et. al. ‘16] From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 11

TopoNets: Overview • Take-away I : End-to-end Unified Deep Probabilistic Spatial Model • Take-away II: Tractable Exact Inference (real time) • Take-away III: Template-based method • Learn template networks during training • Instantiate complete network while to infer semantics for any test environment • Pr(semantics ( Y ), geometry ( X ) | topology) From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 12

TopoNets Take-away I : End-to-end Unified Deep Probabilistic Spatial Model From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 13

TopoNets Take-away I : End-to-end Unified Deep Probabilistic Spatial Model From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 14

TopoNets Take-away II: Tractable Exact Inference From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 15

TopoNets: Sum Product Networks Sum-Product Networks, a recent deep architecture • Solid theoretical foundations [Poon&Domingos’11] [Gens&Domingos’12] [ Peharz et al.’17] • Learn conditional or joint distributions • Tractable partition function, exact inference • Applied in a variety of problems (vision, NLP, robotics etc.) • Viewed in 2 ways: • Graphical model Latent Variable • Deep architecture • Structure semantics: • Hierarchical mixture of parts Input Variables From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 16

Refer to [van de Wolfshaar and Pronobis 2019] for TopoNets convolutional representations of visual/spatial data. url: https://arxiv.org/pdf/1902.06155.pdf Take-away III: Template-based method • Learn template networks during training From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 17

TopoNets Take-away III: Template-based method • Instantiate complete network to infer semantics of any test environment From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 18

TopoNets: Recap of Merits • Builds a unified deep model (an SPN) instead of an assembly of independent models • Can be learned end-to-end from robot sensor input • Template-based method • Adapts to different environments • Tractable, exact inference (real-time) • Theoretically guaranteed thanks to Sum-Product Networks • Fully probabilistic and generative • Can detect novel semantic maps to trigger additional learning From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 19

Experiments From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 20

Experiments: Inference Tasks Task 1: Semantic place classification (accuracy) ෝ 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 = argmax 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 𝑄(𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 |𝒚) Task 2: Inferring placeholders (unexplored) (accuracy of placeholders) ෝ 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒, ෝ 𝒛 𝑣𝑜𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 = argmax 𝒛 𝑣𝑜𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 𝑄 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 , 𝒛 𝑣𝑜𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 𝒚 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 Task 3: Novelty detection (ROC curve) σ 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 𝑄 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 , 𝒚 > 𝑢ℎ𝑠𝑓𝑡ℎ𝑝𝑚𝑒 From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 21

Experiments: Dataset • Collected by a mobile robot • 32 semantic maps on 4 floors • Built from laser-range and odometry data • Two experimental setups (6 or 10 semantic clases) • Cross-validation: • Trained on data from 3 floors • Tested on data from remaining floor From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 22

Experiments: Baseline An assembled approach consisting of • SPN-based Local Place Classifier • Markov Random Field (MRF) • Similar to [Pronobis et al. 2012] • Markov Random Field + door detector + SVM From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 23

Experiments: Semantic Place Classification Task 1: Semantic place classification ෝ 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 = argmax 𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 𝑄(𝒛 𝑓𝑦𝑞𝑚𝑝𝑠𝑓𝑒 |𝒚) Our approach consistently improves classification accuracy and disambiguates semantic information. From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping 24

End-to-end Probabilistic Deep Networks for Large-scale Semantic - PowerPoint PPT Presentation

From Pixels to Buildings : End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping Kaiyu Zheng 1* , Andrzej Pronobis 2,3 1 Brown University 2 University of Washington 3 KTH Royal Institute of Technology *work done while studying at 2

Probabilistic model Probabilistic model c Probabilistic model Probabilistic model c c

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

CS 4110 Probabilistic Programming Probabilistic Programming It's not about writing software.

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

Deep Reinforcement Learning and Complex Environments Raia Hadsell End-to-end Deep Learning

Edward: Deep Probabilistic Programming Extended Seminar Systems and Machine Learning Steven

Deep Networks Andrea Passerini passerini@disi.unitn.it Machine Learning Deep Networks Need for

Deep Networks Andrea Passerini passerini@disi.unitn.it Machine Learning Deep Networks Need for

E9 205 Machine Learning for Signal Processing Understanding Deep Networks 08-11-2019 Instructor

Running Probabilistic Running Probabilistic Running Probabilistic Programs Backwards Programs

Probabilistic Tracking and Probabilistic Tracking and Probabilistic Tracking and Thesis

Probabilistic Computation Lecture 13 BPP vs. PH 1 Recap 2 Recap Probabilistic computation 2

Table of Contents I Probabilistic Reasoning Classical Probabilistic Models Basic Probabilistic

Probabilistic Computation Lecture 12 Flipping coins, taking chances PP, BPP 1 Probabilistic

Probabilistic Tracking and Probabilistic Tracking and Probabilistic Tracking and Reconstruction

Classifiers for Sensory Data (e.g. tanh( ) ) Key Signal to Symbol Refinement Two layers: (VMM +

Uncertainty-Centric Safety Assurance of ML- Based Perception for Automated Driving Krzysztof

A package for sensory data analysis with R scientific discipline that applies principals of

Putting the Fun in Functional Making The Game Fun Carrie Richardson Leadership Springfield

Consciousness and Thought: Wrap-Up Talk David Chalmers The Critique of Pure Thought David

Sensory Replacement What is it? Related Devices Can it be applied to the art world? Sensory

Committee E18 Sensory Evaluation more than meets the eye Fall Committee Meeting Opening

HELPING CHILDREN KEEP ALERT AND MAINTAIN SELF REGULATION Gloria Ng Siok Kwan Senior Occupational

Sambuz

Useful Links

Newsletter

Mail Us