
3D Vision, Torsten Sattler and Martin Oswald, Spring 2018


  1. 3D Vision Torsten Sattler and Martin Oswald Spring 2018

  2. 3D Vision • Understanding geometric relations: between images and the 3D world; between images • Obtaining 3D information describing our 3D world: from images; from dedicated sensors

  3. 3D Vision • Extremely important in robotics and AR / VR • Visual navigation • Sensing / mapping the environment • Obstacle detection, … • Many further application areas • A few examples …

  4. Google Tango (officially discontinued, lives on as ARCore)

  5. Google Tango

  6. Image-Based Localization

  7. Geo-Tagging Holiday Photos (Li et al. ECCV 2012)

  8. Augmented Reality (Middelberg et al. ECCV 2014)

  9. Large-Scale Structure-from-Motion (video credit: Johannes Schönberger)

  10. Virtual Tourism

  11. 3D Urban Modeling (UNC/UKY UrbanScape project)

  12. 3D Urban Modeling

  13. Mobile Phone 3D Scanner

  14. Mobile Phone 3D Scanner

  15. Self-Driving Cars

  16. Self-Driving Cars

  17. Self-Driving Cars

  18. Micro Aerial Vehicles

  19. Mixed Reality: Microsoft HoloLens

  20. Virtual Reality

  21. Raw Kinect Output: Color + Depth http://grouplab.cpsc.ucalgary.ca/cookbook/index.php/Technologies/Kinect

  22. Human-Machine Interface

  23. 3D Video with Kinect

  24. Autonomous Micro-Helicopter Navigation • Use Kinect to map out obstacles and avoid collisions

  25. Dynamic Reconstruction

  26. Performance Capture

  27. Performance Capture (Oswald et al. ECCV 14)

  28. Performance Capture

  29. Motion Capture

  30. Interactive 3D Modeling (Sinha et al. SIGGRAPH Asia 08) • Collaboration with Microsoft Research (and licensed to MS)

  31. Scanning Industrial Sites • As-built 3D model of an off-shore oil platform

  32. Scanning Cultural Heritage

  33. Cultural Heritage • Stanford's Digital Michelangelo • Digital archive • Art historic studies

  34. Archaeology • Accuracy ~1/500 from DV video (i.e. 140 kB JPEGs, 576x720)

  35. Forensics • Crime scene recording and analysis

  36. Forensics

  37. Sports

  38. Surgery

  39. 3D Vision Course Team • Martin Oswald, martin.oswald@inf.ethz.ch • Nikolay Savinov, nikolay.savinov@inf.ethz.ch • Peidong Liu, peidong.liu@inf.ethz.ch • Torsten Sattler, torsten.sattler@inf.ethz.ch • Katarina Tóthóva, katarina.tothova@inf.ethz.ch • Johannes Schönberger, jsch@inf.ethz.ch • Federico Camposeco, federico.camposeco@inf.ethz.ch • Offices in slide order: CNB G103.2, CAB G 81.1, CAB G 84.2, CNB 104; CAB G 102.2, CAB G 85.1, CAB G 86.3

  40. Course Objectives • To understand the concepts that relate images to the 3D world and images to other images • Explore the state of the art in 3D vision • Implement a 3D vision system/algorithm

  41. Learning Approach • Introductory lectures: cover basic 3D vision concepts and approaches • Further lectures: short introduction to the topic; paper presentations by you (seminal papers and state of the art, related to your projects) • 3D vision project: choose topic and define scope (by week 4); implement algorithm/system; presentation/demo and paper report • Grade distribution: paper presentation & discussions 25%; 3D vision project & report 75%

  42. Materials • Slides and more: http://www.cvg.ethz.ch/teaching/3dvision/ • Also check out the on-line “shape-from-video” tutorial: http://www.cs.unc.edu/~marc/tutorial.pdf and http://www.cs.unc.edu/~marc/tutorial/ • Textbooks: Hartley & Zisserman, Multiple View Geometry; Szeliski, Computer Vision: Algorithms and Applications

  43. Schedule
      Feb 19: Introduction
      Feb 26: Geometry, Camera Model, Calibration
      Mar 5: Features, Tracking / Matching
      Mar 12: Project Proposals by Students
      Mar 19: Structure from Motion (SfM) + papers
      Mar 26: Dense Correspondence (stereo / optical flow) + papers
      Apr 2: Bundle Adjustment & SLAM + papers
      Apr 9: Student Midterm Presentations
      Apr 16: Easter break
      Apr 23: Multi-View Stereo & Volumetric Modeling + papers
      Apr 30: Whitsuntide
      May 7: 3D Modeling with Depth Sensors + papers
      May 14: 3D Scene Understanding + papers
      May 21: 4D Video & Dynamic Scenes + papers
      May 28: Student Project Demo Day = Final Presentations

  44. Fast Forward • Quick overview of what is coming…

  45. Camera Models and Geometry • Pinhole camera • Geometric transformations in 2D and 3D
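As a rough illustration of the pinhole model named on this slide, the sketch below projects a 3D point with x ~ K (R X + t). The intrinsics (focal length 500 px, principal point (320, 240)) and the helper name `project` are assumptions made for the example, not values from the course.

```python
def project(point_world, K, R, t):
    """Project a 3D world point to pixel coordinates using x ~ K (R X + t)."""
    # World coordinates -> camera coordinates
    Xc = [sum(R[i][j] * point_world[j] for j in range(3)) + t[i] for i in range(3)]
    if Xc[2] <= 0:
        raise ValueError("point is behind the camera")
    # Camera coordinates -> homogeneous image coordinates
    u = [sum(K[i][j] * Xc[j] for j in range(3)) for i in range(3)]
    # Perspective division
    return (u[0] / u[2], u[1] / u[2])


# Assumed intrinsics: focal length 500 px, principal point (320, 240)
K = [[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]]
# Camera at the world origin, looking along +Z
R = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t = [0.0, 0.0, 0.0]

pixel = project([0.2, -0.1, 2.0], K, R, t)
print(pixel)
```

Points with the same X/Z and Y/Z ratios land on the same pixel, which is why a single image does not determine depth.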

  46. Camera Calibration • Given known 2D/3D correspondences, compute the projection matrix • Also estimate radial distortion (non-linear)
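The radial distortion mentioned here is often written as a polynomial in the squared radius. The sketch below assumes a two-coefficient model on normalized image coordinates and inverts it by fixed-point iteration; the coefficient values and function names are illustrative assumptions, not the course's calibration code.

```python
def distort(x, y, k1, k2):
    """Apply radial distortion to normalized image coordinates (x, y)."""
    r2 = x * x + y * y
    factor = 1.0 + k1 * r2 + k2 * r2 * r2
    return x * factor, y * factor


def undistort(xd, yd, k1, k2, iterations=20):
    """Invert the distortion by fixed-point iteration (no closed form in general)."""
    x, y = xd, yd  # start from the distorted point
    for _ in range(iterations):
        r2 = x * x + y * y
        factor = 1.0 + k1 * r2 + k2 * r2 * r2
        x, y = xd / factor, yd / factor
    return x, y


# Assumed coefficients, chosen only for the example
k1, k2 = -0.2, 0.05
xd, yd = distort(0.3, -0.2, k1, k2)
xu, yu = undistort(xd, yd, k1, k2)
print((xd, yd), (xu, yu))
```

The model is non-linear in the image coordinates, which is why it is usually refined by non-linear optimization after a linear estimate of the projection matrix.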

  47. Feature Tracking and Matching • Harris corners, KLT features, SIFT features • Key concepts: invariance of feature extraction and descriptors to viewpoint, exposure, and illumination changes
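A minimal sketch of the Harris corner response, assuming central-difference gradients, an unweighted 3x3 window, and k = 0.04 (all simplifications chosen for brevity). It runs on a tiny synthetic image with one corner.

```python
def harris_response(img, k=0.04):
    """Harris response R = det(M) - k * trace(M)^2 with a 3x3 box window."""
    h, w = len(img), len(img[0])
    Ix = [[0.0] * w for _ in range(h)]
    Iy = [[0.0] * w for _ in range(h)]
    # Central-difference gradients; border pixels keep zero gradient
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            Ix[y][x] = (img[y][x + 1] - img[y][x - 1]) / 2.0
            Iy[y][x] = (img[y + 1][x] - img[y - 1][x]) / 2.0
    R = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            sxx = syy = sxy = 0.0
            # Accumulate the structure tensor over the 3x3 neighborhood
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    gx, gy = Ix[y + dy][x + dx], Iy[y + dy][x + dx]
                    sxx += gx * gx
                    syy += gy * gy
                    sxy += gx * gy
            R[y][x] = sxx * syy - sxy * sxy - k * (sxx + syy) ** 2
    return R


# Synthetic 10x10 image: bright quadrant with its corner at (row 5, col 5)
img = [[1.0 if (y >= 5 and x >= 5) else 0.0 for x in range(10)] for y in range(10)]
R = harris_response(img)
print(R[5][5], R[5][7], R[2][2])  # corner, edge, flat region
```

The response is positive at the corner, negative along the straight edge, and zero in the flat region, which is the property that makes corners repeatable features.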

  48. 3D from Images • Triangulation requires: calibration; correspondences • [Figure: triangulation of 3D point M from image points m1 and m2 via rays L1 and L2 through camera centers C1 and C2]
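One simple way to triangulate is the midpoint method: take the point halfway between the closest points on the two viewing rays. The sketch below assumes the rays are already known in world coordinates (that is, calibration and correspondences are given); it is one of several triangulation methods, not necessarily the one used in the course.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def triangulate_midpoint(c1, d1, c2, d2):
    """Midpoint of the shortest segment between rays c1 + s*d1 and c2 + u*d2."""
    w = [p - q for p, q in zip(c1, c2)]
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    d, e = dot(d1, w), dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are (nearly) parallel")
    # Ray parameters of the closest points, from the 2x2 normal equations
    s = (b * e - c * d) / denom
    u = (a * e - b * d) / denom
    p1 = [c1[i] + s * d1[i] for i in range(3)]
    p2 = [c2[i] + u * d2[i] for i in range(3)]
    return [(p1[i] + p2[i]) / 2.0 for i in range(3)]


# Two camera centers one unit apart, both observing the same point M
C1, C2 = [0.0, 0.0, 0.0], [1.0, 0.0, 0.0]
M_true = [0.5, 0.2, 4.0]
ray1 = [m - c for m, c in zip(M_true, C1)]
ray2 = [m - c for m, c in zip(M_true, C2)]
M = triangulate_midpoint(C1, ray1, C2, ray2)
print(M)
```

With noisy correspondences the two rays no longer intersect, and the midpoint gives a reasonable initial estimate that can later be refined by minimizing reprojection error.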

  49. Epipolar Geometry • Fundamental matrix • Essential matrix • Also: how to robustly compute them from images
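To make the epipolar constraint concrete, the sketch below builds an essential matrix E = [t]x R from an assumed relative pose (convention X2 = R X1 + t) and checks that x2^T E x1 = 0 for a true correspondence in normalized coordinates. The pose and point values are made up for the example.

```python
import math


def skew(t):
    """Cross-product matrix [t]x, so that skew(t) @ v equals t x v."""
    return [[0.0, -t[2], t[1]], [t[2], 0.0, -t[0]], [-t[1], t[0], 0.0]]


def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]


def matvec(A, v):
    return [sum(A[i][j] * v[j] for j in range(3)) for i in range(3)]


def epipolar_residual(E, x1, x2):
    """x2^T E x1; zero for a perfect correspondence."""
    Ex1 = matvec(E, x1)
    return sum(x2[i] * Ex1[i] for i in range(3))


# Assumed relative pose: 10 degree rotation about the Y axis plus a translation
angle = math.radians(10.0)
R = [[math.cos(angle), 0.0, math.sin(angle)],
     [0.0, 1.0, 0.0],
     [-math.sin(angle), 0.0, math.cos(angle)]]
t = [-1.0, 0.0, 0.1]
E = matmul(skew(t), R)

# A 3D point expressed in both camera frames, then normalized (divide by depth)
X1 = [0.3, -0.2, 5.0]
X2 = [a + b for a, b in zip(matvec(R, X1), t)]
x1 = [c / X1[2] for c in X1]
x2 = [c / X2[2] for c in X2]

residual = epipolar_residual(E, x1, x2)
bad_residual = epipolar_residual(E, x1, [x2[0], x2[1] + 0.1, 1.0])
print(residual, bad_residual)
```

The fundamental matrix plays the same role for pixel coordinates; it differs from E only by the intrinsic calibration matrices of the two cameras.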

  50. Structure from Motion • Initialize motion (P1, P2 compatible with F) • Initialize structure (minimize reprojection error) • Extend motion (compute pose through matches seen in 2 or more previous views) • Extend structure (initialize new structure, refine existing structure)
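The reprojection error minimized in these steps can be sketched as follows. The 3x4 projection matrices, the observation layout `(camera index, point index, (u, v))`, and the function names are assumptions for illustration; this only evaluates the error and does not perform the optimization.

```python
import math


def project(P, X):
    """Project 3D point X with a 3x4 projection matrix P."""
    Xh = list(X) + [1.0]
    u = [sum(P[i][j] * Xh[j] for j in range(4)) for i in range(3)]
    return u[0] / u[2], u[1] / u[2]


def rms_reprojection_error(cameras, points, observations):
    """Root-mean-square pixel distance between observed and projected points."""
    total = 0.0
    for cam_idx, pt_idx, (u, v) in observations:
        pu, pv = project(cameras[cam_idx], points[pt_idx])
        total += (pu - u) ** 2 + (pv - v) ** 2
    return math.sqrt(total / len(observations))


# Two assumed cameras with identical intrinsics; the second has t = (-1, 0, 0)
P1 = [[500.0, 0.0, 320.0, 0.0], [0.0, 500.0, 240.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
P2 = [[500.0, 0.0, 320.0, -500.0], [0.0, 500.0, 240.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
points = [[0.2, -0.1, 2.0]]

exact = [(0, 0, project(P1, points[0])), (1, 0, project(P2, points[0]))]
u2, v2 = project(P2, points[0])
noisy = [exact[0], (1, 0, (u2 + 3.0, v2 + 4.0))]  # second observation is 5 px off

err_exact = rms_reprojection_error([P1, P2], points, exact)
err_noisy = rms_reprojection_error([P1, P2], points, noisy)
print(err_exact, err_noisy)
```

Bundle adjustment minimizes this quantity jointly over all camera parameters and 3D points.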

  51. Visual SLAM • Visual Simultaneous Localization and Mapping (Clipp et al. ICCV’09)

  52. Stereo and Rectification • Warp images to simplify epipolar geometry • Compute correspondences for all pixels
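After rectification, corresponding pixels lie on the same image row, so matching reduces to a 1D search. The sketch below assumes a sum-of-absolute-differences cost over a small window and a winner-takes-all choice, then converts disparity to depth with Z = f B / d. The intensity values, focal length, and baseline are invented for the example.

```python
def scanline_disparity(left_row, right_row, max_disparity, half_window=1):
    """Per-pixel disparity along one rectified row using a SAD window cost."""
    n = len(left_row)
    disparities = []
    for x in range(n):
        best_d, best_cost = 0, float("inf")
        for d in range(max_disparity + 1):
            cost = 0.0
            for o in range(-half_window, half_window + 1):
                # Clamp indices at the row borders
                xl = min(max(x + o, 0), n - 1)
                xr = min(max(x + o - d, 0), n - 1)
                cost += abs(left_row[xl] - right_row[xr])
            if cost < best_cost:
                best_d, best_cost = d, cost
        disparities.append(best_d)
    return disparities


def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Depth of a point seen with the given disparity in a rectified pair."""
    return focal_px * baseline_m / disparity_px


# Synthetic rows: the left row is the right row shifted by 2 pixels
right_row = [10, 20, 80, 40, 90, 30, 70, 50, 60, 15]
left_row = [0, 0] + right_row[:-2]

disparities = scanline_disparity(left_row, right_row, max_disparity=3)
depth = depth_from_disparity(disparities[5], focal_px=500.0, baseline_m=0.1)
print(disparities, depth)
```

Real stereo methods add smoothness terms and occlusion handling on top of this per-pixel cost, since winner-takes-all matching is unreliable in textureless regions.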

  53. Multi-View Stereo

  54. Joint 3D Reconstruction and Class Segmentation (Haene et al. CVPR13) • Reconstruction only (isotropic smoothness prior) • Joint reconstruction and segmentation (ground, building, vegetation, stuff) • Legend: ■ Building ■ Ground ■ Vegetation ■ Clutter

  55. Structured Light • Projector = camera • Use specific patterns to obtain correspondences
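One common choice of pattern is a binary Gray code, where each projected image encodes one bit of the projector column. The slide does not name a specific pattern, so Gray codes are an assumption here; the sketch generates the patterns and decodes the column from the bit sequence a camera pixel would observe.

```python
def gray_encode(n):
    """Binary-reflected Gray code of n."""
    return n ^ (n >> 1)


def gray_decode(g):
    """Inverse of gray_encode."""
    n = 0
    while g:
        n ^= g
        g >>= 1
    return n


def make_patterns(width, num_bits):
    """patterns[b][col] is 1 if projector column col is lit in pattern b (MSB first)."""
    return [[(gray_encode(col) >> (num_bits - 1 - b)) & 1 for col in range(width)]
            for b in range(num_bits)]


def decode_column(bits):
    """Recover the projector column from the observed on/off sequence (MSB first)."""
    g = 0
    for bit in bits:
        g = (g << 1) | bit
    return gray_decode(g)


patterns = make_patterns(width=16, num_bits=4)
# Simulate what a camera pixel lit by projector column 11 would observe
observed = [pattern[11] for pattern in patterns]
column = decode_column(observed)
print(observed, column)
```

Once each camera pixel knows which projector column illuminated it, the projector can be treated like a second camera and depth follows by triangulation. Gray codes are popular because neighboring columns differ in only one bit, which limits the effect of a single decoding error.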

  56. Papers and Discussion • Will cover recent state of the art • Each student team will present a paper (5 min per team member), followed by discussion • “Adversary” to lead the discussion • Papers will be related to projects/topics • Will distribute papers later (depending on chosen projects)

  57. Projects and Reports • Project on a 3D vision-related topic: implement algorithm / system; evaluate it; write a report about it • 3 presentations / demos: project proposal presentation (week 4); midterm presentation (week 8); project demos (week 15) • Ideally: groups of 3 students

  58. Course project example: Build your own 3D scanner! Example: Bouguet ICCV’98

  59. Project Topics

  60. DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks
      Goal: Implement a deep recurrent convolutional neural network for end-to-end visual odometry [1].
      Description: Most existing VO algorithms are developed under a standard pipeline including feature extraction, feature matching, motion estimation, local optimization, etc. Although some of them have demonstrated superior performance, they usually need to be carefully designed and specifically fine-tuned to work well in different environments. Some prior knowledge is also required to recover an absolute scale for monocular VO. This project is to implement a novel end-to-end framework for monocular VO using deep Recurrent Convolutional Neural Networks (RCNNs). Since it is trained and deployed in an end-to-end manner, it infers poses directly from a sequence of raw RGB images (videos) without adopting any module of the conventional VO pipeline. Based on the RCNNs, it not only automatically learns an effective feature representation for the VO problem through Convolutional Neural Networks, but also implicitly models sequential dynamics and relations using deep Recurrent Neural Networks. Extensive experiments on the KITTI VO dataset show competitive performance to state-of-the-art methods, verifying that the end-to-end deep learning technique can be a viable complement to traditional VO systems.
      [1] Wang et al., DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks, ICRA 2017
      Contact: Peidong Liu, CNB D102, peidong.liu@inf.ethz.ch
      Recommended: Python and prior knowledge in machine learning
