real time monocular slam
play

Real-Time Monocular SLAM Andrew Davison Robot Vision Group - PowerPoint PPT Presentation

Real-Time Monocular SLAM Andrew Davison Robot Vision Group Department of Computing Imperial College London March 30, 2011 Robot Vision in Real-Time Performance in robot vision is advancing fast . What are the reasons? Continued exponential


  1. Real-Time Monocular SLAM Andrew Davison Robot Vision Group Department of Computing Imperial College London March 30, 2011

  2. Robot Vision in Real-Time Performance in robot vision is advancing fast . What are the reasons? • Continued exponential increase in low-cost computer power. • Bayesian probability theory: now widely agreed upon as the absolute framework for doing inference with real-world data. • A wealth of well understood methods that really work are publicly available (well engineered algorithms or even code) and can be easily used to put systems together.

  3. Simultaneous Localisation and Mapping B C A (a) Robot start (zero uncertainty); first measurement of feature A.

  4. Simultaneous Localisation and Mapping (b) Robot drives forwards (uncertainty grows).

  5. Simultaneous Localisation and Mapping (c) Robot makes first measurements of B and C.

  6. Simultaneous Localisation and Mapping (d) Robot drives back towards start (uncertainty grows more)

  7. Simultaneous Localisation and Mapping (e) Robot re-measures A; loop closure ! Uncertainty shrinks.

  8. Simultaneous Localisation and Mapping (f) Robot re-measures B; note that uncertainty of C also shrinks.

  9. SLAM with First Order Uncertainy Propagation     ˆ x v P xx P xy 1 P xy 2 . . . y 1 ˆ P y 1 x P y 1 y 1 P y 1 y 2 . . .     x = ˆ P =     y 2 ˆ , P y 2 x P y 2 y 1 P y 2 y 2 . . .     . . . .     . . . . . . . . • Camera pose and map stored in single state vector and updated on every frame via a single Extended Kalman Filter. • Full PDF over robot and map parameters represented by a single multi-variate Gaussian.

  10. SLAM Using Vision: First Steps • Fixating active stereo measuring one feature at a time. • 5Hz real-time processing (100MHz PC!). Davison and Murray, ECCV 1998, PAMI 2002.

  11. SLAM Using Active Stereo Vision Probabilistic Map Results z 1 z 0 x x

  12. Monocular SLAM • Can we still do SLAM with a single unconstrained camera, flying generally through the world in 3D? • 30Hz or higher operation required to track agile motion. • Salient feature patches detected once to serve as long-term visual landmarks. • Landmarks gradually accumulated and stored indefinitely.

  13. Modelling an Agile Camera Camera state representation: 3D position, orientation, velocity and angular velocity:  r W  q WR   x v =   v W   ω R Each feature state is a 3D position vector:   x i y i = y i   z i

  14. Prediction Step: A ‘Smooth Motion’ Model Assume bounded, Gaussian-distributed linear and angular acceleration. r W + ( v W + V W )∆ t  r W    new q WR × q (( ω R + Ω R )∆ t ) q WR  new    f v =  = v W + V W  v W     new   ω R + Ω R ω R new

  15. Measurement Step: Image Features and Active Search • Salient feature patches detected to serve as visual landmarks. • Uncertainty-guided active search within elliptical regions.

  16. Automatic Map Management • Initialise system from a few known features. • Add a new feature if number of measurable features drops below threshold (e.g. 10). • Choose salient image patch from search box not overlapping existing features.

  17. Monocular Feature Initialisation with Depth Particles 3.5 3 Probability Density 2.5 2 1.5 1 0.5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 Depth (m)

  18. MonoSLAM Davison, ICCV 2003; Davison, Molton, Reid, Stasse, PAMI 2007.

  19. Application: HRP-2 Humanoid at JRL, AIST, Japan • Small circular loop within a large room • No re-observation of ‘old’ features until closing of large loop.

  20. HRP2 Loop Closure (Davison, Stasse, et al. , PAMI 2007)

  21. SLAM as a Bayesian Network x 0 x 1 x 2 x 3 z z z z z z z z z z z z z z z z 7 1 2 3 4 5 6 8 9 10 11 12 13 14 15 16 y y y y y y 1 2 3 4 5 6 (See ‘Probabilistic Robotics’, Thrun, Burgard and Fox, MIT Press 2005.)

  22. Real-Time Monocular SLAM: Why Filter? entropy reduction in bits 20 10 0 400 200 15 10 n 5 m • Hauke Strasdat, J. M. M. Montiel and Andrew J. Davison, ICRA 2010. • A comparison: filtering vs. keyframes + optimisation for monocular SLAM in terms of accuracy and computational cost. • A clear winner with modern computing resources: keyframes + optimisation.

  23. General Components of a Scalable SLAM System Local Motion Estimation Loop Closure Detection Global Map Relaxation

  24. Local Metric Estimation: ‘Visual Odometry’ • Civera et al. , IROS 2009 (monocular EKF ‘forgetting filter’). • High feature count provides local accuracy.

  25. Active Matching for Super-Efficient Tracking • Many systems work well if the update rate can be kept high, because knowledge of continuity to permits local search: tracking . • Active Matching: sequential, one by one search for global correspondence driven by expected information gain. • Active Matching: Chli, Davison, ECCV 2008

  26. Scalable Active Matching • Efficient transfer of matching result from feature to feature by message passing through a tree. (Scalable Active Matching: Handa, Chli, Strasdat, Davison, CVPR 2010)

  27. Global Topological: ‘Loop Closure Detection’ • Angeli et al. , IEEE Transactions on Robotics 2008.

  28. SLAM for Scene Segmentation and Understanding • Keypoint clustering and video segmentation, Angeli and Davison BMVC 2010.

  29. Optimisation: ‘Pose Graph Relaxation’ • Keyframe-based spherical mosaicing, Lovegrove and Davison, ECCV 2010. • Local tracking relative to keyframes with parallel global optimisation.

  30. Large Scale Monocular SLAM using Optimisation Scale Drift-Aware Large Scale Monocular SLAM (Strasdat, Montiel, Davison, Robotics: Science and Systems 2010).

  31. Live Dense Reconstruction with a Single Camera (Newcombe, Davison, CVPR 2010) • During live camera tracking, perform dense per-pixel surface reconstruction. • Relies heavily on GPU processing for dense image matching. • Runs live on current desktop hardware.

  32. Live Dense Reconstruction with a Single Camera Point Cloud Base Surface Bundle Matching D ( u,v ) Dense Depth Map Depth Map Stitching

  33. Live Dense Reconstruction with a Single Camera • Multiple depths maps stitched live into single desktop model.

  34. Live Dense Reconstruction with a Single Camera

Recommend


More recommend