dc flow
play

DC Flow Jia Xu, Intel Labs May 24, 2017 Joint work with Ren Ranftl - PowerPoint PPT Presentation

DC Flow Jia Xu, Intel Labs May 24, 2017 Joint work with Ren Ranftl and Vladlen Koltun DC Flow, Intel Visual Computing Lab Optical Flow Input Output Dense correspondence for each pixel between two frames Optical Flow Key building block


  1. DC Flow Jia Xu, Intel Labs May 24, 2017 Joint work with RenΓ© Ranftl and Vladlen Koltun DC Flow, Intel Visual Computing Lab

  2. Optical Flow Input Output Dense correspondence for each pixel between two frames

  3. Optical Flow Key building block for many computer vision systems: - Video processing and analytics: motion detection, object tracking, action recognition, video segmentation, etc. - Robotics: visual odometry - Autonomous driving

  4. Optical Flow Key building block for many computer vision systems: - Video processing and analytics: motion detection, object tracking, action recognition, video segmentation, etc. - Robotics: visual odometry - Autonomous driving Challenges: - large displacements, textureless regions, motion blur, and non-rigid deformation.

  5. Stereo v.s. Optical Flow Stereo -256 0 256 1-D displacement Left image Right image Optical Flow -256 -256 0 256 256 First image 2-D displacement Second image

  6. Prior Work Sparse-to-dense regime: - Finding matches with hand-crafted feature: Brox and Malik 2014, EpicFlow (Revaud et al. 2015), DiscreteFlow (Menze et al. 2015), FlowFields (Bailer et al. 2015), CPM (Hu et al. 2016), FullFlow (Chen and Koltun, 2016) - Approximation with nearest neighbor search or coarse- to-fine schemes: Brox and Malik 2014, DiscreteFlow (Menze et al. 2015), FlowFields (Bailer et al. 2015), CPM (Hu et al. 2016) Learning based methods : FlowNet (Dosovitskiy et al. 2015), PatchBatch (Gadot and Wolf 2016), DeepDiscreteFlow (Guney and Geige, 2016) Domain specific methods : SOF (Sevilla-Lara et al. 2016), JHS (Hur and Roth 2016), SDF (Bai et al. 2016)

  7. Our Idea Direct 4-D cost volume processing 𝑁× 𝑂 Γ— 𝑆× 𝑆 First frame Second frame

  8. Overview

  9. Learning a Pixel-level Feature Embedding xp xa xn Learning with triplet loss norm norm norm conv conv conv ... ... ... conv+relu conv+relu conv+relu conv+relu conv+relu conv+relu Positive patch Anchor patch Negative patch

  10. Direct Cost Volume Processing Build 4-D cost volume: dot products then stored with 8 bits Semi-Global Matching for optical flow Four cardinal path directions β€’ 16-bit filtered cost volume β€’ Smallest cost gives the final correspondence β€’ Forward and backward matching β€’

  11. Post-processing

  12. Post-processing

  13. Sintel Test Set 250 DeepDiscreteFlow FlowFields+ SPM-BPv2 200 FlowFields CPM-Flow Runtime (sec) FullFlow 150 Ours 100 50 0 5.1 5.2 5.3 5.4 5.5 5.6 5.7 5.8 5.9 6 EPE-all

  14. Sintel Test Set

  15. KITTI 2015 Test Set

  16. KITTI 2015 Test Set

  17. Run-time

  18. Qualitative Result - Sintel

  19. Qualitative Results – KITTI 2015

  20. Ablation Study - Components

  21. Ablation Study - Dimensionality

  22. Summary An optical flow estimation approach that directly constructs β€’ and processes the 4-D cost volume A step towards unifying optical flow and stereo estimation β€’ Our approach combines high accuracy with competitive β€’ runtimes, outperforming prior methods on standard benchmarks by significant margins More details β€’ http://pages.cs.wisc.edu/~jiaxu/dcflow/ β€’

  23. Thank you! Acknowledgements: Qifeng Chen(Intel VCL), Alexey Dosovitskiy(Intel VCL)

Recommend


More recommend