co compute mputer r vision 3 detection segmentation and
play

Co Compute mputer r Vision 3: Detection, , Segmentation and - PowerPoint PPT Presentation

Co Compute mputer r Vision 3: Detection, , Segmentation and and Tr Trac acking king CV3DST | Prof. Leal-Taix 1 The The Te Team Lecturers Prof. Dr. Laura Dr. Aljosa Leal-Taix Osep CV3DST | Prof. Leal-Taix 2 Wh What at


  1. Co Compute mputer r Vision 3: Detection, , Segmentation and and Tr Trac acking king CV3DST | Prof. Leal-Taixé 1

  2. The The Te Team Lecturers Prof. Dr. Laura Dr. Aljosa Leal-Taixé Osep CV3DST | Prof. Leal-Taixé 2

  3. Wh What at this cou ourse e is: • A course on Computer Vision – Object detection – Instance and semantic segmentation – Multiple object tracking in 2D and 3D • Other CV courses: – Computer Vision 2: Multiple View Geometry (WS) CV3DST | Prof. Leal-Taixé 3

  4. Wh What at this cou ourse e is NOT: • An Introduction to Deep Learning – Take “Introduction to Deep Learning” if you are not familiar with basic DL concepts • A practical project course – Take “Advanced Deep Learning for Computer Vision” • A theoretical introduction into 3D Vision – Take “Computer Vision 2: Multiple View Geometry (WS)” CV3DST | Prof. Leal-Taixé 4

  5. Wh What at is Com omputer er Vi Vision on? First defined in the 60s in artificial intelligence groups • “Mimic the human visual system” • Center block of robotic intelligence • CV3DST | Prof. Leal-Taixé 5

  6. CV3DST | Prof. Leal-Taixé 6

  7. Comp Computer er Vis ision ion Give eyes to a computer CV3DST | Prof. Leal-Taixé 9

  8. Comp Computer er Vis ision ion Understand every pixel of an image CV3DST | Prof. Leal-Taixé 10

  9. Comp Computer er Vis ision ion Understand every pixel of an image tree car person Semantic segmentation road CV3DST | Prof. Leal-Taixé 11

  10. Comp Computer er Vis ision ion Understand every pixel of an image tree person 2 car Instance- based segmentation Semantic person 1 segmentation person 3 road CV3DST | Prof. Leal-Taixé 12

  11. Comp Computer er Vis ision ion Understand every pixel of a video Multiple object tracking Instance- based segmentation Semantic segmentation CV3DST | Prof. Leal-Taixé 13

  12. Dyn Dynamic c Sce Scene Understa tanding Understand every pixel of a video Multiple object tracking Instance- based segmentation Semantic segmentation CV3DST | Prof. Leal-Taixé 14

  13. Au Auto tono nomous drivi ving ng CV3DST | Prof. Leal-Taixé 15

  14. Au Auto tono nomous drivi ving ng CV3DST | Prof. Leal-Taixé 16

  15. Un Underst standin ing a an ima image CV3DST | Prof. Leal-Taixé 17 Credit: Li/Karpathy/Johnson

  16. Un Underst standin ing a an ima image K. He, G. Gkioxari, P. Dollar, R. Girshick. Mask R-CNN. ICCV 2017. CV3DST | Prof. Leal-Taixé 18

  17. Un Underst standin ing a an ima image CV3DST | Prof. Leal-Taixé 19

  18. Un Underst standin ing a an ima image CV3DST | Prof. Leal-Taixé 20

  19. Un Underst standin ing a an ima image CV3DST | Prof. Leal-Taixé 21

  20. Un Underst standin ing a an ima image • Different representations depending on the granularity – Detections (coarse) – Segmentations (precise) – Semantic with/without instances (person 1, person 2) • Goes well with Deep Learning CV3DST | Prof. Leal-Taixé 22

  21. Un Underst standin ing a an v vid ideo • Temporal domain which brings us advantages – A lot of redundancy – A smoothness assumption: things do not change much from one frame to another • … but also disadvantages – At 30 FPS, image the computation one has to do to process a video…. – Occlusions, multiple objects moving and interacting… CV3DST | Prof. Leal-Taixé 23

  22. Un Underst standin ing a an v vid ideo: t : then CV3DST | Prof. Leal-Taixé 24

  23. Un Underst standin ing a an v vid ideo: n : now CV3DST | Prof. Leal-Taixé 25

  24. Un Underst standin ing a an v vid ideo • Where is every object going? • How are objects interacting? • Get consistent results in the temporal dimension CV3DST | Prof. Leal-Taixé 26

  25. Rou Rough schedu edule/c e/con onten ent • 1. Introduction • 2. Object Detection 1 • 3. Object Detection 2 • 4. Single/Multiple object tracking • 5. Multiple object tracking • 6. Trajectory prediction • 7. Semantic segmentation • 8. Instance Segmentation • 9. Video object segmentation • 10. Going towards 3D tracking and segmentation CV3DST | Prof. Leal-Taixé 27

  26. Rou Rough schedu edule/c e/con onten ent • RCNN, Fast RCNN and Faster RCNN • YOLO, SSD, RetinaNet • Siamese networks – Person Re-Identification • Message Passing Networks • Network (non-neural) flow for tracking • Generative Adversarial Networks – trajectory prediction Mask-RCNN, UPSNet (panoptic segmentation) • Deformable/atrous convolutions • 3D – data, algorithms. • CV3DST | Prof. Leal-Taixé 28

  27. Our Our Research rch Lab Dynamic Vision and Learning Group https://dvl.in.tum.de/ CV3DST | Prof. Leal-Taixé 29

  28. Ab About t the the lectu ture Theory: 10-11 lectures • Every Wednesday 16:00-18:00 (MI HS 2) • Lectures will be recorded this year! • – Due to the virus situation, we will make all slides and video recordings available every Wednesday https://dvl.in.tum.de/teaching/cv3dst-ss20/ CV3DST | Prof. Leal-Taixé 30

  29. Gra Grading syst g system Exam: tbd tbd • There will be a retake exam as this course will be • moved permanently to Summer Semester Completing the practical part successfully gives a • bonus of 0.3 CV3DST | Prof. Leal-Taixé 31

  30. Pr Pract ctica cal l pa part • Internal Kaggle competition • We will have a tracking challenge. • You will all start from the same point (code) • After that, it will be an open competition. • Check out the video of the presentation of the challenge! CV3DST | Prof. Leal-Taixé 32

  31. Moodl Moodle • Announcements via Moodle - IMPORT PORTANT ANT! – Sign up in TUM online for access: https://www.moodle.tum.de/ – We will share common information (e.g., regarding exam) – Ask content questions online so others benefit – Don’t post solutions CV3DST | Prof. Leal-Taixé 33

  32. Ema Email ils & Slid ides es • All material will be uploaded on Moodle and the web • Questions regarding the syllabus, exercises or contents of the lecture, use Moodle! • Questions regarding organization of the course: dst@dvl.in.tum.de • Emails to the individual addresses will not be answered. CV3DST | Prof. Leal-Taixé 34

  33. Co Compute mputer r Vision 3: Detection, , Segmentation and and Tr Trac acking king CV3DST | Prof. Leal-Taixé 40

Recommend


More recommend