yolo you only look once
play

YOLO: You Only Look Once Unified Real-Time Object Detection Joseph - PowerPoint PPT Presentation

YOLO: You Only Look Once Unified Real-Time Object Detection Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi [Website] [Paper] [arXiv] [Reviews] Slides by: Andrea Ferri For: Computer Vision Reading Group (08/03/16) INTRODUCTION


  1. YOLO: You Only Look Once Unified Real-Time Object Detection Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi [Website] [Paper] [arXiv] [Reviews] Slides by: Andrea Ferri For: Computer Vision Reading Group (08/03/16)

  2. INTRODUCTION

  3. Nowadays State of the Art approach, are so architected: RPN RPN Proposals Conv layers Conv Layer 5 Class probabilities RPN Proposals RoI pooling layer FC layers Class scores

  4. This complex pipeline means that: Slow Pipeline Single Pipelines Hard to Optimize Need Parallel Training for Components

  5. WHAT’S NEW? (In the architecture approach.)

  6. Concepts Detection as Single Regression Problem Developed as Single Convolutional Network Reason Globally on the Entire Image Learns Generalizable Representations Easy & Fast

  7. Unified Detection

  8. Divide the image into a SxS grid. If the center of an object fall into a grid cell, it will be the responsible for the object. Each grid cell predict: B bounding boxes; B confidence scores as C=Pr(Obj)*IOU ; C cond. Class prob. as P=Pr( π‘«π’Žπ’ƒπ’•π’• 𝒋 |Object) ; Confidence Prediction is obtained as IOU of predicted box and any ground truth box.

  9. We obtain the class-specific confidence score as: Pr( π‘«π’Žπ’ƒπ’•π’• 𝒋 |Object)*Pr(Object)*IOU = Pr( π‘«π’Žπ’ƒπ’•π’• 𝒋 )*IOU

  10. Design

  11. Loss-Function

  12. Limitations Struggle with Small Object. Struggle with Different aspects and ratios of objects. Loss function is an approximation. Loss function threats errors in different boxes ratio at the same.

  13. EXPERIMENTS (How performs?.)

  14. General Comparison

  15. Fast R-CNN & YOLO

  16. Fast R-CNN & YOLO Using YOLO accuracy for Big object to avoid detection mistakes into Fast R-CNN:

  17. Fast R-CNN & YOLO

  18. SUMMARY (Why is an interesting approach.)

  19. Pros Trained on a loss function that directly corresponds to detection performance. The entire model is trained jointly. The fastest general-purpose object detector in the literature. At least detection at 45fps.

  20. References β€’ You Only Look Once: Unified, Real-Time Object Detection, Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi.

  21. THANKS !!! QUESTIONS?

Recommend


More recommend