cs 4803 7643 deep learning
play

CS 4803 / 7643: Deep Learning Topics: (Finish) Computing Gradients - PowerPoint PPT Presentation

CS 4803 / 7643: Deep Learning Topics: (Finish) Computing Gradients Backprop in Conv Layers Forward mode vs Reverse mode AD Modern CNN Architectures Zsolt Kira Georgia Tech The architecture of LeNet5 Handwriting Recognition


  1. CS 4803 / 7643: Deep Learning Topics: – (Finish) Computing Gradients – Backprop in Conv Layers – Forward mode vs Reverse mode AD – Modern CNN Architectures Zsolt Kira Georgia Tech

  2. The architecture of LeNet5

  3. Handwriting Recognition Example

  4. Translation Invariance

  5. Some Rotation Invariance

  6. Some Scale Invariance

  7. Case Studies • There are several generations of ConvNets – 2012 – 2014: AlexNet, ZNet, VGGNet • Conv-Relu, Pooling, Fully connected, Softmax • Deeper ones (VGGNet) tend to do better – 2014 • Fully-convolutional networks for semantic segmentation • Matrix outputs rather than just one probability distribution – 2014-2016 • Fully-convolutional networks for classification • Less parameters, faster than comparable Gen1 networks • GoogleNet, ResNet – 2014-2016 • Detection layers (proposals) • Caption generation (combine with RNNs for language)

  8. An Aside

  9. AlexNet: 60M params ZNet: 75M VGG: 138M GoogleNet: 5M

  10. Importance of Depth • After a while, adding depth decreases performance • At first, vanishing/exploding gradients • normalized initialization • Batch normalization • 2 nd order methods • Then, optimization limitation – Deeper network should be able to mimic shallow ones

  11. Localization and Detection

  12. Computer Vision Tasks

  13. Computer Vision Tasks

  14. Classification + Localization

  15. CLS - ImageNet

  16. Idea 1: Localization as Regression

  17. Per-Class vs. Class Agnostic

  18. Where to attach?

  19. Multiple Objects

  20. Human Pose Estimation

  21. Sliding Window: Overfeat

  22. Sliding Window: Overfeat

  23. Sliding Window: Overfeat

  24. Sliding Window: Overfeat

  25. Sliding Window: Overfeat

  26. Sliding Window: Overfeat

  27. Sliding Window: Overfeat Why aren’t boxes across grid?

  28. Sliding Window: Overfeat

  29. Detection as Classification

  30. Detection as Classification

  31. Detection as Classification

  32. Detection as Classification

  33. Detection as Classification

  34. R-CNN

  35. Region of Interest (ROI) Pooling

Recommend


More recommend