CS 4803 / 7643: Deep Learning
Topics:
– (Finish) Computing Gradients
– Backprop in Conv Layers
– Forward mode vs. Reverse mode AD
– Modern CNN Architectures
Zsolt Kira, Georgia Tech
The architecture of LeNet-5
Handwriting Recognition Example
Translation Invariance
Some Rotation Invariance
Some Scale Invariance
Case Studies
• There are several generations of ConvNets
  – 2012–2014: AlexNet, ZFNet, VGGNet
    • Conv-ReLU, Pooling, Fully connected, Softmax (see the sketch below)
    • Deeper ones (VGGNet) tend to do better
  – 2014
    • Fully-convolutional networks for semantic segmentation
    • Matrix outputs rather than just one probability distribution
  – 2014–2016
    • Fully-convolutional networks for classification
    • Fewer parameters, faster than comparable Gen-1 networks
    • GoogLeNet, ResNet
  – 2014–2016
    • Detection layers (proposals)
    • Caption generation (combine with RNNs for language)
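A minimal sketch of the Gen-1 recipe above (Conv-ReLU, Pooling, Fully connected, Softmax), assuming PyTorch and illustrative layer sizes for 32×32 RGB inputs with 10 classes; it is not any of the named architectures:

```python
# Sketch of a "Gen 1" ConvNet stack: Conv-ReLU, Pooling, FC, Softmax.
# Layer sizes are illustrative assumptions, not AlexNet/ZFNet/VGG.
import torch
import torch.nn as nn

gen1_convnet = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),   # Conv
    nn.ReLU(),                                    # ReLU
    nn.MaxPool2d(2),                              # Pooling: 32x32 -> 16x16
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(2),                              # 16x16 -> 8x8
    nn.Flatten(),
    nn.Linear(32 * 8 * 8, 10),                    # Fully connected
    nn.Softmax(dim=1),                            # Softmax over classes
)

# In training code one would usually keep the logits and use
# cross-entropy loss instead of an explicit Softmax layer.
probs = gen1_convnet(torch.randn(4, 3, 32, 32))   # shape (4, 10)
```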
An Aside
AlexNet: 60M params, ZFNet: 75M, VGG: 138M, GoogLeNet: 5M
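The gap between VGG's 138M parameters and GoogLeNet's 5M comes mostly from the large fully connected layers that GoogLeNet avoids. A back-of-the-envelope sketch (hypothetical helper functions, illustrative layer shapes):

```python
# Rough parameter counting for conv vs. fully connected layers.

def conv_params(c_in, c_out, k):
    """Weights + biases of a k x k convolution layer."""
    return c_out * (c_in * k * k + 1)

def fc_params(n_in, n_out):
    """Weights + biases of a fully connected layer."""
    return n_out * (n_in + 1)

# VGG-16's first FC layer alone: 512 x 7 x 7 inputs -> 4096 outputs
print(fc_params(512 * 7 * 7, 4096))   # ~103M of the ~138M total
# A typical 3x3 conv layer with 512 channels in and out
print(conv_params(512, 512, 3))       # ~2.4M
```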
Importance of Depth
• After a while, adding depth decreases performance
• At first, the culprit was vanishing/exploding gradients, addressed by:
  – Normalized initialization
  – Batch normalization
  – 2nd-order methods
• Then, an optimization limitation remains
  – A deeper network should be able to mimic a shallower one (see the residual-block sketch below)
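One way to make the "mimic a shallower network" point concrete is the ResNet-style residual block (ResNet appears in the case studies above). The sketch below, assuming PyTorch, is illustrative rather than the exact published block: if the two convolutions learn to output zero, the block reduces to the identity, so extra depth need not hurt.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Illustrative residual block: two 3x3 convs with batch norm
    plus a skip connection that preserves the identity path."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU()

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)   # skip connection keeps the identity

x = torch.randn(2, 64, 16, 16)
print(ResidualBlock(64)(x).shape)   # torch.Size([2, 64, 16, 16])
```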
Localization and Detection
Computer Vision Tasks
Classification + Localization
CLS - ImageNet
Idea 1: Localization as Regression
Per-Class vs. Class-Agnostic
Where to attach?
Multiple Objects
Human Pose Estimation
Sliding Window: OverFeat
Sliding Window: OverFeat — Why don't the boxes align with the grid?
Detection as Classification
R-CNN
Region of Interest (ROI) Pooling