ECE 6504: Deep Learning for Perception Topics: LSTMs (intuition - PowerPoint PPT Presentation

ECE 6504: Deep Learning for Perception Topics: – LSTMs (intuition and variants) – [Abhishek:] Lua / Torch Tutorial Dhruv Batra Virginia Tech

Administrativia • HW3 – Out today – Due in 2 weeks – Please please please please please start early – https://computing.ece.vt.edu/~f15ece6504/homework3/ (C) Dhruv Batra 2

RNN • Basic block diagram (C) Dhruv Batra 3 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

Key Problem • Learning long-term dependencies is hard (C) Dhruv Batra 4 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

Meet LSTMs • How about we explicitly encode memory? (C) Dhruv Batra 5 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTMs Intuition: Memory • Cell State / Memory (C) Dhruv Batra 6 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTMs Intuition: Forget Gate • Should we continue to remember this “bit” of information or not? (C) Dhruv Batra 7 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTMs Intuition: Input Gate • Should we update this “bit” of information or not? – If so, with what? (C) Dhruv Batra 8 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTMs Intuition: Memory Update • Forget that + memorize this (C) Dhruv Batra 9 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTMs Intuition: Output Gate • Should we output this “bit” of information to “deeper” layers? (C) Dhruv Batra 10 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTMs Intuition: Output Gate • Should we output this “bit” of information to “deeper” layers? (C) Dhruv Batra 11 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTMs • A pretty sophisticated cell (C) Dhruv Batra 12 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTM Variants #1: Peephole Connections • Let gates see the cell state / memory (C) Dhruv Batra 13 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTM Variants #2: Coupled Gates • Only memorize new if forgetting old (C) Dhruv Batra 14 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

LSTM Variants #3: Gated Recurrent Units • Changes: – No explicit memory; memory = hidden output – Z = memorize new and forget old (C) Dhruv Batra 15 Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

RMSProp Intuition • Gradients ≠ Direction to Opt – Gradients point in the direction of steepest ascent locally – Not where we want to go long term • Mismatch gradient magnitudes – magnitude large = we should travel a small distance – magnitude small = we should travel a large distance (C) Dhruv Batra 16 Image Credit: Geoffrey Hinton

RMSProp Intuition • Keep track of previous gradients to get an idea of magnitudes over batch • Divide by this accumulate (C) Dhruv Batra 17

ECE 6504: Deep Learning for Perception Topics: LSTMs (intuition - PowerPoint PPT Presentation

ECE 6504: Deep Learning for Perception Topics: LSTMs (intuition and variants) [Abhishek:] Lua / Torch Tutorial Dhruv Batra Virginia Tech Administrativia HW3 Out today Due in 2 weeks Please please please please please

ECE 6504: Deep Learning for Perception Topics: (Finish) Backprop Convolutional Neural

ECE 6504: Deep Learning for Perception Topics: Recurrent Neural Networks (RNNs) BackProp

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

Visual Perception human perception display devices 1 CS 349 - Visual Perception Reference

PLAYING ATARI WITH DEEP REINFORCEMENT LEARNING NEURAL NETWORK VISION FOR ROBOT DRIVING ARJUN

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

MODULES AS PERCEPTUAL INPUT - SYSTEMS Language Perception Visual Auditory Perception

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

(Deep) Learning for Robot Perception and Navigation Wolfram Burgard Deep Learning for Robot

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

For New Construction & Ship Repair PERCEPTION ESTI-MATE PERCEPTION ESTI-MATE 1 PERCEPTION

Dynamic Memory Management The Linux Perspective Allocating memory: The

Avoiding Pitfalls when Using NVIDIA GPUs for Real-Time Tasks in Autonomous Systems Ming Yang,

GARBAGE BAGE CO COLLECTIO LLECTION: N: @EvaAndreasson, @Cloudera AGENDA Garbage

Portable Parallelization Strategies Charles Leggett CCE Kickoff Meeting, ANL March 9 2020 1 C.

More Advanced OpenMP This is an abbreviated form of Tim Mattsons and Larry Meadows

Short-term Memory for Self-collecting Mutators Martin Aigner, Andreas Haas , Christoph M. Kirsch,

Never-Ending Learning ICML 2019 Tutorial Tom Mitchell Partha Talukdar Carnegie Mellon

RDMAP and DDP Overview Renato Recio 11/22/2002 1 Introduction I Direct Data Placement A

ECE 6504: Deep Learning for Perception Topics: LSTMs (intuition - PowerPoint PPT Presentation

ECE 6504: Deep Learning for Perception Topics: LSTMs (intuition and variants) [Abhishek:] Lua / Torch Tutorial Dhruv Batra Virginia Tech Administrativia HW3 Out today Due in 2 weeks Please please please please please

ECE 6504: Deep Learning for Perception Topics: (Finish) Backprop Convolutional Neural

ECE 6504: Deep Learning for Perception Topics: Recurrent Neural Networks (RNNs) BackProp

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

Visual Perception human perception display devices 1 CS 349 - Visual Perception Reference

PLAYING ATARI WITH DEEP REINFORCEMENT LEARNING NEURAL NETWORK VISION FOR ROBOT DRIVING ARJUN

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale

MODULES AS PERCEPTUAL INPUT - SYSTEMS Language Perception Visual Auditory Perception

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

(Deep) Learning for Robot Perception and Navigation Wolfram Burgard Deep Learning for Robot

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

For New Construction &amp; Ship Repair PERCEPTION ESTI-MATE PERCEPTION ESTI-MATE 1 PERCEPTION

Dynamic Memory Management The Linux Perspective Allocating memory: The

Avoiding Pitfalls when Using NVIDIA GPUs for Real-Time Tasks in Autonomous Systems Ming Yang,

GARBAGE BAGE CO COLLECTIO LLECTION: N: @EvaAndreasson, @Cloudera AGENDA Garbage

Portable Parallelization Strategies Charles Leggett CCE Kickoff Meeting, ANL March 9 2020 1 C.

More Advanced OpenMP This is an abbreviated form of Tim Mattsons and Larry Meadows

Short-term Memory for Self-collecting Mutators Martin Aigner, Andreas Haas , Christoph M. Kirsch,

Never-Ending Learning ICML 2019 Tutorial Tom Mitchell Partha Talukdar Carnegie Mellon

RDMAP and DDP Overview Renato Recio 11/22/2002 1 Introduction I Direct Data Placement A

For New Construction & Ship Repair PERCEPTION ESTI-MATE PERCEPTION ESTI-MATE 1 PERCEPTION