Neural Attention for Object Tracking

  1. April 4-7, 2016 | Silicon Valley Neural Attention for Object Tracking Brian Cheung bcheung@berkeley.edu Redwood Center for Theoretical Neuroscience, UC Berkeley Visual Computing Research, NVIDIA

  2. Source: Wikipedia “School Bus”

  3. Motivation: solving complex vision problems ● Question Answering ● Search ● Navigation. Two core components: ● Attention ● Memory

  4. Emergent Properties from Attention (Xu et al. 2015)

  5. Recurrent Networks. Diagram: input x(t), hidden state h(t), output o(t).
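
The diagram boils down to one state-update rule applied at every timestep. A minimal NumPy sketch (the function and weight names here are illustrative, not from the talk):

```python
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, W_ho, b_h, b_o):
    """One step of a vanilla recurrent network: the hidden state h(t)
    depends on the input x(t) and the previous state h(t-1); the
    output o(t) is read out from h(t)."""
    h_t = np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)
    o_t = h_t @ W_ho + b_o
    return h_t, o_t

# Unroll over a short random input sequence.
rng = np.random.default_rng(0)
n_in, n_hid, n_out, T = 4, 8, 3, 5
W_xh = 0.1 * rng.normal(size=(n_in, n_hid))
W_hh = 0.1 * rng.normal(size=(n_hid, n_hid))
W_ho = 0.1 * rng.normal(size=(n_hid, n_out))
b_h, b_o = np.zeros(n_hid), np.zeros(n_out)

h = np.zeros(n_hid)
outputs = []
for t in range(T):
    x_t = rng.normal(size=n_in)
    h, o = rnn_step(x_t, h, W_xh, W_hh, W_ho, b_h, b_o)
    outputs.append(o)
```

The same hidden state h is threaded through every step, which is what lets the network accumulate information across a sequence of glimpses.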

  6. Formulating a Glimpse. Parameters in the kernel control the layout of the attention window over the original image: translation and scale.
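
One way such a kernel-based glimpse can be realized is a separable bank of Gaussian interpolation kernels, where a center parameter gives translation and a stride parameter gives scale. A minimal sketch; the names, kernel count, and parameterization are illustrative assumptions:

```python
import numpy as np

def glimpse_filters(center, stride, sigma, n_kernels, img_size):
    """A 1-D bank of Gaussian interpolation kernels. `center`
    (translation) and `stride` (scale) lay out the kernel means
    along the image axis; `sigma` sets each kernel's width."""
    mu = center + (np.arange(n_kernels) - n_kernels / 2.0 + 0.5) * stride
    pos = np.arange(img_size)
    F = np.exp(-((pos[None, :] - mu[:, None]) ** 2) / (2 * sigma ** 2))
    return F / F.sum(axis=1, keepdims=True)   # normalize each kernel row

def take_glimpse(image, cx, cy, stride, sigma, n=12):
    """Apply separable x/y kernel banks to extract an n-by-n glimpse."""
    Fx = glimpse_filters(cx, stride, sigma, n, image.shape[1])
    Fy = glimpse_filters(cy, stride, sigma, n, image.shape[0])
    return Fy @ image @ Fx.T

img = np.random.default_rng(0).random((28, 28))
# Centered glimpse; growing `stride` zooms out, shifting cx/cy pans.
g = take_glimpse(img, cx=14.0, cy=14.0, stride=1.0, sigma=0.8)
```

Because the glimpse is a smooth function of (cx, cy, stride, sigma), these attention parameters can be learned by backpropagation.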

  7. Spatial Transformer (Jaderberg et al. 2015)

  8. Spatial Transformer Network (Jaderberg et al. 2015)
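
The core of a spatial transformer can be sketched in plain NumPy: an affine matrix θ maps a regular output grid to source coordinates, which are then sampled bilinearly. This is a simplified single-channel illustration, not the paper's implementation:

```python
import numpy as np

def spatial_transformer(image, theta, out_h, out_w):
    """Warp a single-channel image through an affine transform.
    `theta` is a 2x3 matrix mapping normalized output coordinates
    in [-1, 1] to normalized input coordinates; sampling is bilinear."""
    H, W = image.shape
    # Grid generator: regular grid over the output, in [-1, 1].
    ys, xs = np.meshgrid(np.linspace(-1, 1, out_h),
                         np.linspace(-1, 1, out_w), indexing="ij")
    grid = np.stack([xs.ravel(), ys.ravel(), np.ones(out_h * out_w)])
    src = theta @ grid                       # source coords, (2, out_h*out_w)
    # Sampler: back to pixel coordinates, then bilinear interpolation.
    sx = (src[0] + 1) * (W - 1) / 2
    sy = (src[1] + 1) * (H - 1) / 2
    x0 = np.clip(np.floor(sx).astype(int), 0, W - 2)
    y0 = np.clip(np.floor(sy).astype(int), 0, H - 2)
    wx, wy = sx - x0, sy - y0
    out = (image[y0, x0] * (1 - wx) * (1 - wy)
           + image[y0, x0 + 1] * wx * (1 - wy)
           + image[y0 + 1, x0] * (1 - wx) * wy
           + image[y0 + 1, x0 + 1] * wx * wy)
    return out.reshape(out_h, out_w)

# The identity transform reproduces the input.
img = np.arange(16.0).reshape(4, 4)
identity = np.array([[1.0, 0.0, 0.0],
                     [0.0, 1.0, 0.0]])
warped = spatial_transformer(img, identity, 4, 4)
```

Scaling the diagonal of θ zooms the window; the last column translates it, which is exactly the translation/scale control from the glimpse slide.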

  9. Foveal Attention Network (Cheung et al. 2015). Diagram: Image → Glimpse Network → Recurrent Network → Classification Network and Location Network.

  14. Foveal Attention Network (Cheung et al. 2015). Diagram: Image → Glimpse Network → Recurrent Network → Location Network and Classification Network, which outputs ‘5’.

  15. Benefits of Attention ● Fewer parameters / less computation ○ Smaller convolutional network ● Better performance ○ Significantly outperforms a ConvNet applied to the entire image ○ Breaks complex problems into a sequence of simpler problems ○ Filters out noise and distractors ● Localization information comes for free

  16. KITTI Tracking Dataset (Geiger et al. 2012) ● 375x1240 video ● Bounding boxes over time for cars, pedestrians, etc.

  17. Tracking Network: generate the image glimpse. Glimpse = T_θ(Image(t), θ_loc(t-1)). Diagram: Localization Network, Recurrent Network, Convolutional Network, Grid Generator.

  18. Tracking Network: generate features from the ConvNet. h_cnet(t) = f_cnet(Glimpse(t)). Diagram: Localization Network, Recurrent Network, Convolutional Network, Grid Generator.

  19. Tracking Network: generate features from the Recurrent Network. h_rnn(t) = f_rnn(h_cnet(t), θ_loc(t-1), h_rnn(t-1)). Diagram: Localization Network, Recurrent Network, Convolutional Network, Grid Generator.

  20. Tracking Network: generate parameters for the next glimpse from the Localization Network. θ_loc(t) = f_loc(h_rnn(t-1)). Diagram: Localization Network, Recurrent Network, Convolutional Network, Grid Generator.

  21. Tracking Network: generate the tracking prediction from the Tracking Network. θ_pred(t), y_pres(t) = f_tracking(h_rnn(t-1)). Diagram: Localization Network, Recurrent Network, Convolutional Network, Grid Generator.
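
Slides 17–21 chain into one forward step per video frame. A minimal sketch with stand-in linear layers; all weights, layer sizes, and the simple crop-based T_θ are illustrative assumptions, not the talk's trained model:

```python
import numpy as np

rng = np.random.default_rng(0)
n_feat, n_hid = 16, 32

# Stand-in parameters: the real model uses a trained ConvNet and RNN.
W_cnet   = 0.1 * rng.normal(size=(12 * 12, n_feat))
W_rnn_in = 0.1 * rng.normal(size=(n_feat + 4, n_hid))
W_rnn_hh = 0.1 * rng.normal(size=(n_hid, n_hid))
W_loc    = 0.1 * rng.normal(size=(n_hid, 4))   # theta_loc: next glimpse params
W_track  = 0.1 * rng.normal(size=(n_hid, 5))   # theta_pred (4) + y_pres (1)

def T_theta(image, theta_loc):
    """Grid generator + sampler stub: crop a 12x12 window at the
    translation part of theta_loc (scale is ignored in this sketch)."""
    cy = int(np.clip(theta_loc[0], 0, image.shape[0] - 12))
    cx = int(np.clip(theta_loc[1], 0, image.shape[1] - 12))
    return image[cy:cy + 12, cx:cx + 12]

def step(image_t, theta_loc_prev, h_prev):
    glimpse = T_theta(image_t, theta_loc_prev)               # slide 17
    h_cnet = np.tanh(glimpse.ravel() @ W_cnet)               # slide 18
    rnn_in = np.concatenate([h_cnet, theta_loc_prev])
    h_rnn = np.tanh(rnn_in @ W_rnn_in + h_prev @ W_rnn_hh)   # slide 19
    theta_loc = h_rnn @ W_loc                                # slide 20
    pred = h_rnn @ W_track                                   # slide 21
    theta_pred, y_pres = pred[:4], pred[4]
    return theta_loc, theta_pred, y_pres, h_rnn

frame = rng.random((64, 64))
theta0, h0 = np.array([20.0, 20.0, 1.0, 1.0]), np.zeros(n_hid)
theta_loc, theta_pred, y_pres, h = step(frame, theta0, h0)
```

Running `step` once per frame, feeding each frame's θ_loc and h_rnn into the next, yields the glimpse-predict-reattend loop the diagram describes.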

  22. Pretraining on the Classification Task {‘Car’, ‘Pedestrian’, ‘Truck’, ‘Tram’, ‘Cyclist’, ‘Misc’, ‘Van’, ‘Person Sitting’}. The Convolutional Network reaches ~3% classification error on the validation set. Diagram: Convolutional Network, Grid Generator.

  23. Pretraining on the Registration Task. Diagram: glimpse parameters θ, Convolutional Network, Grid Generator.

  24. Pretraining on the Registration Task ● A simpler task similar to tracking: fix a bad glimpse ● Provides a useful training signal for the Localization Network. Figure: input glimpse, predicted correction, actual correction.
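
Training pairs for such a registration task can be synthesized by jittering a ground-truth box and asking the network to predict the offset that undoes it. A sketch under assumed box and jitter conventions (the function name and box format are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)

def make_registration_pair(image, box, max_jitter=4):
    """One registration-pretraining sample: perturb a ground-truth
    box by a random offset and record the correction that undoes it,
    which the Localization Network is trained to predict."""
    y, x, h, w = box                               # top-left corner + size
    dy, dx = rng.integers(-max_jitter, max_jitter + 1, size=2)
    jy = int(np.clip(y + dy, 0, image.shape[0] - h))
    jx = int(np.clip(x + dx, 0, image.shape[1] - w))
    bad_glimpse = image[jy:jy + h, jx:jx + w]      # the "bad" input glimpse
    correction = np.array([y - jy, x - jx])        # regression target
    return bad_glimpse, correction

img = rng.random((64, 64))
glimpse, target = make_registration_pair(img, box=(20, 20, 16, 16))
```

Unlike full tracking, every sample here comes with an exact supervised target, which is what makes the gradient signal for the Localization Network so much cleaner.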

  25. Comparing Training Gradients: without pretraining (random initialization) vs. with ConvNet pretraining.

  26. Bouncing MNIST

  27. Bouncing MNIST

  28. Bouncing MNIST. Plots: MNIST position and attention position over time, x and y coordinates, comparing ground truth with the Tracking Network and Localization Network outputs.
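
The ground-truth positions in such sequences come from a digit bouncing off the canvas walls. A sketch of the trajectory generation (an assumed motion model, not the exact dataset recipe):

```python
import numpy as np

def bouncing_trajectory(n_steps, canvas=64, digit=28,
                        start=(0.0, 0.0), velocity=(3.0, 2.0)):
    """Top-left positions of a digit bouncing off the canvas walls,
    the motion model behind Bouncing MNIST-style sequences."""
    lim = canvas - digit                    # largest valid top-left coord
    pos = np.array(start, dtype=float)
    vel = np.array(velocity, dtype=float)
    traj = []
    for _ in range(n_steps):
        pos += vel
        for k in range(2):                  # reflect at each wall
            if pos[k] < 0 or pos[k] > lim:
                pos[k] = np.clip(pos[k], 0, lim)
                vel[k] = -vel[k]
        traj.append(pos.copy())
    return np.array(traj)

traj = bouncing_trajectory(100)
```

Pasting a digit at each trajectory point yields a video with exact per-frame position labels, so predicted and ground-truth x/y curves can be plotted directly, as on the slide.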

  29. Conclusions ● End-to-end visual attention works for simple tasks ● Robust to the encoding of attention parameters

  30. Conclusions ● Difficult to train on more complex tasks ○ First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks (Gan et al. 2015) ○ RATM: Recurrent Attentive Tracking Model (Kahou et al. 2015) ● Scaling computational costs

  31. Future Work ● Integrate more tailored components ○ Spatial Memory (Weiss et al. 2015) ● Train compact ImageNet models for initialization ● Exploration/unsupervised strategies to recover from mistakes ○ Error-Based Attention (Rezende et al. 2016)

  32. Acknowledgements Special thanks to: Shalini Gupta, Jan Kautz, Pavlo Molchanov, Stephen Tyree, Eric Weiss
