Learning Spatiotemporal Features with 3D Convolutional Networks Du - PowerPoint PPT Presentation

Nov 14, 2023 •136 likes •355 views

Learning Spatiotemporal Features with 3D Convolutional Networks Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri ada BAK 29.03.16 Effective Video Descriptor Generic Can represent different types Compact

Learning Spatiotemporal Features with 3D Convolutional Networks Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri Çağdaş BAK 29.03.16
Effective Video Descriptor • Generic – Can represent different types • Compact – Processing, storage • Efficient – computation • Simple – implementation
3D Convolution and Pooling • 3D Convolution is better than 2D Convolution to model temporal information. – 2D CONV : performed only spatially, lose temporal information. – 3D CONV : performed spatio-temporally, preserve temporal information. • Same phenomena is applicable for pooling.
2D Convolution On 1-ch Input • Result : 2D Image.
2D Convolution On n-ch Input • Result : 2D Image.
3D Convolution On n-ch Input • Result : Volume
Identify Best Architecture For 3D ConvNets (On UCF101) • Common network settings – All video frames resized into 128x171. – Videos are split into non-overlapped 16 frame clip. – Input : 3x16x128x171. – 5 Convolution and Pooling layer – 2 Fully Connected layer – Softmax Loss layer to predict action labels
Identify Best Architecture For 3D ConvNets (On UCF101) • Varying Network Architecture – Homogeneous temporal depth. • Depth –d for 1,3,5,7 – Varying temporal depth. • Increasing : 3-3-5-5-7 • Decreasing : 7-7-5-5-3-3
3D Convolution Kernel Temporal Depth Search
Spatiotemporal Feature Learning • Best Network Architecture – With 3x3x3 kernel
Spatiotemporal Feature Learning • Dataset for training – Sports 1M Dataset • Largest video classification benchmark • 1.1 million sports videos • 487 categories
Sports 1M Classification Results
C3D Video Descriptor • C3D Model can be used as a feature extractor for various video analysis tasks. – Action recognition – Action similarity – Scene and Object recognition • Using with fc6 activations – 4096 dimension
Action Recognition • Dataset : UCF101 – 13.320 video – 101 human action
Action Similarity Labeling • Dataset : ASLAN – 3,631 video – 432 action class
Scene Object Recognition • Dataset : YUPENN – 420 video – 14 scene • Dataset : Maryland – 130 video – 13 scene
Why C3D Features? • Generic • Compact • Efficient • Simple
What Does C3D Learn ?
Useful Links • http://vlg.cs.dartmouth.edu/c3d/ • https://github.com/facebook/C3D
Learning Spatiotemporal Features with 3D Convolutional Networks Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri Çağdaş BAK 29.03.16

Recommend

Convolutional Neural Networks ---- Off the shelf top notch performances Convolutional Neural

Transfer Learning with Convolutional Neural Networks ---- Off the shelf top notch performances Convolutional Neural Networks A breakthough Convolutional Neural Networks VGG-16 example Layers of Convolutional filters Bottleneck

625 views • 23 slides

Spatiotemporal Regulation of ERK by Spatiotemporal Regulation of ERK by Dual- -specificity

Spatiotemporal Regulation of ERK by Spatiotemporal Regulation of ERK by Dual- -specificity Phosphatases specificity Phosphatases Dual University of Bristol IN Cell 1000 University of Bristol IN Cell 1000 WT Equipment Grant WT Equipment

551 views • 20 slides

Convolutional Neural Networks Convolutional neural networks One of the major kinds of ANNs in use

Convolutional Neural Networks<br/><br/> 5/4/19, 4(03 PM Convolutional Neural Networks<br/><br/> 5/4/19, 4(03 PM Convolutional Neural Networks Convolutional neural networks One of the major kinds of ANNs in use UMaine

412 views • 9 slides

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER FEATURES 5 WATER FEATURES 6 WATER FEATURES 7 EXCLUSIVE POOLS 8 EXCLUSIVE POOLS 9 EXCLUSIVE POOLS 10 EXCLUSIVE POOLS 11 OVERFLOW 12

962 views • 40 slides

Anytime Reliability of Systematic LDPC Motivation Convolutional Codes LDPC Convolutional Codes

Anytime Reliability of Systematic... L. D ossel et al Anytime Reliability of Systematic LDPC Motivation Convolutional Codes LDPC Convolutional Codes Anytime LDPC Convolutional Codes Asymptotic Analysis L. D ossel, L. K. Rasmussen ,

566 views • 30 slides

Convolutional Autoencoder (CAE) Prof. Seungchul Lee Industrial AI Lab. Convolutional Autoencoder

Convolutional Autoencoder (CAE) Prof. Seungchul Lee Industrial AI Lab. Convolutional Autoencoder Motivation: image to autoencoder ? Convolutional autoencoder extends the basic structure of the simple autoencoder by changing the fully

349 views • 15 slides

Introduction CSCE 970 CSCE 970 Lecture 4: Lecture 4: Convolutional Convolutional Neural

Introduction CSCE 970 CSCE 970 Lecture 4: Lecture 4: Convolutional Convolutional Neural Neural CSCE 970 Lecture 4: Networks Networks Good for data with a grid-like topology Stephen Scott Convolutional Neural Networks Stephen Scott

355 views • 3 slides

Convolutional Kuan-Ting Lai 2020/3/31 Neural Network Convolutional Neural Networks (CNN)

Convolutional Kuan-Ting Lai 2020/3/31 Neural Network Convolutional Neural Networks (CNN) A.k.a. CNN or ConvNet Adit Deshpande, A Beginner's Guide To Understanding Convolutional Neural Networks. Digital Images Input array: an images

1.42k views • 72 slides

An Overview of Models and Methods for Spatiotemporal Data Analysis Jim Zidek- U British

An Overview of Models and Methods for Spatiotemporal Data Analysis Jim Zidek- U British Columbia, Vancouver, Canada May 30, 2012 Jim Zidek- (UBC) An Overview of Models and Methods for Spatiotemporal Data Analysis May 30, 2012 1

1.3k views • 111 slides

A spatiotemporal stochastic model for tropical precipitation and water vapor dynamics. Scott

A spatiotemporal stochastic model for tropical precipitation and water vapor dynamics. Scott Hottovy and Sam Stechmann (UW) shottovy@math.wisc.edu University of Wisconsin ONR DURIP grant N00014-14-1-0251 S. Hottovy, UW Spatiotemporal

382 views • 16 slides

Probabilistic Palm Rejection Using Spatiotemporal Touch Features and Iterative Classification

Probabilistic Palm Rejection Using Spatiotemporal Touch Features and Iterative Classification Julia Schwarz, Robert Xiao, Jennifer Mankoff, Scott E. Hudson, Chris Harrison ? ? ? ? pen palm palm palm Prior Software-Only Approaches

590 views • 36 slides

Using Spatiotemporal Features for Butterfly Classification MARTA SKRETA, SASHA LUCCIONI, DAVID

Using Spatiotemporal Features for Butterfly Classification MARTA SKRETA, SASHA LUCCIONI, DAVID ROLNICK Climate Change and Butterflies BUTTERFLIES ECOSYSTEM Temperature/weather impact Predators of butterflies/caterpillars Indirect via habitat

585 views • 11 slides

AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification Xiaofang Wang, Xuehan

AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification Xiaofang Wang, Xuehan Xiong, Maxim Neumann, AJ Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua Convolutional networks are dominant C3D [ICCV

746 views • 26 slides

ON TEGRA X1 ALAN WANG, NVIDIA Convolutional Neural Network optimization target Result

DIRECT CONVOLUTION FOR DEEP NEURAL NETWORK CLASSIFICATION ON TEGRA X1 ALAN WANG, NVIDIA Convolutional Neural Network optimization target Result Convolutional Fully Connected Input layer layer Convolutional Layer An example: A E

621 views • 18 slides

Convolutional Neural Nets CS447 Natural Language Processing (J. Hockenmaier)

Lecture 8: Convolutional Neural Nets Convolutional Neural Nets CS447 Natural Language Processing (J. Hockenmaier) https://courses.grainger.illinois.edu/cs447/ 1 Convolutional Neural Nets (ConvNets, CNNs) [4 parameters, applied 3

615 views • 24 slides

Semantic Segmentation of the sekleton in bone scintigraphy images with convolutional neural

Semantic Segmentation of the sekleton in bone scintigraphy images with convolutional neural networks Problem Description Convolutional Neural Networks Convolutional Layer Max Pooling Transforming Classification networks to segmentation

157 views • 12 slides

Formalisation in Constructive Type Theory of Barendregts Variable Convention for Generic

Formalisation in Constructive Type Theory of Barendregts Variable Convention for Generic Structures with Binders Ernesto Copello 1 Nora Szasz 2 lvaro Tasistro 2 1 Department of Computer Science The University of Iowa, USA 2 Facultad de

723 views • 55 slides

Towards Automatic Inference of Kernel Object Semantics from Binary Code Junyuan Zeng, and Zhiqiang

Introduction A RGOS Design Experimental Results Discussions & Related Work Summary & References Towards Automatic Inference of Kernel Object Semantics from Binary Code Junyuan Zeng, and Zhiqiang Lin Department of Computer Science

738 views • 53 slides

TCP/Generic Segmentation Offload and Its Application in Xen Herbert Xu Principal Software

TCP/Generic Segmentation Offload and Its Application in Xen Herbert Xu Principal Software Engineer Red Hat Asia Pacific What is TSO? Faster Ethernet (Gigabit) => higher CPU load: 1500-byte Ethernet MTU set in 70's. Amount of data

279 views • 13 slides

Using TCP Through So c k ets Da vid Mazi eres dm@amsterdam.lcs.mit.edu 1 File

Using TCP Through So c k ets Da vid Mazi eres dm@amsterdam.lcs.mit.edu 1 File descriptors 1 Most I/O on Unix systems tak es place through the and system calls . Before read write discussing net w ork I/O,

454 views • 24 slides

Change Tracking in Knowledge Organization Systems with skos-history Joachim Neubert & Osma

Change Tracking in Knowledge Organization Systems with skos-history Joachim Neubert & Osma Suominen ZBW Leibniz Information Centre for Economics, Kiel/Hamburg & The National Library of Finland, Helsinki DCMI/ASIST/AIMS Webinar

436 views • 40 slides

User-Defined Distributions and Layouts in Chapel Philosophy and Framework Brad Chamberlain,

User-Defined Distributions and Layouts in Chapel Philosophy and Framework Brad Chamberlain, Steve Deitz, David Iten, Sung Choi Cray Inc. HotPAR 10 June 15, 2010 What is Chapel? A new parallel language being developed by Cray Inc.

670 views • 42 slides

Redefinition of the U.S. Vertical Datum : Replacing NAVD 8 8 I nform ational packet including

Redefinition of the U.S. Vertical Datum : Replacing NAVD 8 8 I nform ational packet including GRAV-D updates Last Updated 12 October 2010 (DAS) 1 Outline What is a vertical datum (3 slides)? NGSs role and authority vis-a-vis

827 views • 55 slides

Introduction to Machine Descriptions Uday Khedker (www.cse.iitb.ac.in/grc) GCC Resource Center,

Tutorial on Essential Abstractions in GCC Introduction to Machine Descriptions Uday Khedker (www.cse.iitb.ac.in/grc) GCC Resource Center, Department of Computer Science and Engineering, Indian Institute of Technology, Bombay April 2011

541 views • 41 slides