Multimodal Gesture Recognition Based on the ResC3D Network Qiguang - PowerPoint PPT Presentation

Multimodal Gesture Recognition Based on the ResC3D Network Qiguang Miao Yunan Li Wanli Ouyang Zhenxin Ma Xin Xu Weikang Shi

Introduction Our Scheme Experimental Results Future Work

INTRODUCTION C3D model • 3D ConvNets ChaLearn LAP IsoGD • spatiotemporal feature • large-scale learning • Auto feature extraction • video-based

Our Scheme Optical flow data  Generating optical flow data from the RGB one

Our Scheme Retinex for illumination normalization for RGB data Median filter for denoising for depth data  Generating optical flow data from the RGB one  Different strategies for video enhancement

Our Scheme Frame number unification with sampling the most representative frames  Generating optical flow data from the RGB one  Different strategies for video enhancement  A weighted frame number unification strategy to sample the most representative frames

Our Scheme ResC3D model, a combination of C3D and ResNet for better feature extraction  Generating optical flow data from the RGB one  Different strategies for video enhancement  A weighted frame number unification strategy to sample the most representative frames  A ResC3D model for feature extraction

Our Scheme A statistical fusion scheme  Generating optical flow data from the RGB one  Different strategies for video enhancement  A weighted frame number unification strategy to sample the most representative frames  A ResC3D model for feature extraction  Using Canonical Correlation Analysis for feature fusion

Our Scheme SVM for final classification  Generating optical flow data from the RGB one  Different strategies for video enhancement  A weighted frame number unification strategy to sample the most representative frames  A ResC3D model for feature extraction  Using Canonical Correlation Analysis for feature fusion  SVM classifier for the final score

Our Scheme A. Data enhancement RGB data depth data Suffering from different illumination condition The noise exists around the edges

Our Scheme A. Data enhancment • The results of enhancement with Retinex

Our Scheme A. Data enhancment • Denoising with median filter Eliminate noise Preserve edges

Our Scheme B. Weighted frame unification The proportion in the entire video The importance to the recognition KEY FRAME

Our Scheme B. Weighted frame unification • Key frame – Divide the video into n sections – Calculate the average optical flow for each section – The frame numbers of each section are calculated by the proportion of optical flow value of the section and the whole video

Our Scheme C. Feature extraction C3D ResNet

Our Scheme C. Feature extraction

Our Scheme D. Feature fusion • Traditional methods – Parallel (averaging)

Our Scheme D. Feature fusion • Traditional methods – Parallel (averaging) – Serial (concatenating)

Our Scheme D. Feature fusion • Canonical Correlation Analysis – a way of inferring information from cross- covariance matrices – CCA tries to maximize the pair-wise correlations across features with different modalities.

EXPERIMENTAL RESULTS Iteration Times

EXPERIMENTAL RESULTS Fusion

EXPERIMENTAL RESULTS Comparison • J. Wan, S. Z. Li, Y. Zhao, S. Zhou, I. Guyon, and S. Escalera. Chalearn looking at people rgb-d isolated and continuous datasets for gesture recognition. In IEEE CVPR Workshops, pages 56 – 64. 2016. • P.Wang,W. Li, Z. Gao, Y. Zhang, C. Tang, and P. Ogunbona . Scene flow to action map: A new representation for rgb-d based action recognition with convolutional neural networks.In IEEE CVPR, 2017. • P. Wang, W. Li, S. Liu, Z. Gao, C. Tang, and P. Ogunbona. Large-scale isolated gesture recognition using convolutional neural networks. In IEEE ICPR Workshops, 2016. • G. Zhu, L. Zhang, L. Mei, J. Shao, J. Song, and P. Shen. Large-scale isolated gesture recognition using pyramidal 3d convolutional networks. In IEEE ICPR Workshops, 2016. • J. Duan , J. Wan, S. Zhou, X. Guo, and S. Li. A unified framework for multi -modal isolated gesture recognition. In ACM Transactions on Multimedia Computing, Communications, and Applications,2017 • Y. Li, Q. Miao, K. Tian, Y. Fan, X. Xu, R. Li, and J. Song. Large-scale gesture recognition with a fusion of rgb-d data based on the c3d model. In IEEE ICPR Workshops. 2016. • G. Zhu, L. Zhang, P. Shen, and J. Song. Multimodal gesture recognition using 3d convolution and convolutional lstm. IEEE Access, 2017.

Comparison EXPERIMENTAL RESULTS

FUTURE WORK

Thank you !

Multimodal Gesture Recognition Based on the ResC3D Network Qiguang - PowerPoint PPT Presentation

Multimodal Gesture Recognition Based on the ResC3D Network Qiguang Miao Yunan Li Wanli Ouyang Zhenxin Ma Xin Xu Weikang Shi Introduction Our Scheme Experimental Results Future Work Introduction Our Scheme Experimental Results Future Work

Gesture Recognition with CNN Ahmed Abdelghany 20 January 2020 Outline Motivation for Gesture

GESTURE SENSORS Microsoft Kinect V1 24M - 2013 Microsoft Kinect V2 20M - 2016 + VR + GESTURE

Multimodal Machine Learning Louis-Philippe (LP) Morency CMU Multimodal Communication and Machine

Multimodal Machine Learning Louis-Philippe (LP) Morency CMU Multimodal Communication and Machine

Gesture recognition for Smartphones/Wearables Gestures hands, face, body movements

Gesture Recognition Adrian Kndig adkuendi@student.ethz.ch Datum Informatik II Samstag, 27.

Features, Regions, Gestures: Components of a Generic Gesture Recognition Engine Florian Echtler

The Nature of Gesture Gestures are expressive, meaningful body motions, i.e., physical

Human Gesture Recognition for Drone Control Drones are cool - Flying is hard 2 Drone

GESTURE RECOGNITION WITH 3D CNNS Pavlo Molchanov 4/6/2016 Xiaodong Yang Shalini Gupta Kihwan

GESTURE RECOGNITION: USING A MULTI SENSOR APPROACH SHALINI GUPTA, PAVLO MOLCHANOV, KIHWAN KIM,

Motion Capturing and Machine Learning for Gesture Recognition Sotiris Manitsaris Centre for

uWave: Accelerometer-based Personalized Gesture Recognition and Its Applications Recognition and

Multimodal Corridor Planning & Engineering Analysis Project A1A MULTIMODAL CORRIDOR PLANNING

MULTIMODAL OPTIMIZATION MIKE PREUSS. Multimodal Optimization 1 2014-09-14 Mike Preuss

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

EE 6882 Visual Search Engine Prof. Shih Fu Chang, Jan. 30, 2012 Lecture #2 Visual Features:

COLOR SPECTRUM RECONSTRUCTION USING NEURAL NETWORKS 2 Hyperspectral-sensing.nb THE GOAL

Multimodal 2DCNN action recognition from RGB-D Data with Video Summarization Vicent Roig Ripoll

Scene Understanding with 3D Deep Networks Thomas Funkhouser Princeton University Disclaimer: I

CS 4803 / 7643: Deep Learning Topics: Dynamic Programming (Q-Value Iteration)

Loving Kindness Meditation Mindfulness through the eyes of a Veteran video Third level

Simple Digital Camera with Image Editor Group 3 Jun Zhao, Kwan Yin Lau, and Xiang Gao The

Multimedia Event Detection Using GMM Supervectors and Camera Motion Cancelled Features Yusuke

Multimodal Gesture Recognition Based on the ResC3D Network Qiguang - PowerPoint PPT Presentation

Multimodal Gesture Recognition Based on the ResC3D Network Qiguang Miao Yunan Li Wanli Ouyang Zhenxin Ma Xin Xu Weikang Shi Introduction Our Scheme Experimental Results Future Work Introduction Our Scheme Experimental Results Future Work

Gesture Recognition with CNN Ahmed Abdelghany 20 January 2020 Outline Motivation for Gesture

GESTURE SENSORS Microsoft Kinect V1 24M - 2013 Microsoft Kinect V2 20M - 2016 + VR + GESTURE

Multimodal Machine Learning Louis-Philippe (LP) Morency CMU Multimodal Communication and Machine

Multimodal Machine Learning Louis-Philippe (LP) Morency CMU Multimodal Communication and Machine

Gesture recognition for Smartphones/Wearables Gestures hands, face, body movements

Gesture Recognition Adrian Kndig adkuendi@student.ethz.ch Datum Informatik II Samstag, 27.

Features, Regions, Gestures: Components of a Generic Gesture Recognition Engine Florian Echtler

The Nature of Gesture Gestures are expressive, meaningful body motions, i.e., physical

Human Gesture Recognition for Drone Control Drones are cool - Flying is hard 2 Drone

GESTURE RECOGNITION WITH 3D CNNS Pavlo Molchanov 4/6/2016 Xiaodong Yang Shalini Gupta Kihwan

GESTURE RECOGNITION: USING A MULTI SENSOR APPROACH SHALINI GUPTA, PAVLO MOLCHANOV, KIHWAN KIM,

Motion Capturing and Machine Learning for Gesture Recognition Sotiris Manitsaris Centre for

uWave: Accelerometer-based Personalized Gesture Recognition and Its Applications Recognition and

Multimodal Corridor Planning &amp; Engineering Analysis Project A1A MULTIMODAL CORRIDOR PLANNING

MULTIMODAL OPTIMIZATION MIKE PREUSS. Multimodal Optimization 1 2014-09-14 Mike Preuss

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

EE 6882 Visual Search Engine Prof. Shih Fu Chang, Jan. 30, 2012 Lecture #2 Visual Features:

COLOR SPECTRUM RECONSTRUCTION USING NEURAL NETWORKS 2 Hyperspectral-sensing.nb THE GOAL

Multimodal 2DCNN action recognition from RGB-D Data with Video Summarization Vicent Roig Ripoll

Scene Understanding with 3D Deep Networks Thomas Funkhouser Princeton University Disclaimer: I

CS 4803 / 7643: Deep Learning Topics: Dynamic Programming (Q-Value Iteration)

Loving Kindness Meditation Mindfulness through the eyes of a Veteran video Third level

Simple Digital Camera with Image Editor Group 3 Jun Zhao, Kwan Yin Lau, and Xiang Gao The

Multimedia Event Detection Using GMM Supervectors and Camera Motion Cancelled Features Yusuke

Multimodal Corridor Planning & Engineering Analysis Project A1A MULTIMODAL CORRIDOR PLANNING