keypoint based action keypoint based action recognition
play

Keypoint-Based Action Keypoint-Based Action Recognition - PowerPoint PPT Presentation

Keypoint-Based Action Keypoint-Based Action Recognition Recognition Presenter: Jianchao Yang Presenter: Jianchao Yang Course Instructor: Prof. Derek Hoiem Papers to discuss Papers to discuss Behavior recognition via sparse Behavior


  1. Keypoint-Based Action Keypoint-Based Action Recognition Recognition Presenter: Jianchao Yang Presenter: Jianchao Yang Course Instructor: Prof. Derek Hoiem

  2. Papers to discuss Papers to discuss • Behavior recognition via sparse • Behavior recognition via sparse spatio-temporal features. • Learning realistic human actions from movies. actions from movies.

  3. Behavior Recognition via Sparse Spatio-Temporal Features Spatio-Temporal Features • Motivated by the success application of key points in object recognition • Designed a spatio-temporal feature for behavior recognition recognition

  4. Approach Approach • Similar to what seen in object recognition • Similar to what seen in object recognition Key Points Feature Histogram Classifier Detection Extraction Prototypes Prototypes

  5. Keypoints detection Keypoints detection • Extension from 2D • Extension from 2D • Localization proceeds along the spatial dimensions x and the spatial dimensions x and y , as well as the temporal dimension t . dimension t . • 3D corners too rare

  6. Keypoints detection (cont’) Keypoints detection (cont’) • Response function: • Response function: – Spatial kernel is 2D Gaussian – Temporal kernel

  7. Keypoints detection (cont’) Keypoints detection (cont’) • Keypoints • Keypoints – By pooling maxima of the filter responses – Emphasize temporal information other than spatial information – Strong response to periodic motions – Strong response to periodic motions – Does not respond to pure translation motion – Totally unsupervised – Totally unsupervised

  8. Cuboid descriptor Cuboid descriptor Spatio-temporal Spatio-temporal Key points Key points Cuboids Cuboids descriptor Descriptor: Descriptor: Normalized pixel values; Gradients; Windowed optical flow, etc. Windowed optical flow, etc. Transform Transform Transform: Vectorize directly; Vectorize directly; Histogram (global or local).

  9. Cuboid descriptor (cont’) Cuboid descriptor (cont’) Gradient is best! Gradient is best! Vectorize directly is best! ??

  10. Behavior descriptor Behavior descriptor Spatio-temporal Spatio-temporal Behavior Behavior Key points Key points Cuboids Cuboids descriptor descriptor Transform Transform Prototypes Prototypes

  11. Experiment results Experiment results • Datasets: facial expression, mouse, human actions • Datasets: facial expression, mouse, human actions

  12. Experiments results (cont’) Experiments results (cont’) Mouse Database. Human Facial Expression Database.

  13. Learning realistic human actions from movies movies • Automatic annotation of human actions in video. • Automatic annotation of human actions in video. • Video classification by space-time features. • Video classification by space-time features.

  14. Bag-of-feature approach Bag-of-feature approach • Extension of recent advances in bag-of-feature • Extension of recent advances in bag-of-feature approaches – Spatial pyramid � more general spatial grids Spatial pyramid more general spatial grids – Fixed weights for each pyramid level � optimized – Spatial grid � space-time grids

  15. Space-time features Space-time features • Interest point detection: Harris operator • Interest point detection: Harris operator Space-time Key points Cuboids feature Histogram of oriented gradient ( HoG ) gradient ( HoG ) Local histogram Histogram of optical flow Histogram of optical flow ( HoF )

  16. Spatio-temporal bag-of-features Spatio-temporal bag-of-features • Hierarchical structure • Hierarchical structure Space-time Spatio-temporal Key points Cuboids feature feature Local Local Local Local histogram histogram

  17. Spatio-temporal grids Spatio-temporal grids

  18. Experiment results Experiment results • Evaluation of spatio-temporal grids • Evaluation of spatio-temporal grids

  19. Experiments results (cont’) Experiments results (cont’)

  20. Experiment results (cont’) Experiment results (cont’) • KTH action database • KTH action database

  21. Experiment results (cont’) Experiment results (cont’)

  22. Comments Comments • The two methods are extensions of key-points based • The two methods are extensions of key-points based image classification. Will dense descriptors be better? • Key-points based methods work surprisingly well for image and sequence classification, why? image and sequence classification, why? • Issues needed to address: – Discriminative key-points learning or design for the given task – Discriminative key-points selection for the given task – More efficient way to use location information

Recommend


More recommend