Computer vision techniques for video surveillance Huiyu Zhou, Ph.D.

Computer vision techniques for video surveillance Huiyu Zhou, Ph.D. January, 2016

Film: Spectre

London riots: Tottenham violence, 5 August, 2011

• Motivation
• Age classification
• Gender classification
• Behaviour analysis
• Summary

• >4,000,000 cameras, UK, 2014.
• Major concern: crime in public places.
• ~70% of offenders are young adolescent males [1].
• Our research focus: what is the age/gender of the target? What is s/he doing (behaviour)?
1. P. Miller, W. Liu, C. Fowler, K. McLaughlin, H. Zhou, J. Shen, J. Ma, H. Wang, J. Zhang, W. Yan and S. Sezer, “Intelligent Sensor Information System for Public Transport: To Safely Go”, IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2010.

Challenges
• Intrapersonal variation: anatomical changes on faces. Tony Blair: 10+, 30+, 50+ (left to right)
• Interpersonal variation: individual evolution of faces. Bill Gates: 10+, 20+, 50+ (left to right)

Whole picture of our system
• Original images (example ages: months, 4 years, 7 years, 14 years)
• Adaptive Difference of Gaussian (DoG)
• Radon Transform (RT): x – intensity, y – bins
• Feature selection/SVM classification

Feature extraction: Adaptive DoG
• Benefits
  – To reduce the effects of rapid intensity changes on faces
• Adaptive DoG filtering:
  – Subtracting two convolutions: σ1 = σ0/8, σ2 = σ0/16
  – Gamma correction
  – Contrast equalisation
Figure: contrast equalisation (x – greyscale, y – pixel no.)
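The preprocessing chain on this slide can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the author's implementation: the values of σ0 and gamma, applying gamma correction before the filtering, subtracting the wider blur from the narrower one, and equalising contrast by dividing by the mean absolute response are all illustrative choices that the slides do not specify.

```python
import math

def gaussian_kernel(sigma):
    """Normalised 1-D Gaussian kernel truncated at about 3 sigma."""
    radius = max(1, int(math.ceil(3 * sigma)))
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def blur_rows(img, kernel):
    """Convolve every row with the kernel, clamping indices at the border."""
    radius = len(kernel) // 2
    width = len(img[0])
    return [[sum(k * row[min(max(x + j - radius, 0), width - 1)]
                 for j, k in enumerate(kernel))
             for x in range(width)]
            for row in img]

def transpose(img):
    return [list(col) for col in zip(*img)]

def gaussian_blur(img, sigma):
    """Separable 2-D Gaussian blur: rows first, then columns."""
    kernel = gaussian_kernel(sigma)
    return transpose(blur_rows(transpose(blur_rows(img, kernel)), kernel))

def dog_normalise(img, sigma0=16.0, gamma=0.2):
    """Gamma correction, DoG with sigma1 = sigma0/8 and sigma2 = sigma0/16,
    then a simple contrast equalisation (all parameter values are assumptions)."""
    corrected = [[(v / 255.0) ** gamma for v in row] for row in img]
    narrow = gaussian_blur(corrected, sigma0 / 16.0)  # sigma2
    wide = gaussian_blur(corrected, sigma0 / 8.0)     # sigma1
    dog = [[a - b for a, b in zip(rn, rw)] for rn, rw in zip(narrow, wide)]
    # Assumed equalisation: rescale so the mean absolute response is 1.
    scale = sum(abs(v) for row in dog for v in row) / (len(dog) * len(dog[0]))
    if scale < 1e-12:  # flat image: nothing to equalise
        return dog
    return [[v / scale for v in row] for row in dog]
```

A flat image yields an (almost) all-zero response, while an image with an intensity step yields a response concentrated around the step.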

Feature extraction: why Radon Transform?
• In-plane rotation invariant
• Detecting facial curves (e.g. wrinkles)
Figure: 1-D illustration of the Radon transform at different rotations (x – angle/deg, y – projection displacement)

Feature extraction: how is the Radon Transform used?
• Similarity measured by Radon projection correlation distance [2].
  – 2-D Radon transform of different images (x – angle/deg, y – projection displacement)
2. H. Zhou, P. Miller and J. Zhang, “Age classification using Radon transform and entropy based scaling SVM”, Proc. of British Machine Vision Conference, 2011.
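To make the similarity measure concrete, here is a rough sketch of a discrete Radon projection and a correlation distance between two sinograms. It is an illustration only: the nearest-bin accumulation, the angle step and the definition of the distance as one minus the Pearson correlation of the flattened sinograms are assumptions, and the exact formulation in [2] may differ.

```python
import math

def radon_projection(img, angle_deg):
    """Sum pixel intensities along parallel lines; bins index the signed
    displacement of each pixel from the image centre along the given angle."""
    h, w = len(img), len(img[0])
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    half = int(math.ceil(math.hypot(h, w) / 2.0)) + 1
    bins = [0.0] * (2 * half + 1)
    c = math.cos(math.radians(angle_deg))
    s = math.sin(math.radians(angle_deg))
    for y in range(h):
        for x in range(w):
            d = (x - cx) * c + (y - cy) * s
            bins[int(math.floor(d + 0.5)) + half] += img[y][x]
    return bins

def sinogram(img, angles):
    """2-D Radon transform: one projection per angle."""
    return [radon_projection(img, a) for a in angles]

def correlation_distance(u, v):
    """1 - Pearson correlation of two equally long vectors."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    du = [a - mu for a in u]
    dv = [b - mv for b in v]
    den = math.sqrt(sum(a * a for a in du)) * math.sqrt(sum(b * b for b in dv))
    if den == 0.0:
        return 1.0
    return 1.0 - sum(a * b for a, b in zip(du, dv)) / den

def radon_distance(img_a, img_b, angles=range(0, 180, 10)):
    """Correlation distance between the flattened sinograms of two
    equally sized images (the angle set is an assumed default)."""
    flat_a = [v for p in sinogram(img_a, angles) for v in p]
    flat_b = [v for p in sinogram(img_b, angles) for v in p]
    return correlation_distance(flat_a, flat_b)
```

An image compared with itself gives a distance of (numerically) zero, and every projection preserves the total image intensity because each pixel falls into exactly one bin.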

Feature selection: entropy based scaling SVM
• What is scaling?
  – A scheme to select the hyper-parameters (SVM) for the least generalisation error
• Scaling SVM
  – Continuously update kernel K and weight w
Figure: illustration of scaling SVM (classification results of parameter set 1; classification results of parameter set 2)

Experimental work: set-up
• Objective: to separate teenagers and adults
• Comparisons: our system (DRTP) against 5-fold SVM with
  a) PCA (principal component analysis)
  b) LBP (local binary pattern)
  c) HOG (histogram of oriented gradients)
  d) DRT (DoG/RT/no feature selection)
  e) DRTC (DoG/RT/feature selection)
  f) HOGSS (HOG with feature selection)
• Test databases: FG-NET and MORPH
Figure: examples from the two databases
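Reading "5-fold SVM" as an SVM evaluated with 5-fold cross-validation (an assumption; the slides do not spell this out), the evaluation loop might look like the sketch below. The `fit` and `predict` callables are hypothetical hooks standing in for whichever classifier is being compared.

```python
def k_fold_indices(n_samples, k=5):
    """Yield (train, test) index lists for k contiguous, near-equal folds.
    Shuffle the data beforehand if it is ordered by class."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size

def cross_validate(samples, labels, fit, predict, k=5):
    """Mean test error over k folds; fit/predict are caller-supplied hooks."""
    errors = []
    for train, test in k_fold_indices(len(samples), k):
        model = fit([samples[i] for i in train], [labels[i] for i in train])
        wrong = sum(predict(model, samples[i]) != labels[i] for i in test)
        errors.append(wrong / len(test))
    return sum(errors) / len(errors)
```

Each sample is used for testing exactly once, so the averaged error is an estimate of the generalisation error for that feature/classifier combination.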

Experimental work: MORPH dataset
Figure panels: images of different ages; LBP (x – bins, y – numbers); PCA reconstruction of 50 eigenvectors; HOG (x – feature index, y – gradient values); Proposed (x – feature index, y – intensity pixels)

Experimental work: MORPH dataset
Figure panels: feature selection outcomes; classification by seven algorithms

Challenges
• Research categories: face based and full body based
• Face based: requires frontal faces and is affected by occlusions [3]
3. H. Zhou and A. Sadka, “Combining perceptual features with diffusion distance for face recognition”, IEEE Trans. on Systems, Man, and Cybernetics – Part C, Vol. 41, Issue 5, 577-588, 2011.

Challenges – demo of walking patterns
• Full body based: gaits
• Side-view problem
Courtesy of Biomotion Lab, Canada

Our approach 1) Combination of facial and full body measurements

Our approach 2) Face channel: face detection → PCA features

Face detection and PCA

Our approach 3) Full body channel: background subtraction → PiHOG features

Background subtraction and PiHOG
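The slides do not say which background model is used, so the following is only a generic sketch of background subtraction with a running-average model and a fixed threshold; `alpha` and `threshold` are illustrative values, and the PiHOG feature step is not shown.

```python
def update_background(background, frame, alpha=0.05):
    """Blend the new frame into the running-average background model."""
    return [[(1.0 - alpha) * b + alpha * f for b, f in zip(brow, frow)]
            for brow, frow in zip(background, frame)]

def foreground_mask(background, frame, threshold=25.0):
    """Mark pixels whose intensity differs from the background by more
    than the threshold as foreground (True)."""
    return [[abs(f - b) > threshold for b, f in zip(brow, frow)]
            for brow, frow in zip(background, frame)]
```

The resulting mask isolates the moving person, whose silhouette region would then be passed on to the body feature extractor.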

Our approach 4) “EntropyBoost” → classifier probability estimate in each channel

Our approach 5) Fusing two channels: score integration [4]
4. H. Zhou, P. Miller, J. Zhang, D. Crookes, F. Campbell-West, M. Collins, H. Wang, “EntropyBoost based gender classification using facial and full body measurements”, Technical report, 2013.
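One simple way to integrate the two channel scores is a weighted sum of the per-channel probability estimates. The rule actually used in [4] is not given on the slides, so the weighted-sum rule, the weight `w_face` and the convention that each score is an estimate of P(male) are all assumptions made for this sketch.

```python
def fuse_scores(p_face, p_body, w_face=0.5):
    """Weighted sum of the face-channel and body-channel P(male) estimates."""
    if not 0.0 <= w_face <= 1.0:
        raise ValueError("w_face must lie in [0, 1]")
    return w_face * p_face + (1.0 - w_face) * p_body

def classify_gender(p_face, p_body, w_face=0.5):
    """Decide on the fused score with a 0.5 decision threshold."""
    return "male" if fuse_scores(p_face, p_body, w_face) >= 0.5 else "female"
```

Lowering `w_face` when the face is occluded or not frontal would let the full body channel dominate the decision.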

Demo video: gender classification

Experimental results Gender classification errors of different systems: “CF” – face/body HOG features + SVM; “FP” - face PCA features + SVM; “BH” – body HOG features + SVM; “EF” – our system.

• Motivation
• Age classification
• Gender classification
• Behaviour analysis
  – Human tracking (single and multiple cameras)
  – Trajectory clustering
  – Event reasoning
• Summary

Single-camera human tracking
• Challenges
  – Occlusions/pose or light changes
• Heterogeneous sensors
  – Kalman filter based audio/visual data association scheme [5]
5. H. Zhou, M. Taj and A. Cavallaro, “Target detection and tracking with heterogeneous sensors”, IEEE Journal of Selected Topics in Signal Processing, Vol. 2, No. 4, 503-513, 2008.
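For readers unfamiliar with the filter at the core of such a scheme, here is a minimal one-dimensional constant-velocity Kalman filter step. It shows only the predict/update cycle; the audio/visual data association of [5] is not reproduced, and the motion model, the noise levels `q` and `r` and the diagonal process noise are assumptions for illustration.

```python
def kalman_step(x, P, z, dt=1.0, q=1e-3, r=1.0):
    """One predict/update cycle.
    x = [position, velocity], P = 2x2 covariance, z = measured position."""
    # Predict with F = [[1, dt], [0, 1]]: x' = F x, P' = F P F^T + q I
    pos = x[0] + dt * x[1]
    vel = x[1]
    p00 = P[0][0] + dt * (P[1][0] + P[0][1]) + dt * dt * P[1][1] + q
    p01 = P[0][1] + dt * P[1][1]
    p10 = P[1][0] + dt * P[1][1]
    p11 = P[1][1] + q
    # Update with H = [1, 0]: only the position is observed
    s = p00 + r                    # innovation variance
    k0, k1 = p00 / s, p10 / s      # Kalman gain
    resid = z - pos                # innovation
    x_new = [pos + k0 * resid, vel + k1 * resid]
    P_new = [[(1.0 - k0) * p00, (1.0 - k0) * p01],
             [p10 - k1 * p00, p11 - k1 * p01]]
    return x_new, P_new
```

In a multi-sensor setting, each detection (audio or visual) would supply its own `z` and `r`, with the innovation `resid` and its variance `s` available for gating candidate associations.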

Demo video can be found at: http://sites.google.com/site/huiyujoe/
Panels: Particle filter; Graph matching; Audio detection (TOA); Our system

Single-camera human tracking
• Kernel estimation and local features
  – Effective combination of mean shift and SIFT features [6]
6. H. Zhou, Y. Yuan and C. Shi, “Object tracking using SIFT features and mean shift”, Computer Vision and Image Understanding, Vol. 113, No. 3, 345-352, 2009.
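The mean shift half of this combination can be illustrated with a small mode-seeking routine over 2-D points. This is a generic sketch with a Gaussian kernel, not the tracker of [6]: in a real tracker the points would carry weights derived from the target's appearance model, and the SIFT matching step is omitted. The bandwidth and tolerances are illustrative values.

```python
import math

def mean_shift(points, start, bandwidth=2.0, tol=1e-4, max_iter=100):
    """Move from `start` towards the nearest density mode of `points`
    by repeatedly jumping to the kernel-weighted mean of the points."""
    x, y = start
    for _ in range(max_iter):
        wsum = sx = sy = 0.0
        for px, py in points:
            d2 = (px - x) ** 2 + (py - y) ** 2
            w = math.exp(-d2 / (2.0 * bandwidth ** 2))
            wsum += w
            sx += w * px
            sy += w * py
        if wsum == 0.0:  # no point close enough to attract the window
            break
        nx, ny = sx / wsum, sy / wsum
        shift = math.hypot(nx - x, ny - y)
        x, y = nx, ny
        if shift < tol:
            break
    return x, y
```

Starting the search in each new frame from the previous target position is what turns this mode seeking into tracking.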

More results can be found at: http://sites.google.com/site/huiyujoe/
Panels: Mean shift; SIFT; Our system

Demo: Multi-camera human tracking Simulated Annealing Particle Filter

Trajectory clustering – walking Walking trajectories to be clustered

Clustering using individual features (a) Actual walking trajectories (b) Distance difference features (c) Direction deviation features
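The slides name the two features but do not define them. One plausible reading, offered purely as an assumption, is that "distance difference" is the change in step length between consecutive trajectory segments and "direction deviation" is the change in heading between them. Under that reading the features could be computed as in this sketch:

```python
import math

def step_lengths(traj):
    """Euclidean length of each segment of a trajectory [(x, y), ...]."""
    return [math.hypot(x1 - x0, y1 - y0)
            for (x0, y0), (x1, y1) in zip(traj, traj[1:])]

def distance_differences(traj):
    """Assumed definition: change in step length between consecutive segments."""
    steps = step_lengths(traj)
    return [b - a for a, b in zip(steps, steps[1:])]

def direction_deviations(traj):
    """Assumed definition: signed heading change between consecutive
    segments, wrapped to the range (-pi, pi]."""
    headings = [math.atan2(y1 - y0, x1 - x0)
                for (x0, y0), (x1, y1) in zip(traj, traj[1:])]
    deviations = []
    for a, b in zip(headings, headings[1:]):
        d = b - a
        deviations.append(math.atan2(math.sin(d), math.cos(d)))
    return deviations
```

A straight walk at constant speed gives all-zero features, whereas turns and speed changes show up as non-zero entries that a clustering step can group.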

Markov Chain Monte Carlo based clustering (a) Ground truthed trajectories (b) Proposed approach

Event reasoning

Summary
• Automatic feature extraction and selection for age classification.
• Combining facial and full body measurements for gender classification.
• Behaviour analysis (ongoing): human tracking, trajectory clustering and event reasoning.

Acknowledgments
• Collaborators
  – Internal: colleagues in ECIT/CSIT …
  – External: BAE, Thales, Microsoft, IBM, Google, NIH, U. of London …
• Funding agencies
  – EPSRC
  – Invest NI
  – EU ICT

Thank you very much! Q & A
