Applications in Visual Object Tracking Yuanwei Wu 10-21-2016 1 - PowerPoint PPT Presentation

Week 42: Siamese Network: Architecture and Applications in Visual Object Tracking Yuanwei Wu 10-21-2016 1

Outline • Siamese Architecture • Siamese Applications in Computer Vision • Paper review  Visual Object Tracking using Siamese CNN • Future Work 2

What does “Siamese” mean? 3 Source: http://vision.ia.ac.cn/zh/senimar/reports/Siamese-Network-Architecture-and-Applications-in-Computer-Vision.pdf

Siamese Architecture 4 Source: Learning Hierarchies of Invariant Features. Yann LeCun. helper.ipam.ucla.edu/publications/gss2012/gss2012_10739.pdf

Siamese Architecture and loss function 5 Source: Learning Hierarchies of Invariant Features. Yann LeCun. helper.ipam.ucla.edu/publications/gss2012/gss2012_10739.pdf

Siamese Applications in Computer Vision: 1. Signature Verification 6 Source: http://vision.ia.ac.cn/zh/senimar/reports/Siamese-Network-Architecture-and-Applications-in-Computer-Vision.pdf

Siamese Applications in Computer Vision: 2. Dimensionality Reduction 7 Source: http://vision.ia.ac.cn/zh/senimar/reports/Siamese-Network-Architecture-and-Applications-in-Computer-Vision.pdf

Siamese Applications in Computer Vision: 3.1 Learning Image Descriptors CNN Model 8 Source: http://vision.ia.ac.cn/zh/senimar/reports/Siamese-Network-Architecture-and-Applications-in-Computer-Vision.pdf

Siamese Applications in Computer Vision: 3.2 Learning Image Descriptors 9 Source: http://vision.ia.ac.cn/zh/senimar/reports/Siamese-Network-Architecture-and-Applications-in-Computer-Vision.pdf

Siamese Applications in Computer Vision: 4.1 Face Verification 10 Source: http://vision.ia.ac.cn/zh/senimar/reports/Siamese-Network-Architecture-and-Applications-in-Computer-Vision.pdf

Paper Review: Fully-Convolutional Siamese Networks for Object Tracking @article{bertinetto2016fully, title={Fully-Convolutional Siamese Networks for Object Tracking}, author={Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS}, journal={arXiv preprint arXiv:1606.09549}, year={2016} } 15

Architecture of Siamese CNN Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 16 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Details of the Architecture of Siamese CNN 1. Source: 1: Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton, ImageNet Classification with Deep Convolutional Neural Networks, NIPS 2012. 17

Details of the Architecture of Siamese CNN 1. 2. Cross-correlation layer Source: 1: Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton, ImageNet Classification with Deep Convolutional Neural Networks, NIPS 2012. 2: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 18 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Training: dataset • ImageNet Video dataset of 2015:  contains ~4000 videos  with ~1 million annotated frames Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 19 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Training: preprocessing on the images • Preprocessing: 2820 videos, examplar image: 127 x 127, search image: 255 x 255 Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 20 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Training: recap the steps • ImageNet Video dataset of 2015:  contains ~4000 videos  with ~1 million annotated frames • Preprocessing:  2820 videos  examplar image: 127 x 127  search image: 255 x 255 • Training with a standard Stochastic Gradient Descent (SGD) solver using MathConvNet Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 21 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Training: loss function • Employing a discriminative training approach using positive and negative pairs and adopting the logistic loss: Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 22 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Training: loss function • Employing a discriminative training approach using positive and negative pairs and adopting the logistic loss: • The loss of a score map is the mean of the individual losses: Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 23 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Training: loss function • Employing a discriminative training approach using positive and negative pairs and adopting the logistic loss: • The loss of a score map is the mean of the individual losses: • Applying SGD to find the conv-net Ѳ using Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 24 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Tracking algorithm • Use a search image centered at the previous position of the target. Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 25 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Tracking algorithm • Use a search image centered at the previous position of the target. • Only search for the object within a region of approximately four times its previous size. Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 26 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Tracking algorithm • Use a search image centered at the previous position of the target. • Only search for the object within a region of approximately four times its previous size. • A cosine window is added to the score map to penalize large displacements. Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 27 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Tracking algorithm • Use a search image centered at the previous position of the target. • Only search for the object within a region of approximately four times its previous size. • A cosine window is added to the score map to penalize large displacements. • The position of the maximum score relative to the center of the score map, multiplied by the stride of the network, gives the displacement of the target from frame to frame. Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 28 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Experiments: training dataset size • Accuracy: is calculated as the average Intersection-over-Union (IoU) • Robustness: in terms of the total number of failures Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 29 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Experiments: training dataset size • Accuracy: is calculated as the average Intersection- over-Union (IoU) • Robustness: in terms of the total number of failures • Using a larger video dataset could increase the performance even further. Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 30 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Experiments: OTB13 benchmark results Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 31 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Experiments: VOT15 benchmark results Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 32 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Experiments: VOT15 benchmark results Source: Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\~a}o F and Vedaldi, Andrea and Torr, Philip HS, 33 fully-Convolutional Siamese Networks for Object Tracking, arXiv preprint, 2016.

Applications in Visual Object Tracking Yuanwei Wu 10-21-2016 1 - PowerPoint PPT Presentation

Week 42: Siamese Network: Architecture and Applications in Visual Object Tracking Yuanwei Wu 10-21-2016 1 Outline Siamese Architecture Siamese Applications in Computer Vision Paper review Visual Object Tracking using Siamese

Overview Introduction Object Tracking Vehicle Tracking Theory & Implementation

Multi-Object Tracking Challenge CV3DST Lecture Exercises Multi-Object Tracking Multi-Object

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Tracking using Goal CONDENSATION: Model-based visual tracking in dense Conditional Density

Similarity Mapping with Enhanced Siamese Network for Multi-object Tracking Minyoung Kim

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

Object-Oriented Databases Object Oriented Databases ODMG Standard Object Model, Object

Object oriented Object oriented Object oriented Object oriented approach and UML approach and

Visual Object Tracking: An overview P a n H e , P h . D s t u d e n t @ U F M A L T L a b h

Multi-object tracking (MOT): visual and audio-visual Daniel Gatica-Perez (joint work with Kevin

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

Visual Object Tracking Jianan Wu Megvii (Face++) Researcher wjn@megvii.com Dec 2017

Trial Issues in Protest Cases Keir Monteith QC, Garden Court Chambers (Chair) Russell Fraser,

Justification Defences: Direct action and freedom of expression Henry Blaxland QC, Barrister

Investor Presentation August 2019 Important Disclosure No representation or warranty, express or

Twinning 9% Credits and 4% Credits Dan Rosen Klein Hornig LLP 2019 NH&RA Annual

Birmingham Womens and Childrens NHS Foundation Trust At a glance... Who are we? We

National Options and Discretions in a Banking Union A Contradiction? Workshop: Challenges for

Forgiving the Angel of Death Group Project COMM-1080-504 Matthew Maxwell Maria Ipsen Kennedy

Lessons from the LIPGENE Lessons from the LIPGENE Project: Project: Policy Dilemmas that Arise

Applications in Visual Object Tracking Yuanwei Wu 10-21-2016 1 - PowerPoint PPT Presentation

Week 42: Siamese Network: Architecture and Applications in Visual Object Tracking Yuanwei Wu 10-21-2016 1 Outline Siamese Architecture Siamese Applications in Computer Vision Paper review Visual Object Tracking using Siamese

Overview Introduction Object Tracking Vehicle Tracking Theory &amp; Implementation

Multi-Object Tracking Challenge CV3DST Lecture Exercises Multi-Object Tracking Multi-Object

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Tracking using Goal CONDENSATION: Model-based visual tracking in dense Conditional Density

Similarity Mapping with Enhanced Siamese Network for Multi-object Tracking Minyoung Kim

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

Object-Oriented Databases Object Oriented Databases ODMG Standard Object Model, Object

Object oriented Object oriented Object oriented Object oriented approach and UML approach and

Visual Object Tracking: An overview P a n H e , P h . D s t u d e n t @ U F M A L T L a b h

Multi-object tracking (MOT): visual and audio-visual Daniel Gatica-Perez (joint work with Kevin

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

Visual Object Tracking Jianan Wu Megvii (Face++) Researcher wjn@megvii.com Dec 2017

Trial Issues in Protest Cases Keir Monteith QC, Garden Court Chambers (Chair) Russell Fraser,

Justification Defences: Direct action and freedom of expression Henry Blaxland QC, Barrister

Investor Presentation August 2019 Important Disclosure No representation or warranty, express or

Twinning 9% Credits and 4% Credits Dan Rosen Klein Hornig LLP 2019 NH&amp;RA Annual

Birmingham Womens and Childrens NHS Foundation Trust At a glance... Who are we? We

National Options and Discretions in a Banking Union A Contradiction? Workshop: Challenges for

Forgiving the Angel of Death Group Project COMM-1080-504 Matthew Maxwell Maria Ipsen Kennedy

Lessons from the LIPGENE Lessons from the LIPGENE Project: Project: Policy Dilemmas that Arise

Overview Introduction Object Tracking Vehicle Tracking Theory & Implementation

Twinning 9% Credits and 4% Credits Dan Rosen Klein Hornig LLP 2019 NH&RA Annual