Summary of our efforts Last year Deep CNN for - PowerPoint PPT Presentation

Video ¡Concept ¡Detec-on ¡by ¡Deep ¡Nets ¡with ¡FLAIR ¡ Cees ¡Snoek, ¡Koen ¡van ¡de ¡Sande, ¡Daniel ¡Fon2jne ¡ Qualcomm ¡Technologies ¡ University ¡of ¡Amsterdam ¡ Netherlands ¡B.V. ¡ The ¡Netherlands ¡ Presented ¡by ¡ Thomas ¡Mensink , ¡UvA ¡

Summary ¡of ¡our ¡efforts ¡ Last ¡year ¡ ¡ ¡ ¡Deep ¡CNN ¡for ¡video ¡concept ¡detec2on ¡and ¡localiza2on ¡ ¡ ¡ ¡ ¡ This ¡year ¡ ¡ ¡ ¡ ¡Tangen2al ¡improvements ¡for ¡concept ¡detec2on ¡ ¡ ¡ ¡ Our ¡main ¡innova+on ¡is ¡in ¡concept ¡localiza+on ¡ ¡

TASK ¡I ¡ DETECTING ¡CONCEPTS ¡

Conclusion ¡from ¡TRECVID ¡2013 ¡ Video ¡fusion ¡ Frame ¡fusion ¡ Video ¡deep ¡net ¡ MediaMill ¡2012 ¡ Bag ¡of ¡words ¡and ¡deep ¡net ¡profit ¡from ¡each ¡other ¡

MediaMill ¡TRECVID ¡2014 ¡runs ¡ Bag ¡of ¡codes ¡ Net ¡of ¡convolu-ons ¡ Late ¡Fusion ¡by ¡weighted ¡averaging ¡ ¡ ¡ ¡ Run: ¡Fusion ¡1 ¡ Run: ¡Fusion ¡2 ¡ Run: ¡Fusion ¡3 ¡ Baseline ¡Run: ¡ Fuse-‑all ¡ ¡8x ¡CNN ¡+ ¡BoW ¡ Best ¡CNN ¡+ ¡BoW ¡ 8x ¡CNN ¡

MediaMill: ¡Color ¡difference ¡coding ¡ • Densely ¡sampled ¡points ¡ • SIFT, ¡C-‑SIFT ¡and ¡T-‑SIFT ¡descriptors ¡ • PCA ¡reduc2on ¡to ¡80D ¡ • Fisher ¡vector ¡coding ¡with ¡codebook ¡size ¡256 ¡ • Spa2al ¡pyramid ¡1x1+1x3 ¡ • Spa2al ¡coordinate ¡coding ¡ ¡ • Linear ¡classifier ¡ Color Descriptor software available for download at http://colordescriptors.com

MediaMill: ¡Video ¡deep ¡learning ¡ Convolu2onal ¡neural ¡network ¡with ¡8 ¡layers ¡with ¡weights ¡ ¡ Trained ¡using ¡error ¡back ¡propaga2on ¡ – ImageNet ¡for ¡pre-‑training ¡

Results ¡ Fusion ¡1 ¡/ ¡2 ¡ Baseline: ¡8x ¡CNN ¡ Fusion ¡3: ¡Best ¡CNN ¡+ ¡BoW ¡ Bag ¡of ¡words ¡and ¡deep ¡net ¡profit ¡from ¡each ¡other, ¡ ¡ be<er ¡results ¡with ¡more ¡nets ¡

Results ¡per ¡concept ¡

TASK ¡II ¡ LOCALIZING ¡CONCEPTS ¡ Fisher ¡and ¡VLAD ¡with ¡FLAIR, ¡Koen ¡van ¡de ¡Sande, ¡Cees ¡Snoek, ¡and ¡Arnold ¡Smeulders ¡ CVPR ¡2014 ¡

Goal: ¡meaningful ¡localiza2on ¡ Finding ¡ where , ¡ when , ¡ what ¡is ¡happening ¡ Challenges: ¡huge ¡search ¡space, ¡non-‑rigid ¡deforma2on ¡

Related ¡work ¡ Sliding ¡Window ¡ Boos2ng ¡Cascade ¡ Image ¡ Video ¡ Image ¡ Video ¡ … ¡ [Rowley, ¡1996] ¡ [Rodriguez, ¡2008] ¡ [Viola ¡& ¡Jones, ¡2001] ¡ [Ke, ¡2005] ¡ Branch ¡and ¡Bound ¡ Deformable ¡Parts ¡ Image ¡ Video ¡ Image ¡ Video ¡ [Lampert, ¡2009] ¡ [Yuan, ¡2011] ¡ [Felzenswalb, ¡2008] ¡ [Tian, ¡2013] ¡

Inspira2on: ¡Selec2ve ¡Search ¡ [Uijlings, ¡2013] ¡ Itera2ons ¡of ¡selec2ve ¡search ¡ Hierarchical ¡grouping ¡ of ¡super-‑pixels ¡ ¡ ¡ Object ¡proposals ¡ High ¡recall ¡with ¡modestly ¡sized ¡ ¡object ¡hypotheses ¡set ¡ Feasible ¡to ¡train ¡an ¡expensive ¡classifier ¡

Selec2ve ¡Search ¡ Mul2ple ¡complementary ¡invariant ¡color ¡spaces ¡ Loca2on ¡hypotheses ¡are ¡class-‑independent ¡ VOC2007 ¡test ¡ 1,500 ¡windows/image ¡ 98.0% ¡recall ¡ 14 ¡ Software available for download at http://koen.me/research/selectivesearch/

Local ¡object ¡classifica2on ¡ Requires ¡ repe--ve ¡computa2ons ¡on ¡ overlapping ¡regions ¡ Spa-al ¡Pyramids ¡ [ Lazebnik, ¡CVPR06 ] ¡ (#regions: ¡10-‑100) ¡ Object ¡Detec-on ¡[ Sande, ¡ICCV11 ] ¡ (#regions: ¡1,000-‑10,000) ¡ ¡ Repeat ¡for ¡each ¡region ¡ Feature Feature Kernel Local Feature Encoding Pooling Classification Extraction

Features ¡ Use ¡SIFT ¡and ¡ColorSIFT ¡descriptors ¡ Bag-‑of-‑words, ¡VLAD, ¡Fisher ¡vector ¡encoding ¡ ¡ Encoding ¡2000 ¡boxes ¡per ¡image ¡is ¡expensive ¡ Bag-‑of-‑words: ¡ ¡ ¡10s ¡ VLAD: ¡ ¡ ¡ ¡30s ¡ Fisher: ¡ ¡120s ¡

Key ¡idea ¡ Decompose ¡assignment ¡over ¡codebook ¡elements ¡ Codebook ¡ Point ¡feature ¡ Codeword ¡index ¡ Decomposi2on ¡

Area-‑independent ¡decomposi2on ¡ Fast ¡box ¡evalua2on ¡with ¡integral ¡images ¡ 0 ¡ ¡ ¡ ¡0 ¡ ¡ ¡ ¡0 ¡ ¡ ¡ ¡1 ¡ 0 ¡ ¡ ¡ ¡1 ¡ ¡ ¡ ¡1 ¡ ¡ ¡ ¡2 ¡ ¡ 1 ¡ ¡ ¡ ¡2 ¡ ¡ ¡ ¡2 ¡ ¡ ¡ ¡3 ¡ 1 1 ¡ ¡ ¡ ¡2 ¡ ¡ ¡ ¡2 ¡ (2 ¡ ¡ ¡ ¡ ¡0 ¡ ¡ ¡ ¡ ¡2) ¡ (1 ¡ ¡ ¡ ¡ ¡0 ¡ ¡ ¡ ¡ ¡1) ¡ 2 ¡ ¡ ¡ ¡2 ¡ ¡ ¡ ¡4 ¡ ¡ ¡ ¡4 ¡ ¡ Box ¡feature ¡encoding ¡ 2 ¡ ¡ ¡ ¡3 ¡ ¡ ¡ ¡5 ¡ ¡ ¡ ¡5 ¡ Decomposi2on ¡ Integral ¡image ¡

Porikil, ¡CVPR ¡2005 ¡ VLAD ¡with ¡FLAIR ¡ Decomposi2on ¡as ¡mul2-‑dimensional ¡integral ¡image ¡ ¡ Sparsity ¡drops ¡memory ¡from ¡14GB ¡to ¡1GB/image ¡ Supports ¡power ¡norm, ¡L2 ¡norm ¡and ¡spa2al ¡pyramid ¡ ¡ 18X ¡speedup ¡

Fisher ¡with ¡FLAIR ¡ Decomposi2on ¡as ¡four ¡mul2-‑dimensional ¡integral ¡images ¡ [See ¡paper] ¡ Supports ¡power ¡norm, ¡L2 ¡norm, ¡spa2al ¡pyramids ¡ No ¡need ¡for ¡approxima2ons ¡ Scalable ¡to ¡modern ¡datasets ¡ ¡ 18X ¡speedup ¡

Overall ¡detec2on ¡speedup ¡and ¡accuracy ¡ ¡ Time (s) per image ¡ ¡ ¡ ¡ Fisher ¡with ¡FLAIR ¡is ¡be<er ¡and ¡faster ¡than ¡BoW ¡

MediaMill ¡TRECVID ¡2014 ¡runs ¡ ¡MediaMill ¡2014 ¡SIN ¡runs ¡ Selec-ve ¡Search ¡ Bounding ¡box ¡ ¡ Fisher ¡with ¡FLAIR ¡ annota2ons ¡ Run ¡ Run ¡ Run ¡ Run ¡

Implementa2on ¡details ¡ PCA-‑reduced ¡ColorSIFT ¡descriptors ¡to ¡80D ¡ Fisher ¡with ¡FLAIR ¡encoding ¡ Spa2al ¡pyramid ¡ ¡ Linear ¡SVM ¡ Hard ¡nega2ve ¡mining ¡

Best ¡box ¡ Boat ¡ Other ¡boxes ¡

Best ¡box ¡ Airplane ¡ Other ¡boxes ¡

Results ¡ * ¡ ¡ ¡ ¡ ¡8x ¡CNN ¡+ ¡FLAIR ¡ + ¡ ¡ ¡ ¡ ¡Fusion ¡1 ¡+ ¡FLAIR ¡  ¡ ¡Fusion ¡2 ¡+ ¡FLAIR ¡ ☐ ¡ ¡ ¡Fusion ¡3 ¡+ ¡FLAIR ¡ FLAIR ¡aCer ¡deep ¡nets ¡is ¡best ¡

Conclusions ¡ Bag ¡of ¡words ¡and ¡deep ¡net ¡profit ¡from ¡each ¡other ¡ ¡ ¡ Encoding ¡Fisher ¡with ¡FLAIR ¡is ¡18x ¡faster ¡ ¡ ¡ ¡ Area ¡independent ¡ ¡ ¡ ¡ ¡Supports ¡spa2al ¡pyramids, ¡power ¡norm, ¡L2 ¡norm ¡ ¡ ¡ ¡ ¡No ¡approxima2on ¡ Allows ¡for ¡large-‑scale ¡localiza2on ¡in ¡video ¡ ¡ ¡ 27 ¡ ¡

Summary of our efforts Last year Deep CNN for - PowerPoint PPT Presentation

Video Concept Detec-on by Deep Nets with FLAIR Cees Snoek, Koen van de Sande, Daniel Fon2jne Qualcomm Technologies University of Amsterdam Netherlands B.V.

OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR

Baldwin Space Summary October 25 1 Baldwin School Space Summary 2 Baldwin School Space Summary

TOGETHER WWW.aLGOduE.iT SUMMARY 06 OuR SPECiaLiSTS 09 OuR 03 fRuiTS 08 OuR RESuLTS OuR

The benefits of multilingualism and efforts to The benefits of multilingualism and efforts to fix

International Efforts and International Efforts and Issues in Urban Search & Issues in Urban

Build Upon Past Efforts Dudley Square Transportation & Air Quality Study BTD/BRA

Resurrection of Fukushima, NPO 1 1. Efforts to secure living environment 2. Efforts to produce

Anima imas R River C r Comm mmunit unity F y Forum La Plata County 416 Fire Recovery Efforts

Planning Efforts & Policies Planning Efforts & Policies Strategic Planning for Climate

Efforts to Secure Efforts to Secure Electronic Financial Transactions Electronic Financial

LTS Efforts in Network Mapping LTS Efforts in Network Mapping Dr B Ann Cox Dr B Ann Cox Dr. B.

REASONABLE AND ACTIVE EFFORTS: Ensuring Timely Child Safety, Permanency, and Wellbeing Scott

Japanese EGS experience and Japanese EGS experience and modeling efforts modeling efforts a

1 Product Range Products 2 summary summary summary summary Relays with 8 and 11-Pins

An Ultramarathon Pie with Doge Glaze An Ultramarathon Pie with Doge Glaze Marathon: The Summary

Local Health Summary of 2013 Advisory Council Efforts Presented to the Board of Directors

Feature and model selection Ask your TA any questions related to grading TA office hours

A Bayesian method to detect targets of selection in Evolve-and-Resequence experiments Rui Borges,

The Mathematics of X-ray Tomography Tatiana A. Bubba Department of Mathematics and Statistics,

Swedish IT Incident Centre Pr sterberg SITIC 05-06-30 SITICs task Support society in

Perturbative and non Perturbative calculations of holographic Renyi relative divergence Tomonori

DHS S&T Cyber Security Division (CSD) Overview AIMS-3 Workshop February 9-11, 2011 UCSD

Relative Fisher Information and Natural Gradient for Learning Large Modular Models Ke Sun 1 Frank

Perfect vs. Imperfect Steganography Stegosystems can be divided into two classes: Perfectly

Summary of our efforts Last year Deep CNN for - PowerPoint PPT Presentation

Video Concept Detec-on by Deep Nets with FLAIR Cees Snoek, Koen van de Sande, Daniel Fon2jne Qualcomm Technologies University of Amsterdam Netherlands B.V.

OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR SKILLS OUR

Baldwin Space Summary October 25 1 Baldwin School Space Summary 2 Baldwin School Space Summary

TOGETHER WWW.aLGOduE.iT SUMMARY 06 OuR SPECiaLiSTS 09 OuR 03 fRuiTS 08 OuR RESuLTS OuR

The benefits of multilingualism and efforts to The benefits of multilingualism and efforts to fix

International Efforts and International Efforts and Issues in Urban Search &amp; Issues in Urban

Build Upon Past Efforts Dudley Square Transportation &amp; Air Quality Study BTD/BRA

Resurrection of Fukushima, NPO 1 1. Efforts to secure living environment 2. Efforts to produce

Anima imas R River C r Comm mmunit unity F y Forum La Plata County 416 Fire Recovery Efforts

Planning Efforts &amp; Policies Planning Efforts &amp; Policies Strategic Planning for Climate

Efforts to Secure Efforts to Secure Electronic Financial Transactions Electronic Financial

LTS Efforts in Network Mapping LTS Efforts in Network Mapping Dr B Ann Cox Dr B Ann Cox Dr. B.

REASONABLE AND ACTIVE EFFORTS: Ensuring Timely Child Safety, Permanency, and Wellbeing Scott

Japanese EGS experience and Japanese EGS experience and modeling efforts modeling efforts a

1 Product Range Products 2 summary summary summary summary Relays with 8 and 11-Pins

An Ultramarathon Pie with Doge Glaze An Ultramarathon Pie with Doge Glaze Marathon: The Summary

Local Health Summary of 2013 Advisory Council Efforts Presented to the Board of Directors

Feature and model selection Ask your TA any questions related to grading TA office hours

A Bayesian method to detect targets of selection in Evolve-and-Resequence experiments Rui Borges,

The Mathematics of X-ray Tomography Tatiana A. Bubba Department of Mathematics and Statistics,

Swedish IT Incident Centre Pr sterberg SITIC 05-06-30 SITICs task Support society in

Perturbative and non Perturbative calculations of holographic Renyi relative divergence Tomonori

DHS S&amp;T Cyber Security Division (CSD) Overview AIMS-3 Workshop February 9-11, 2011 UCSD

Relative Fisher Information and Natural Gradient for Learning Large Modular Models Ke Sun 1 Frank

Perfect vs. Imperfect Steganography Stegosystems can be divided into two classes: Perfectly

International Efforts and International Efforts and Issues in Urban Search & Issues in Urban

Build Upon Past Efforts Dudley Square Transportation & Air Quality Study BTD/BRA

Planning Efforts & Policies Planning Efforts & Policies Strategic Planning for Climate

DHS S&T Cyber Security Division (CSD) Overview AIMS-3 Workshop February 9-11, 2011 UCSD