Scaling up semantic indexing

Mats Sjöberg, Satoru Ishikawa, Markus Koskela, Jorma Laaksonen, Erkki Oja
CBIR research group (PicSOM), http://research.ics.tkk.fi/cbir/
Department of Information and Computer Science
Aalto University, School of Science
mats.sjoberg@aalto.fi
About us

◮ The PicSOM group from Aalto University has taken part in TRECVID since 2005.
◮ Before 2010 the university was called Helsinki University of Technology (Aalto = HUT + HSE + UIAH).
◮ This year we participated in the semantic indexing (SIN) and known-item search (KIS) tasks.
Motivation

◮ We are currently working with the Finnish Broadcasting Company (YLE) and the National Audiovisual Archive (KAVA) on content-based analysis of the live TV signal.
◮ This includes fast online semantic indexing of streaming video ⇒ increased emphasis on scalability and speed.
◮ We also want to improve the speed of offline training of detectors.
◮ In TRECVID 2011 we focused on radically improving the speed of both the online and the offline components of the semantic indexing pipeline.
Semantic indexing pipeline

[Pipeline diagram: features 1..N are each fed to their own classifier, and the classifier outputs are combined in a fusion stage.]

◮ (Color)SIFT features + SVM (χ² kernel) + (weighted) geometric mean fusion (see the sketch below).
◮ Similarity Cluster weighting (Wilkins et al., 2007).
◮ Offline: extract features from the training data and train the classifiers (parameter selection is the most time-consuming step).
◮ Online: extract features from new image(s) and predict with the trained detectors.
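A minimal sketch of the (weighted) geometric mean fusion step in Python/NumPy. The function name and the per-feature weights are illustrative stand-ins (e.g. for the Similarity Cluster weights), not the actual PicSOM implementation.

```python
# A minimal sketch of weighted geometric mean fusion of per-feature detector
# scores. Names and weights here are hypothetical placeholders.
import numpy as np

def geometric_mean_fusion(scores, weights=None, eps=1e-12):
    """Fuse detector scores with a (weighted) geometric mean.

    scores  : array of shape (n_features, n_images), values in (0, 1]
    weights : optional array of shape (n_features,); uniform if omitted
    """
    scores = np.asarray(scores, dtype=float)
    if weights is None:
        weights = np.ones(scores.shape[0])
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()
    # Weighted geometric mean, computed in log space for numerical stability.
    return np.exp(np.sum(weights[:, None] * np.log(scores + eps), axis=0))

# Example: three detectors scoring the same four images.
scores = np.array([[0.9, 0.2, 0.6, 0.1],
                   [0.8, 0.3, 0.5, 0.2],
                   [0.7, 0.1, 0.9, 0.3]])
print(geometric_mean_fusion(scores, weights=[2.0, 1.0, 1.0]))
```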
Feature extraction

◮ Bag-of-visual-words (BoV) features have been very successful.
◮ Best results for the PicSOM group in TRECVID: ColorSIFT with dense sampling, a 1x1 + 2x2 spatial pyramid, and soft assignment (a minimal encoding sketch follows below).
◮ However, they are computationally very expensive: about 1 image per second.
◮ Consider: (online) video runs at 25 frames per second (!), and (offline) a 3-million-image database would take about 35 days.
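A minimal sketch of the soft-assignment BoV encoding step, assuming local descriptors (e.g. dense ColorSIFT) have already been extracted. The codebook, descriptor data and kernel width are random placeholders, not values from the PicSOM system.

```python
# Soft-assignment bag-of-visual-words encoding: each descriptor votes for all
# visual words with a Gaussian weight on its distance to the codebook entry.
import numpy as np
from scipy.spatial.distance import cdist

rng = np.random.default_rng(0)
codebook = rng.normal(size=(1000, 128))      # visual codebook (e.g. from k-means)
descriptors = rng.normal(size=(500, 128))    # local descriptors of one image

def soft_assign_histogram(descriptors, codebook, sigma=1.0):
    d = cdist(descriptors, codebook)                 # (n_desc, n_words) distances
    w = np.exp(-d**2 / (2.0 * sigma**2))
    w /= w.sum(axis=1, keepdims=True)                # normalise per descriptor
    return w.sum(axis=0) / len(descriptors)          # image-level histogram

# A 1x1 + 2x2 spatial pyramid would concatenate one such histogram for the
# whole image and one for each of the four image quarters.
hist = soft_assign_histogram(descriptors, codebook)
print(hist.shape)   # (1000,)
```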
Feature extraction, cont.

◮ We have looked at other, non-BoV features.
◮ Local Binary Patterns (LBP)¹: a simple and efficient texture operator, useful e.g. for face description.
◮ A promising choice: the CENsus TRansform hISTogram (Centrist)².
◮ Essentially an LBP histogram reduced to 40 dimensions with PCA, plus mean and standard deviation (a minimal sketch of the census transform follows below).
◮ This is done in a 2-level spatial pyramid, giving a dimensionality of (40 + 2) × (25 + 5 + 1) = 1302.

¹ Pietikäinen, Hadid, Zhao, Ahonen: Computer Vision Using Local Binary Patterns, Springer, 2011.
² Wu, Rehg: CENTRIST: A Visual Descriptor for Scene Categorization, PAMI, 2011.
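A minimal sketch of the census transform histogram underlying Centrist, for a grayscale image stored as a 2-D NumPy array. The PCA reduction to 40 dimensions and the spatial pyramid from the slide are not shown, and the exact comparison convention may differ from the reference implementation.

```python
# Census transform: compare each pixel with its 8 neighbours; each comparison
# contributes one bit, giving an 8-bit code per pixel. Histogram the codes.
import numpy as np

def census_transform_histogram(img):
    img = np.asarray(img, dtype=np.int32)
    center = img[1:-1, 1:-1]
    code = np.zeros_like(center)
    shifts = [(-1, -1), (-1, 0), (-1, 1),
              ( 0, -1),          ( 0, 1),
              ( 1, -1), ( 1, 0), ( 1, 1)]
    for bit, (dy, dx) in enumerate(shifts):
        neighbour = img[1 + dy:img.shape[0] - 1 + dy,
                        1 + dx:img.shape[1] - 1 + dx]
        # Set the bit when the centre pixel is not smaller than the neighbour.
        code |= ((center >= neighbour).astype(np.int32) << bit)
    hist, _ = np.histogram(code, bins=256, range=(0, 256))
    return hist / hist.sum()

# Example on a random "image".
rng = np.random.default_rng(0)
h = census_transform_histogram(rng.integers(0, 256, size=(120, 160)))
print(h.shape)   # (256,)
```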
SIFT vs Centrist

Example: extracting features for 2268 images
◮ ColorSIFT: 43 minutes, about 1 image per second
◮ Centrist: 49 seconds, about 50 images per second

Centrist is roughly 50 times faster. Now live video starts to look feasible!
Training classifiers

◮ Kernel SVMs are state-of-the-art, but computationally expensive.
◮ Linear classifiers are fast, but less accurate.
◮ Training is an offline step, but its cost still constrains the database size and the concept vocabulary, and leaves less room for experimentation.

Parameter selection is the most time-consuming phase (a rough comparison of the two classifiers is sketched below):
◮ the C-SVM has two parameters (C, γ) (LIBSVM¹),
◮ the linear classifier (L2-regularised logistic regression solver from LIBLINEAR) has only one parameter (C).

¹ Chih-Chung Chang and Chih-Jen Lin, LIBSVM: a library for support vector machines, ACM TIST, 2011.
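A minimal sketch contrasting the two classifier options on histogram-type features, using scikit-learn as a stand-in for LIBSVM/LIBLINEAR: an SVM with a precomputed χ² kernel versus L2-regularised logistic regression with the LIBLINEAR solver. The random data is a placeholder for BoV or Centrist features, and the parameter values are arbitrary.

```python
import numpy as np
from sklearn.metrics.pairwise import chi2_kernel
from sklearn.svm import SVC
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X_train = rng.random((200, 1302))       # e.g. Centrist-sized feature vectors
y_train = rng.integers(0, 2, size=200)  # concept present / absent
X_test = rng.random((20, 1302))

# Kernel SVM: two parameters to tune (C and the kernel's gamma).
K_train = chi2_kernel(X_train, gamma=0.5)
svm = SVC(C=1.0, kernel="precomputed", probability=True).fit(K_train, y_train)
svm_scores = svm.predict_proba(chi2_kernel(X_test, X_train, gamma=0.5))[:, 1]

# Linear classifier: only C to tune, and much cheaper to train and apply.
lin = LogisticRegression(C=1.0, solver="liblinear").fit(X_train, y_train)
lin_scores = lin.predict_proba(X_test)[:, 1]

print(svm_scores[:3], lin_scores[:3])
```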
Training classifiers, cont.

◮ Parameter selection times in TRECVID 2011, with a somewhat naive line search followed by a grid search (a minimal sketch is shown below).
◮ SVM: on average 3 days!
◮ Linear: on average a bit more than 1 hour!
◮ (A strong bias towards the SVM, since our cluster has a maximum run time of 7 days!)

            SVM (hours)   linear (hours)   speedup (×)
min               0.6            0.2            3.5
max             168.0            4.2           40.3
median           33.9            1.2           27.2
average          79.1            1.3           61.1
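A minimal sketch of the "line search followed by grid search" idea, shown here for the linear classifier's single parameter C with cross-validated average precision as the criterion. The coarse and fine ranges, the data, and the scoring setup are illustrative assumptions, not the values used in the PicSOM runs (for the SVM, the same pattern would also cover γ).

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.random((300, 1302))
y = rng.integers(0, 2, size=300)

def cv_score(C):
    clf = LogisticRegression(C=C, solver="liblinear")
    return cross_val_score(clf, X, y, cv=3, scoring="average_precision").mean()

# 1) Coarse line search over powers of ten.
coarse = [10.0 ** k for k in range(-3, 4)]
best_C = max(coarse, key=cv_score)

# 2) Finer grid around the best coarse value.
fine = best_C * np.array([0.25, 0.5, 1.0, 2.0, 4.0])
best_C = max(fine, key=cv_score)
print("selected C:", best_C)
```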
Prediction with trained classifiers

◮ Critical in the online scenario: detect concepts in new images.
◮ Prediction with LIBSVM takes around 100–500 milliseconds per image with ColorSIFT features.
◮ Consider: with 300 concepts (e.g. TRECVID) this is on the order of 100 seconds per image.
◮ LIBLINEAR takes 1–3 milliseconds per image.
◮ That is on the order of 1 second per image or less for all 300 concepts (see the sketch below).
◮ Real-time video is typically 25 frames per second or more; of course, not all frames need to be classified.
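A minimal sketch of why linear prediction scales to hundreds of concepts: with the per-concept weight vectors stacked into a matrix, scoring one image for all 300 concepts is a single matrix-vector product. The shapes, weights and timings here are illustrative placeholders.

```python
import time
import numpy as np

n_concepts, dim = 300, 1302
rng = np.random.default_rng(0)
W = rng.normal(size=(n_concepts, dim))   # one trained linear model per concept
b = rng.normal(size=n_concepts)          # per-concept bias terms
x = rng.random(dim)                      # feature vector of one new image

t0 = time.perf_counter()
scores = W @ x + b                       # decision values for all 300 concepts
probs = 1.0 / (1.0 + np.exp(-scores))    # logistic outputs, as in LIBLINEAR's LR
t1 = time.perf_counter()
print(f"{n_concepts} concepts scored in {(t1 - t0) * 1e3:.2f} ms")
```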
Experiments

classifier   feature         MXIAP
SVM          ColorSIFT       0.1233
             SIFT            0.1139
             Centrist        0.0939
linear       ColorSIFT       0.0329
             SIFT            0.0292
             Centrist        0.0289
             EdgeFourier     0.0101
             ScalableColor   0.0182

◮ Centrist is not quite as good as the BoV features, but quite good considering the 50-fold speedup.
◮ LIBLINEAR is much worse than LIBSVM for single features.
Time estimates

classifier + features       MXIAP    offline (days)   online (secs)
SVM ColorSIFT               0.1233        77.0            45.6
SVM Centrist                0.0939         5.5            45.0
SVM 3 best fusion           0.1363       123.3           136.0
linear ColorSIFT            0.0329        73.7             1.1
linear 3 best fusion        0.0827       113.5             2.3
linear 12 fusion            0.0986       189.2             7.0
linear 14 fusion            0.1145       591.2            11.4
SVM Centrist + linear 10    0.1116        81.2            50.2
SVM 3 + linear 14           0.1398       601.1           146.4

◮ Rough estimates of offline and online processing times (a back-of-the-envelope sketch of the online figures follows below).
◮ Scenario: a database of 1M images, detecting 300 concepts online.
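A back-of-the-envelope sketch of how per-image online figures of this kind can be composed: feature extraction time plus one prediction per concept. The decomposition and the approximate per-step costs (taken from the earlier slides) are assumptions for illustration, not the exact computation behind the table.

```python
N_CONCEPTS = 300

# Approximate per-image costs in seconds, based on the earlier slides.
extract = {"ColorSIFT": 1.0, "Centrist": 0.02}
predict = {"SVM": 0.15, "linear": 0.002}

def online_seconds(feature, classifier):
    return extract[feature] + N_CONCEPTS * predict[classifier]

for feature in extract:
    for classifier in predict:
        print(f"{classifier:6s} {feature:10s} ~{online_seconds(feature, classifier):5.1f} s/image")
```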
Time estimates, cont.
(same table as above)

◮ The Centrist result is in the same order of magnitude as ColorSIFT, but much faster to calculate.
◮ The linear results improve strongly when more features are added.
◮ Even with five times more features, the linear classifier gives a roughly 10-fold speed increase compared to the SVM.
◮ Linear prediction is fast even with many features.
Conclusions

◮ For offline speed, fast feature calculation is the most critical factor.
◮ Centrist is about 50 times faster than our best BoV feature.
◮ For online speed, the prediction time of the classifier is the most critical factor.
◮ The linear classifier is 50–100 times faster than the kernel SVM.
◮ With many features, the linear classifier can achieve an MXIAP of the same order of magnitude as the single best SVM.