Interactive Image Mining Annie Morin 1 , Nguyen-Khang Pham 1,2 1 - PowerPoint PPT Presentation

Interactive Image Mining Annie Morin 1 , Nguyen-Khang Pham 1,2 1 TEXMEX/IRISA 2 Cantho University, Vietnam

Outline � Image retrieval and motivation � Examples � Image topic discovery � FCA and adaptation on images � Image retrieval � Image indexing using indicators of FCA � Numerical results � Conclusion and future works CARME 2011 2

Image retrieval Sorted list of images Image query Image retrieval System Decreasing similarity Image database CARME 2011 3

Starting Point: Video Google Hessian - Affine detector [Mikolaczyk & Schmid, 2004] - Interest point SIFT computation [Lowe, 2004] detection - SIFT computation Quantification SIFT ∈ R 128 Bag of “visual words” SIFT K-means clustering visual words CARME 2011 4 Video Google [Sivic & Zisserman, 2003]

SIFT descriptor ∈ R 128 SIFT = Scalable Invariant Feature Transform 4 x 4 x 8 (directions) = 128 (dimensions) CARME 2011 5

R 128 6 6 Visual words … (for inst.., k -means) Clustering CARME 2011 SIFT Visual words R 128

7 word N … word 3 CARME 2011 Contingency table word 2 word 1 image M image 1 image 2 …

Data Analysis Techniques Used � Video Google � tf*idf weighting � Probabilistic Latent Semantic Analysis (Sivic et al. , 2005) � Our goal � Replace these techniques by Factorial Correspondence Analysis (FCA) CARME 2011 8

Motivation Success of application of Factorial Correspondence Analysis on textual data CARME 2011 9

Some examples : the Caltech4 database � Caltech4 database � 4090 images divided into 5 categories � faces : 435 � motorbikes : 800 � airplanes : 800 � cars (rear): 1155 � backgrounds : 900 � Vocabulary : 2224 visual words (Sivic et al. , 2005) CARME 2011 10

11 CARME 2011 Caltech4 dataset

12 12 Factorial Correspondence CARME 2011 Analysis

Factorial Correspondence analysis images visual words Image with all its visual words Characteristic visual words for this image in this plane CARME 2011 13

14 CARME 2011 Cars

15 CARME 2011 Motorbikes

16 CARME 2011 Faces

17 CARME 2011 Airplanes

18 CARME 2011 Backgrounds

19 Blue:pos Red:neg Image information extraction CARME 2011

Example : Categorization of images Alogic database (961 images, 1000 visual words) CARME 2011 20

21 Categorization of images CARME 2011

Numerical results: databases � Caltech4 database � 4090 images divided into 5 categories � Vocabulary : 2224 visual words � Nister database � 2250 groups of 4 images � 10200 images in total � 5000 visual words CARME 2011 22

23 CARME 2011 Nistér dataset

24 Experimental results : methods comparison CARME 2011

25 Exhaustive search Experimental results CARME 2011 Nistér-Stewénius

Speeding up the retrieval by using image categorization � One factorial axis � two different topics � One image may belong to several topics � Topics determination: � Contribution to the inertia of an axis � Quality on an axis CARME 2011 26

Image Indexing using FCA � Hypothesis: � Two similar images share some common properties � Indicators of FCA have relevant properties � Representation quality of images on axes � An image is well represented on some axes � Contribution to the inertia of axes � An image highly contributes to the inertia of some axes CARME 2011 27

Image Indexing using FCA � Inverted file � property � Contains images which possess this property � For an axis we construct two inverted files: � Negative part/Positive part � Each file contains images well represented on this axis (or images which highly contribute to the inertia of this axis) CARME 2011 28

Image Indexing using FCA � Retrieval schema � For a given image query � Sort axes by their representation quality (contribution to the inertia) � Determine relevant properties of the query (i.e. some first axes) and take the correspondent inverted files � Merge the inverted files by majority vote � candidate list � Search in the candidate list CARME 2011 29

Inverted files … axis α Topics + - + - F 1 F 1 F 2 F 2 1 0 0 1 image 1 0 1 1 0 image 2 Topics determination 0 0 0 1 image 3 z image i 1 0 1 0 α image 4 i 1 0 0 1 image 5 … Inverted files 1 2 2 1 Coordinates of the images in the factorial space 4 4 3 5 5 CARME 2011 30

Experimental results � Approximative search with inverted files � Method 1: � Inverted files based on the contribution � Method 2: � Inverted files based on the quality of representation � Exhaustive search CARME 2011 31

FCA with indexing technique Precision and time results Times Methods #images 5 img 10 img 50 img 100 img (ms) FCA, exhausted 4090 95.92 94.61 91.35 89.64 0.50 search Representation quality-based 984 95.97 94.63 91.23 89.43 0.15 indexing Contribution- 791 95.85 94.47 91.01 88.93 0.13 based indexing Tf*IDF, exhausted 4090 88.24 84.81 77.52 73.72 2.94 search Caltech4 dataset - 3.5 times faster than FCA without indexing - Number of axes kept: 15 - 20 times faster than TF*IDF CARME 2011 32 - #images: size of candidate list

FCA with indexing technique Methods #images precision Times (ms) FCA, exhausted search 10200 79.82 12.01 FCA with indexing 650 79.75 1.04 nf= 21, nf_thres = 11 FCA with indexing 79.96 397 1.33 nf: auto, nf_thres: auto TF*IDF, exhausted search 10200 73.04 36.45 Nistér dataset - 10 times faster than FCA without indexing #image: size of candidate size - 30 times faster than TF*IDF precision: precision at 4 first CARME 2011 33 returned images

Results Acceleration gain Methods min max mean Inverted files based on 2.7 17.3 6.9 the contribution Inverted files based on the quality of 3.2 13.8 6.7 représentation CARME 2011 34 34

Parallelization of the filtering step � Filtering of non Most of the time is for filtering relevant images � Use inverted files to filter the images sharing few topics with the query � Refinement step � Sequential search among the images candidates Computation time distribution for filtering step and refinement step for 1 million images CARME 2011 35

Results with Nister-Stevenius database +1 million images from FlickR Response time Acceleration Methods P@3 (ms) gain Exhaustive 0.623 860.88 - 0.625 Filtering on 79.99 10.76 CPU 0.625 Filtering on 7.53 114.33 GPU � Method with a parallel filtering on GPU 10 times faster than the method without parallelization of the � filtering step 100 times faster than the exhaustive search � CARME 2011 36

Conclusion � Adaptation of FCA on images for indexing and retrieval � Hierarchical CA to focus on some areas of a factorial plane CARME 2011 37

� Work of J.B Sivic and A Zisserman using probabilistic latent semantic analysis and visual words in images CARME 2011 38

Interactive Image Mining Annie Morin 1 , Nguyen-Khang Pham 1,2 1 - PowerPoint PPT Presentation

Interactive Image Mining Annie Morin 1 , Nguyen-Khang Pham 1,2 1 TEXMEX/IRISA 2 Cantho University, Vietnam Outline Image retrieval and motivation Examples Image topic discovery FCA and adaptation on images Image retrieval

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

Interactive Proofs Lecture 18 AM 1 Interactive Proofs 2 Interactive Proofs IP[k] 2

Cement, Aggregates, Mining Presentation Cement, Aggregates and Mining Cement, Aggregates and

Frequent Pattern Mining Frequent Sequence Mining Frequent Tree Mining Christian Borgelt

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Web MINING Web MINING Overview Overview Dr Ahmed Rafea Rafea Dr Ahmed 1 Web Mining Outline

Image Processing Todays Class Image Representations: Matrices Image Representations: RGB,

Topic 7: Topic 7: Image Morphing Image Morphing 1. 1. Intro to basic image morphing Intro to

Image Features Sanja Fidler CSC420: Intro to Image Understanding 1 / 64 Image Features Image

RGBD Tutorial 14210240041 Gu Pan Image RGB YUV Lab Depth Image RGB image Depth image Each pixel in

Image Features Sanja Fidler CSC420: Intro to Image Understanding 1 / 1 Image Features Image

Web Mining Andreas Andersson Gustav Strmberg Sandra Stendahl Introduction Web mining o

Week 5 Video 1 Relationship Mining Correlation Mining Relationship Mining Discover

3D Vision Viktor Larsson Spring 2019 Schedule Feb 18 Introduction Feb 25 Geometry, Camera

E9 205 Machine Learning for Signal Processing 23-8-17 Outline Basics for Image Processing

ImageProof: Enabling Authentication for Large-Scale Image Retrieval Shangwei Guo 1 Jianliang Xu 1

Computational Photography Si Lu Spring 2018 http://web.cecs.pdx.edu/~lusi/CS510/CS510_Computati

Learning Representations for Visual Object Class Recognition Marcin Marszaek Cordelia Schmid

Texture and materials Subhransu Maji CMPSCI 670: Computer Vision December 1, 2016 CMPSCI 670

image matching presented by Dmytro Mishkin joint work with Anastasia Mishchuk, Milan Pultar,

For Forensics Sake What to do when IR Strikes By : Joe Gumke Joe Gumke Twitter : @joegumke