Classification and Machine Learning techniques for CBIR: - PowerPoint PPT Presentation

Classification and Machine Learning techniques for CBIR: introduction to the RETIN system Matthieu Cord ETIS CNRS UMR 8051 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.1/81

Content-Based Image Retrieval • Retrieve large categories of pictures in generalist image database • Vector-based description of images • User interaction • Statistical learning approach → Multimodality (category retrieval) → Efficient strategies in text retrieval → Interactive strategies (active learning) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.2/81

Outline 1. Binary Classification for CBIR 2. Active learning: (a) Error Reduction and Uncertainly-Based strategies (b) RETIN scheme: Boundary Correction and diversity 3. Semi-supervised classification 4. Long Term Learning Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.3/81

Supervised Classification for CBIR Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.4/81

Introduction • Vector-based description of images; Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.5/81

Introduction • binary classification Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.6/81

Supervised Classification Three representative methods for CBIR: • Bayes Classifiers (Vasconcelos) • k-Nearest Neighbors • Support Vector Machines (Chapelle) Specific characteristics [Chang ICIP’03]: (c1) High dimension and non-linearity of input space (c2) Few training data (c3) Many unlabelled data (c4) Interactive learning (Relevance feedback) (c5) Unbalanced training data Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.7/81

Support Vector Machines (1/4) Classification by an hyperplan: Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.8/81

Support Vector Machines (2/4) Choose the hyperplan which maximizes the margin: Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.9/81

Support Vector Machines (3/4) Quadratic problem: n n α i − 1 � � α ⋆ = argmax α i α j y i y j < x i , x j > 2 α i =1 i,j =1  n �  α i y i = 0  with i =1   ∀ i ∈ [1 , n ] 0 ≤ α i ≤ C Decision function: n � y i α ⋆ f ( x ) = i < x , x i > + b i =1 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.10/81

Support Vector Machines (3/4) Support Vectors: Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.11/81

"Kernelization" Kernelization of SVM: • SVM decision function: N � y i α ⋆ f ( x ) = i < x , x i > + b (1) i =1 • "Kernelized" version: N � y i α ⋆ f ( x ) = i k ( x , x i ) + b (2) i =1 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.12/81

Kernels Dealing with the class of kernels k corresponding to dot product in an induced space H via a map Φ : R p → H Φ : x �→ Φ( x ) that is k ( x , x ′ ) = < Φ( x ) , Φ( x ′ ) > Initial: X Induit: Φ( X ) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.13/81

Kernels • Usual kernels: Lin., Polynomial, Sigmoid, RBF ... • Choice of a kernel depends on the database and its usage: → Different levels of performances for two different kernels; • In our experiments: Gaussian kernels give the best results → The most adapted to CBIR; → In the following experiments: Gaussian kernels with χ 2 distance, because feature vector are distributions. Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.14/81

Spectral analysis of kernel matrices 1.2 1 0.8 Energie 0.6 Performances 0.4 Gaussien Chi2 Triangulaire 0.2 Polynomiale 3 Gaussien L2 Linéaire 0 0 200 400 600 800 1000 Valeur propre Large distribution ⇒ high performances; Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.15/81

SVM and Kernels Deal with (c1) high dimension and non-linear input space: → Use of a kernel function to induce a feature space → Relevance function f using Kernel in SVM: N � y i α ⋆ f ( x ) = i k ( x , x i ) + b i =1 When a method cannot be directly "kernelized": KPCA. Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.16/81

Experiments Protocol: • COREL Photo database (6,000 images); • 50 categories, 100-300 size; • Training set of 200 points (unbalanced). • Statistical measure: Mean Average Precision MAP Methods MAP(%) Time No learning 8 - Bayes/Parzen 18 0.09s k-NN 16 0.20s SVM 20 0.13s • SVM selected [Gosselin CVDB04] Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.17/81

Experiments Training with 10 examples => poor top-similarity ranking results → User interaction (c4) to enhance the retrieval 2 components: the parameter tuning of f and the optimization of the set of examples Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.18/81

Active learning for CBIR Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.19/81

Active Learning Deal with the few training data (c2) and interactive learning (c4) characteristics → optimize training data to get the best classification with as few as possible user labeling Strategies of selective sampling : • Relevance-Based (RB): → Select the most relevant image • Uncertainly-Based (UB) • Error Reduction (ER) → Priority to the classification error minimization Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.20/81

Labelling the most relevant (RB) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.21/81

Labelling the most relevant (RB) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.22/81

Labelling the most relevant (RB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.23/81

Labelling the most relevant (RB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.24/81

Labelling the most difficult to classify (UB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.25/81

Labelling the most difficult to classify (UB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.26/81

Active Learning The aim of an active learner is to select the most interesting picture x ⋆ → We propose to express the following methods as the minimization of a cost function g ( x ) : x ⋆ = argmin g ( x ) x For Relevance-Based active learning: g ( x ) = − f ( x ) where f ( x ) is the relevance function Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.27/81

Uncertainly-based (UB) Active learner: x ⋆ = argmin g ( x ) x • UB strategy selects the picture which is the most difficult to classify: g ( x ) = | f ( x ) | • Method: • SVM active (Tong): → Works in the version space → Needs an accurate estimation of the boundary Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.28/81

Error Reduction (ER) • ER strategy (Roy and McCallum): select the picture which will minimize the new expected test error: � P A +( x ,c ) ˆ g ( x ) = P A ( c | x ) E ˆ c ∈{− 1 , 1 } with: • ˆ P A ( c | x ) the estimation of the probability of class c given x , with the training set A • E ˆ P A +( x ,c ) the estimation of the expectation of the test error, with training set A + ( x , c ) • Require an accurate estimation of ˆ P A ( c | x ) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.29/81

Classification and Machine Learning techniques for CBIR: - PowerPoint PPT Presentation

Classification and Machine Learning techniques for CBIR: introduction to the RETIN system Matthieu Cord ETIS CNRS UMR 8051 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system p.1/81 Content-Based Image

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Video Retrieval - In reality, Video have many additional factors, but of course, based on

MSc Course MACHINE LEARNING TECHNIQUES AND APPLICATIONS Classification with GMM + Bayes 1

Scaling up semantic indexing Mats Sjberg Satoru Ishikawa, Markus Koskela, Jorma Laaksonen,

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

Machine Learning Classification over Encrypted Data Raphael Bost, Raluca Ada Popa, Stephen Tu,

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

APPLIED MACHINE LEARNING Methods for Clustering K-means, Soft K-means DBSCAN 1 MACHINE

Graph Classification Classification Outline Introduction, Overview Classification using

Classification of Symmetry Classification of Symmetry Classification of Symmetry Classification

Classifiers: Support Vector Machine 1 MACHINE LEARNING What is Classification? Female Adult

Reinforcing patient relevance in evidence generation Feedback from breakout session 5C EMAs

QUERY AND DOCUMENT EXPANSION IN TEXT RETRIEVAL Clara Isabel Cabezas University of Maryland

Techniques to improve Dictionary Based CLIR Sai Madhurya Peyyeti KX48810 Different Techniques

Evaluation of Rich and Explicit Feedback for Exploratory Search Esben Srig 1 , Nicolas Collignon

Supporting Survivors of Relationship Violence with Serious Mental Illness Presenter: Annie

TEC Roadshow 2016 Welcome Agenda What well cover today: Welcome TECs current

Undergraduate Research Investment of time Interactions with Performance and effort over an

Alpha Presentation Twitch Comment Rankings and Smart Ads The Capstone Experience Team Amazon