Sifting through images with Multinomial Relevance Feedback
Dorota Głowacka, Alan Medlar and John Shawe-Taylor
University College London
December 8, 2010
Image Retrieval Problem
◮ Problem: content-based image retrieval when the user is unable to specify the required content through tags or other properties of the images.
◮ The system must extract information from the user through limited feedback.
◮ Solution: a protocol that operates through a sequence of rounds, in each of which a set of images is displayed and the user must indicate which is closest to their target.
◮ A novel approach that uses the Dirichlet distribution, the conjugate prior to the multinomial distribution, to model the system's knowledge about the expected responses to the images.
Comparative Feedback
◮ The search engine supports the user in finding an image matching her query.
◮ The search engine calculates a set of images $x_{i,1}, \dots, x_{i,k}$ and presents them to the user.
◮ If one of the images matches the user's query, then the search terminates.
◮ Otherwise the user chooses one of the images $x_i^*$ as most relevant, according to a distribution $D\{x_i^* = x_{i,j} \mid x_{i,1}, \dots, x_{i,k}; t\}$, where $t$ denotes the ideal target image.
Comparative Feedback
◮ Following Auer & Leung (2009), we assume the following probability of choosing image $x_{i,j}$:
$$D\{x_i^* = x_{i,j} \mid x_{i,1}, \dots, x_{i,k}; t\} = (1-\alpha)\,\frac{S(x_{i,j}, t)}{\sum_{j=1}^{k} S(x_{i,j}, t)} + \frac{\alpha}{k}$$
◮ The similarity measure $S(\cdot,\cdot)$ is given by
$$S(x, t) = \exp\{-a\, d(x, t)^2\}$$
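A minimal sketch, in Python with NumPy, of the assumed choice distribution above; the array layout, feature values and parameter settings are illustrative and not taken from the paper.

```python
import numpy as np

def choice_distribution(displayed, target, a=1.0, alpha=0.1):
    """Probability that the user picks each displayed image as most relevant,
    under the comparative feedback model above:
    (1 - alpha) * S(x_j, t) / sum_j S(x_j, t) + alpha / k,
    with S(x, t) = exp(-a * d(x, t)^2)."""
    d = np.linalg.norm(displayed - target, axis=1)   # distances d(x_j, t)
    s = np.exp(-a * d ** 2)                          # similarities S(x_j, t)
    k = len(displayed)
    return (1 - alpha) * s / s.sum() + alpha / k

# Illustrative example: 5 displayed images with 3-dimensional features
rng = np.random.default_rng(0)
displayed = rng.random((5, 3))
target = rng.random(3)
print(choice_distribution(displayed, target))
```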
The Dirichlet Sampling Search Algorithm
◮ Problem: model our knowledge about the user's interests.
◮ The user feedback can be viewed as a multinomial distribution.
◮ A natural choice for the model is its conjugate prior.
◮ The algorithm is based on the Dirichlet Process:
$$P(\Theta \mid \alpha, M) = \frac{\Gamma(\alpha)}{\prod_{i=1}^{n} \Gamma(\alpha m_i)} \prod_{i=1}^{n} \theta_i^{\alpha m_i - 1}$$
where $M = \{m_1, m_2, \dots, m_n\}$ is the base measure and the mean value of $\Theta$, and $\alpha$ is a precision parameter.
The Dirichlet Sampling Search Algorithm
◮ The posterior has updated precision parameter $\alpha^* = \alpha + 1$ and base measure
$$M^* = \frac{\alpha M + \frac{1}{n_i}\mathbf{1}_{X_i}}{\alpha + 1}$$
where $X_i$ is the partition of size $n_i$.
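A minimal sketch of this posterior update, assuming the base measure is stored as one weight per image and the observed choice is spread uniformly over the partition cell it falls in; the function and variable names are ours.

```python
import numpy as np

def dirichlet_posterior(M, alpha, cell):
    """One-step update of the Dirichlet prior Dir(alpha * M) after the user's
    choice falls in partition cell `cell` (a list of image indices):
    alpha* = alpha + 1, M* = (alpha * M + (1/n_i) * 1_{X_i}) / (alpha + 1)."""
    M = np.asarray(M, dtype=float)
    indicator = np.zeros_like(M)
    indicator[cell] = 1.0 / len(cell)        # unit mass spread over the cell X_i
    return (alpha * M + indicator) / (alpha + 1), alpha + 1

# Illustrative example: 6 images, uniform base measure, observation in cell {1, 2}
M_star, alpha_star = dirichlet_posterior(np.full(6, 1 / 6), alpha=2.0, cell=[1, 2])
print(M_star, alpha_star)
```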
Image Selection
◮ Trade-off between exploration and exploitation.
◮ Draw k samples from the posterior distribution and select the image with the highest probability.
◮ Images with higher weights m are more likely to be relevant and thus more likely to be presented to the user.
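One way to realise this selection step is Thompson-style sampling from the Dirichlet posterior, as in the sketch below; reading "select the image with the highest probability" as one image kept per posterior draw is our interpretation, not a detail confirmed by the slides.

```python
import numpy as np

def select_images(M, alpha, k, rng=None):
    """Select k images to display by sampling from the Dirichlet posterior:
    repeatedly draw Theta ~ Dir(alpha * M) and keep the image with the highest
    sampled probability, until k distinct images are collected. Sampling
    (rather than always taking the top-k weights) trades off exploration
    and exploitation."""
    rng = rng or np.random.default_rng()
    chosen = set()
    while len(chosen) < k:
        theta = rng.dirichlet(alpha * np.asarray(M))
        chosen.add(int(np.argmax(theta)))    # winner of this posterior draw
    return sorted(chosen)

print(select_images(M=np.full(20, 1 / 20), alpha=2.0, k=5))
```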
Experimental setup
◮ The data: the VOC2007 dataset with 23 categories and 9963 images.
◮ Each image is annotated with bounding boxes; the feature value for an object class is the size (as calculated from the bounding box) of the largest object of that class in the image.
◮ If no object from a particular class is present, then the feature value is 0.
◮ We set k = 2, 5, 10, so that only 2, 5 or 10 images are presented to the user in each iteration.
◮ All the results are averaged over 1000 searches for randomly selected target images from the dataset.
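A small sketch of the feature construction just described; the annotation tuple format is a hypothetical simplification, not the actual VOC annotation schema.

```python
def image_features(annotations, n_classes):
    """Build the feature vector described above: for each object class, the
    area of the largest bounding box of that class in the image, or 0 if the
    class is absent. `annotations` is a list of
    (class_index, xmin, ymin, xmax, ymax) tuples (illustrative format)."""
    features = [0.0] * n_classes
    for cls, xmin, ymin, xmax, ymax in annotations:
        area = (xmax - xmin) * (ymax - ymin)
        features[cls] = max(features[cls], area)
    return features

# Example: one object of class 0 and two objects of class 3 (hypothetical indices)
print(image_features([(0, 10, 10, 60, 110), (3, 0, 0, 30, 30), (3, 5, 5, 80, 90)],
                     n_classes=5))
```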
PicHunter
◮ PicHunter (Cox et al. 2000) uses Bayes' rule to predict the user's target image.
◮ The system maintains a set of probabilities $p_{1:n}$ for every image $x_{1:n}$ in a dataset. Initially, all the probabilities $p_i = \frac{1}{n}$.
◮ After each iteration, the system estimates the probability that image $x_i$ is the user's target image.
◮ The probabilities are updated as $p_i \leftarrow p_i \cdot G(d(x_i, s_m))$, where $d(x_i, s_m) = \|x_i - s_m\|$ is the distance between $x_i$ and the image $s_m$ selected by the user in iteration $m$, and $G$ is defined as:
$$G(d(x_i, s_m)) = \frac{\exp(-d(x_i, s_m)/\sigma)}{\sum_{j=1}^{n} \exp(-d(x_j, s_m)/\sigma)}$$
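A minimal sketch of this update rule; the feature data and the value of σ are illustrative, and renormalising p at the end is our assumption about how the probabilities are kept a distribution.

```python
import numpy as np

def pichunter_update(p, X, selected, sigma=1.0):
    """Sketch of the PicHunter-style Bayesian update described above: rescale
    each image's target probability by the normalised weight G of its distance
    to the image the user selected, then renormalise p."""
    d = np.linalg.norm(X - X[selected], axis=1)   # d(x_i, s_m)
    G = np.exp(-d / sigma)
    G /= G.sum()                                  # the normalised weights G
    p = p * G
    return p / p.sum()

# Illustrative use: 100 images with 4-dimensional features, user selects image 7
rng = np.random.default_rng(1)
X = rng.random((100, 4))
p = np.full(100, 1 / 100)
print(pichunter_update(p, X, selected=7, sigma=0.5)[:5])
```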
Auer & Leung Algorithm
◮ A weighting scheme that demotes less relevant images by a discount factor β.
◮ Initially, the weights of all images are set to 1.
◮ At each iteration, a set of images is presented to the user, who chooses one of the images as most relevant.
◮ If the search has not terminated, all the images presented to the user have their weights set to 0.
◮ All the images far from the image selected by the user are demoted by the discount factor β.
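A hedged sketch of this weighting scheme; the fixed distance threshold used to decide which images count as "far" is our illustrative stand-in for the algorithm's actual criterion.

```python
import numpy as np

def al_update(w, X, displayed, selected, beta=0.5, radius=0.5):
    """Sketch of the weighting step described above: zero the weights of the
    images just displayed, then multiply by the discount factor beta the
    weights of images far from the user's selection. `radius` is an
    illustrative notion of 'far', not the algorithm's definition."""
    w = w.copy()
    w[displayed] = 0.0                            # displayed but not the target
    d = np.linalg.norm(X - X[selected], axis=1)
    w[d > radius] *= beta                         # demote distant images
    return w

rng = np.random.default_rng(2)
X = rng.random((50, 3))
w = al_update(np.ones(50), X, displayed=[3, 8, 21, 40], selected=8)
print(w[:10])
```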
Comparison

Table: Comparison of the performance of the AL algorithm, the DS algorithm and PicHunter (PH) as the value of k increases.

                    k = 2               k = 5               k = 10
Target Size     AL    DS    PH      AL    DS   PH       AL   DS   PH
      1        845   330  1188     431   123  448      228   71  216
      5        448    99   917      92    46  356       43   24  264
     10        219    60   733      51    28  172       22   16  136
Sparse data representation
◮ With large datasets, calculating the distances of all images from the k images presented to the user is expensive.
◮ We produce a small dataset of images selected randomly from the large dataset.
◮ At each iteration we replace the images with the lowest probability with new images selected from the large dataset, as sketched below.
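A minimal sketch of this active-set refresh, assuming the small dataset is tracked as an index array with per-image weights; the replacement count and the weight assigned to incoming images are illustrative choices.

```python
import numpy as np

def refresh_active_set(active, M, full_size, n_replace, rng=None):
    """Sketch of the sparse representation described above: keep a small
    active set of image indices and, at each iteration, swap the n_replace
    lowest-probability members for images drawn at random from the full
    dataset."""
    rng = rng or np.random.default_rng()
    worst = np.argsort(M)[:n_replace]                       # lowest-weight slots
    candidates = np.setdiff1d(np.arange(full_size), active)
    active = active.copy()
    active[worst] = rng.choice(candidates, size=n_replace, replace=False)
    M = M.copy()
    M[worst] = M.mean()                                     # average weight for newcomers
    return active, M / M.sum()

# Illustrative use: an active set of 200 images drawn from a dataset of 10000
active, M = refresh_active_set(np.arange(200), np.full(200, 1 / 200),
                               full_size=10000, n_replace=20)
```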
Sparse data representation
Figure: (a) The average distance from the target of the images shown to the user in 30 iterations of the DS algorithm with and without sparse data representation; (b) The distance of the image closest to the target in each iteration.
Experiments with real users
Experiments with real users
◮ 10 users performed 4 searches using the DS algorithm, the AL algorithm as well as a random search (40 searches altogether).
◮ 6 images were presented to the user at each iteration.
◮ Users were instructed to terminate the search when they found the target image or after 50 iterations of the search algorithm.
◮ The average number of iterations to find the target image using the DS algorithm was 29.
◮ The average number of iterations to find the target image using the AL algorithm and random search was 47 and 48, respectively.
Experiments with real users
Figure: (a) The average distance from the target of the images shown to the user in the first 10 iterations of the DS algorithm, the AL algorithm and random search; (b) The distance of the image closest to the target in each iteration.
Conclusions
◮ A new approach to content-based image retrieval based on multinomial relevance feedback.
◮ The model suggests an algorithm for generating images for presentation that trades off exploration and exploitation.
◮ The model allows us to make predictions about the scaling and convergence properties of the algorithm.
◮ The experiments confirm that the new approach outperforms earlier work based on more heuristic strategies.