Photo Annotation and Concept-Based Retrieval Tasks Eleftherios - PowerPoint PPT Presentation

Photo Annotation Concept-based Retrieval Results Conclusions MLKD's Participation at the ImageCLEF 2011 Photo Annotation and Concept-Based Retrieval Tasks Eleftherios Spyromitros-Xioufis, Konstantinos Sechidis, Grigorios Tsoumakas and Ioannis Vlahavas Machine Learning and Knowledge Discovery Group, Department of Informatics, Aristotle University of Thessaloniki, Greece 1 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Photo annotation task Sunset Plants Trees Aesthetic Day Sky Partly blurred Calm Outdoor Cute • A multi-label classification problem (each image belongs to many concepts) • Evaluation measures: 1. Mean interpolated average precision ( MIAP ) 2. Example-based F-measure ( F-ex ) 3. Semantic R-precision ( SR-Precision ) • Model selection: based on Mean Average Precision (MAP) • MAP estimation: 3 fold cross-validation on the 8000 training images • 5 submissions in total: • Visual • Textual • 2 Multi-modal (3 variations) CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Visual model – feature extraction • The ColorDescriptor [van de Sande et al., 2010] software was used for visual feature extraction • 2 point detection strategies: Harris-Laplace, Dense Sampling • 7 descriptors: SIFT, HSV-SIFT, HueSIFT, OpponentSIFT, C-SIFT, rgSIFT and RGB-SIFT • Codebook generation • K-means (other?) clustering on 250,000 randomly sampled points (more points?) • Codebook size (k) fixed to 4096 words (more words?) • Hard assignment of points to clusters • 14 multi-label training datasets in total • #features: 4096 • #labels: 99 3 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Visual model – learning method • The Binary Relevance (problem transformation) method was used: • Transforms the multi-label classification task into multiple binary classification tasks • Any single-label classifier can be used (Random Forest #trees:150 #features:40 ) • Instance weighting to deal with class imbalance: 𝑛𝑗𝑜:𝑛𝑏𝑘 𝑛𝑗𝑜:𝑛𝑏𝑘 𝑥 𝑛𝑗𝑜 = 𝑥 𝑛𝑏𝑘 = 𝑛𝑗𝑜 𝑛𝑏𝑘 Training set for 𝝁 𝟐 𝒈 𝟐 𝒈 𝟑 𝒈 𝟓𝟏𝟘𝟕 𝝁 𝟐 𝝁 𝟑 𝝁 𝟘𝟘 … … 𝒚 𝟐 1 0 … 1 0 1 … 1 𝒚 𝟑 0 1 … 0 1 0 … 0 … … … … … … … … … 𝒚 𝟗𝑳 0 0 … 1 0 0 … 1 Feature Space Target 4 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Visual model – learning method • The Binary Relevance (problem transformation) method was used: • Transforms the multi-label classification task into multiple binary classification tasks • Any single-label classifier can be used (Random Forest #trees:150 #features:40 ) • Instance weighting to deal with class imbalance: 𝑛𝑗𝑜:𝑛𝑏𝑘 𝑛𝑗𝑜:𝑛𝑏𝑘 𝑥 𝑛𝑗𝑜 = 𝑥 𝑛𝑏𝑘 = 𝑛𝑗𝑜 𝑛𝑏𝑘 Training set for 𝝁 𝟑 𝒈 𝟐 𝒈 𝟑 𝒈 𝟓𝟏𝟘𝟕 𝝁 𝟐 𝝁 𝟑 𝝁 𝟘𝟘 … … 𝒚 𝟐 1 0 … 1 0 1 … 1 𝒚 𝟑 0 1 … 0 1 0 … 0 … … … … … … … … … 𝒚 𝟗𝑳 0 0 … 1 0 0 … 1 Target Feature Space 5 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Visual model – learning method • The Binary Relevance (problem transformation) method was used: • Transforms the multi-label classification task into multiple binary classification tasks • Any single-label classifier can be used (Random Forest #trees:150 #features:40 ) • Instance weighting to deal with class imbalance: 𝑛𝑗𝑜:𝑛𝑏𝑘 𝑛𝑗𝑜:𝑛𝑏𝑘 𝑥 𝑛𝑗𝑜 = 𝑥 𝑛𝑏𝑘 = 𝑛𝑗𝑜 𝑛𝑏𝑘 Training set for 𝝁 𝟘𝟘 𝒈 𝟐 𝒈 𝟑 𝒈 𝟓𝟏𝟘𝟕 𝝁 𝟐 𝝁 𝟑 𝝁 𝟘𝟘 … … 𝒚 𝟐 1 0 … 1 0 1 … 1 𝒚 𝟑 0 1 … 0 1 0 … 0 … … … … … … … … … 𝒚 𝟗𝑳 0 0 … 1 0 0 … 1 Target Feature Space 6 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Textual model – feature extraction • Flickr user tags were used • Initial vocabulary: the union of tag sets of the training images • Stemming : porter stemmer (English..) & stop word removal -> 27000 stems 2 • Feature selection using 𝜓 𝑛𝑏𝑦 criterion [Lewis et al., 2004] : 𝜓 2 statistic for each feature with respect to each label is calculated • Features are ranked according to their maximum 𝜓 2 score across all labels • • After evaluation of different sizes top 4000 features were selected 7 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Textual model – learning method • Ensemble of Classifier Chains (ECC) [Read et al., 2009] : • Random chains are created • Feature set for each label in the chains is augmented with the previous labels • Able to capture correlations, class imbalance is still a problem Chain order: 1,2,..,99 Training set for 𝝁 𝟐 𝒈 𝟐 𝒈 𝟑 𝒈 𝟓𝟏𝟏𝟏 𝝁 𝟐 𝝁 𝟑 𝝁 𝟘𝟘 … … 𝒚 𝟐 1 0 … 1 0 1 … 1 𝒚 𝟑 0 1 … 0 1 0 … 0 … … … … … … … … … 𝒚 𝟗𝑳 0 0 … 1 0 0 … 1 Feature Space Target 8 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Textual model – learning method • Ensemble of Classifier Chains (ECC) [Read et al., 2009] : • Random chains are created • Feature set for each label in the chains is augmented with the previous labels • Able to capture correlations, class imbalance is still a problem Chain order: 1,2,..,99 Training set for 𝝁 𝟑 𝒈 𝟐 𝒈 𝟑 𝒈 𝟓𝟏𝟏𝟏 𝝁 𝟐 𝝁 𝟑 𝝁 𝟘𝟘 … … 𝒚 𝟐 1 0 … 1 0 1 … 1 𝒚 𝟑 0 1 … 0 1 0 … 0 … … … … … … … … … 𝒚 𝟗𝑳 0 0 … 1 0 0 … 1 Feature Space Target 9 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Textual model – learning method • ECC is also a problem transformation method: • Again coupled with Random Forest as base classifier (#trees:10, #features:default) • Ensemble size: 15 (150 random trees in total for each label) • Again instance weighting for class imbalance Chain order: 1,2,..,99 Training set for 𝝁 𝟘𝟘 𝒈 𝟐 𝒈 𝟑 𝒈 𝟓𝟏𝟏𝟏 𝝁 𝟐 𝝁 𝟑 𝝁 𝟘𝟘 … … 𝒚 𝟐 1 0 … 1 0 1 … 1 𝒚 𝟑 0 1 … 0 1 0 … 0 … … … … … … … … … 𝒚 𝟗𝑳 0 0 … 1 0 0 … 1 Feature Space Target 10 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Setup Photo Annotation Visual Concept-based Retrieval Textual Results Multi-modal Conclusions Thresholding Multi-modal Harris-Laplace model 𝑞 ℎ𝑚 𝑑 𝑘 𝑦 𝑗 ∀𝑘 7 descriptor average 𝑦 𝑗 𝑞 𝑒𝑡 𝑑 𝑘 𝑦 𝑗 ∀𝑘 Dense-sampling model Averaging/ 𝑞 𝑑 𝑘 𝑦 𝑗 ∀𝑘 7 descriptor average Arbitrator 𝑞 𝑔𝑚𝑗𝑑𝑙𝑠 𝑑 𝑘 𝑦 𝑗 ∀𝑘 Textual model • A hierarchical late fusion scheme: • 3 different views of the images: • Harris Laplace -> concepts related to objects (Fish and Ship) • Dense sampling -> concepts related to scenes (Night and Macro) • Textual - > concepts which are typically tagged by users (Dog , Insect, …) • 2 ways to combine the 3 different views: • Averaging • Arbitrator (the best view based on internal evaluation) 11 CLEF 2011, 19-22 September 2011, Amsterdam Eleftherios Spyromitros – Xioufis | espyromi@csd.auth.gr

Photo Annotation and Concept-Based Retrieval Tasks Eleftherios - PowerPoint PPT Presentation

Photo Annotation Concept-based Retrieval Results Conclusions MLKD's Participation at the ImageCLEF 2011 Photo Annotation and Concept-Based Retrieval Tasks Eleftherios Spyromitros-Xioufis, Konstantinos Sechidis, Grigorios Tsoumakas and Ioannis

Image organization, annotation, Image organization, annotation, and retrieval from a human- -

Text REtrieval Conference (TREC) Question Answering Tasks and Evaluation Methods Hoa Trang Dang

Annotation for Arabic Information Retrieval Ashraf I. Kaloub Rebhi S. Baraka Alaqsa-Community

Retrieval Max Gubin mail@maxgubin.com Information Retrieval History 4000 1950 2000 BC

We Dont Need No Annotation (Efficient Training for Image Retrieval) Ondra Chum Visual

Voice Based Information Retrieval System How far is it from text based retrieval system? PRAJNA

Content-Based Image Retrieval Queries Commercial Systems Retrieval Features

CONCEPT DRIFT IN ONTOLOGY MAPPING AND SEMANTIC ANNOTATION ADAPTATION Cdric PRUSKI

Annotation of Video and Film Matthias Zeppelzauer St. Plten University of Applied Sciences

Information Retrieval Based on Extraction of Domain Specific Information Retrieval Based on

Deep Learning Based Semantic Video Indexing and Retrieval Anna Podlesnaya, Sergey Podlesnyy

XML Retrieval XML Retrieval XML Retrieval XML Retrieval DB/IR in DB/IR in Theory Theory Web

xDOC: A System for XML Based Document Annotation and Searching Michael K. Baldwin Department of

What are the Big Ideas of an IB Education? IB is Concept Based A concept is a big

Web Annotations Building the Experience Annotation An annotation is something added. It is not

Lecture 2 Annotation tools & Segmentation Summary of Part 1 Annotation theory

Teamware: A Collaborative, Teamware: A Collaborative, Web-based Annotation Environment Web-based

Content-Based Retrieval (CBR) - In Multimedia Systems, a mini-handbook Author: Chao Cai ID:

Content Based Image Retrieval Techniques Ambrose Tuscano (atuscan1@umbc.edu) University of

A Study of KIND Metaphor and Simile Annotation based on Parsing and ConceptNet Meng-Hsien Shih

Passage Based Retrieval (COSC 488) Nazli Goharian nazli@cs.georgetown.edu 1 Passage Based

Concept Mix : Self-Service Analytical Data Integration Based on the Concept-Oriented Model

LDA-Based Document Models for Ad-hoc Retrieval (Xing Wei and W. Bruce Croft) Haiyun Jin, Lei Ju,

Models for Retrieval Models for Retrieval 1. HMM/N-gram-based 2. Latent Semantic Indexing (LSI)

Photo Annotation and Concept-Based Retrieval Tasks Eleftherios - PowerPoint PPT Presentation

Photo Annotation Concept-based Retrieval Results Conclusions MLKD's Participation at the ImageCLEF 2011 Photo Annotation and Concept-Based Retrieval Tasks Eleftherios Spyromitros-Xioufis, Konstantinos Sechidis, Grigorios Tsoumakas and Ioannis

Image organization, annotation, Image organization, annotation, and retrieval from a human- -

Text REtrieval Conference (TREC) Question Answering Tasks and Evaluation Methods Hoa Trang Dang

Annotation for Arabic Information Retrieval Ashraf I. Kaloub Rebhi S. Baraka Alaqsa-Community

Retrieval Max Gubin mail@maxgubin.com Information Retrieval History 4000 1950 2000 BC

We Dont Need No Annotation (Efficient Training for Image Retrieval) Ondra Chum Visual

Voice Based Information Retrieval System How far is it from text based retrieval system? PRAJNA

Content-Based Image Retrieval Queries Commercial Systems Retrieval Features

CONCEPT DRIFT IN ONTOLOGY MAPPING AND SEMANTIC ANNOTATION ADAPTATION Cdric PRUSKI

Annotation of Video and Film Matthias Zeppelzauer St. Plten University of Applied Sciences

Information Retrieval Based on Extraction of Domain Specific Information Retrieval Based on

Deep Learning Based Semantic Video Indexing and Retrieval Anna Podlesnaya, Sergey Podlesnyy

XML Retrieval XML Retrieval XML Retrieval XML Retrieval DB/IR in DB/IR in Theory Theory Web

xDOC: A System for XML Based Document Annotation and Searching Michael K. Baldwin Department of

What are the Big Ideas of an IB Education? IB is Concept Based A concept is a big

Web Annotations Building the Experience Annotation An annotation is something added. It is not

Lecture 2 Annotation tools &amp; Segmentation Summary of Part 1 Annotation theory

Teamware: A Collaborative, Teamware: A Collaborative, Web-based Annotation Environment Web-based

Content-Based Retrieval (CBR) - In Multimedia Systems, a mini-handbook Author: Chao Cai ID:

Content Based Image Retrieval Techniques Ambrose Tuscano (atuscan1@umbc.edu) University of

A Study of KIND Metaphor and Simile Annotation based on Parsing and ConceptNet Meng-Hsien Shih

Passage Based Retrieval (COSC 488) Nazli Goharian nazli@cs.georgetown.edu 1 Passage Based

Concept Mix : Self-Service Analytical Data Integration Based on the Concept-Oriented Model

LDA-Based Document Models for Ad-hoc Retrieval (Xing Wei and W. Bruce Croft) Haiyun Jin, Lei Ju,

Models for Retrieval Models for Retrieval 1. HMM/N-gram-based 2. Latent Semantic Indexing (LSI)

Lecture 2 Annotation tools & Segmentation Summary of Part 1 Annotation theory