Selecting Effective Expansion Terms for Diversity
S. Vargas, R.L.T. Santos, C. Macdonald and I. Ounis
Query Ambiguity and Underspecification
Query Expansion
● Example: the query q = "spiders" is expanded into q* = "types of spiders in Europe".
● Feedback sources: Relevance Feedback, Pseudo-Relevance Feedback.
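As an illustration of the pseudo-relevance feedback idea, here is a minimal sketch that expands a query with the most frequent terms from the top-ranked documents. This is a deliberately naive frequency-based variant; the experiments later in the deck use divergence-from-randomness models (Bo1, KL) in Terrier, and all names below are illustrative.

```python
from collections import Counter

def prf_expand(query, ranked_docs, k=5, n_terms=6):
    """Naive pseudo-relevance feedback: take the top-k documents
    retrieved for the original query and append the most frequent
    terms (excluding the query's own terms) to the query.
    Real systems weight candidate terms with models such as Bo1
    rather than raw counts."""
    counts = Counter()
    for doc in ranked_docs[:k]:
        counts.update(doc.split())
    query_terms = set(query.split())
    expansion = [t for t, _ in counts.most_common()
                 if t not in query_terms][:n_terms]
    return query + " " + " ".join(expansion)

# Toy usage: the feedback set is dominated by spider-related terms.
docs = ["spider web species europe",
        "spider species venom",
        "spider species europe"]
print(prf_expand("spiders", docs, k=3, n_terms=2))
```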
Search Result Diversification
● A search engine may return a top-5 ranking that is not diverse for an ambiguous query; a re-ranking step produces a diverse top 5.
● Re-ranking approaches: MMR, IA-Select, xQuAD.
Query Expansion and Search Result Diversification?
● Does Query Expansion help retrieve diverse search results?
● If not, can it be adapted to do so?
● Query Expansion can fail for difficult queries, and ambiguous queries are difficult!
● In this particular scenario, we identify two problems:
– Incoherence
– Bias
Incoherence
● Ambiguous queries result in incoherent feedback sets.
● Query Expansion techniques tend to select terms that are meaningful to the feedback set as a whole.
● This may end up selecting excessively general terms for the expanded query.
Bias (I)
● The feedback set may be biased towards documents covering a single, dominant subtopic.
● Terms important to marginal subtopics may never be selected.
● The retrieval performance may be improved, but the subtopic coverage may be degraded.
Bias (II)
● Example: query 79 in the TREC 2010 Web Track, "voyager".
● In ClueWeb09b, subtopic s2 has 70 relevant documents, while subtopic s3 has only 9.
Bias (III)
● If we use all the relevant documents in ClueWeb09 to expand the query, the result is identical to expanding with the documents of the dominant subtopic s2 alone:
q*_all = q*_2 = "voyager spacecraft saturn jupiter solar interstellar"
q*_3 = "voyager trek maqui borg janeway star uss quadrant"
● Result:

                  nrel@20(s2)   nrel@20(s3)
  q                    2             0
  q*_all, q*_2        17             0
  q*_3                 1             7
Selection of Expansion Terms (I)
● We propose to identify and select "good" terms.
● The procedure is the following:
1. Identify groups of documents covering the same subtopic.
2. Generate a local expanded query for each feedback group.
3. Select terms from those local expanded queries so that subtopic coverage is maximized with minimum redundancy.
Selection of Expansion Terms (II)
● We adapt the xQuAD algorithm (originally a document selection approach) to the term selection problem.
● We call the resulting approach tsxQuAD.
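The xQuAD-style greedy selection can be sketched for terms as follows. This is a simplified illustration of the tsxQuAD idea, not the paper's exact estimators: each selected term reduces the "novelty" of the subtopics it covers, so later picks favour still-uncovered subtopics. The per-subtopic term importances and subtopic weights are assumed inputs (e.g. derived from the local expanded queries).

```python
def ts_xquad(term_scores, subtopic_weights, n_terms, lam=1.0):
    """Greedy xQuAD-style term selection (a sketch).
    term_scores: {subtopic: {term: importance of the term in that
    subtopic's local expanded query, in [0, 1]}}.
    subtopic_weights: {subtopic: P(subtopic | query)}.
    At each step, pick the term that best covers subtopics not yet
    covered by the already selected terms."""
    subtopics = list(term_scores)
    candidates = {t for scores in term_scores.values() for t in scores}
    # novelty[s]: probability that subtopic s is still "unsatisfied"
    novelty = {s: 1.0 for s in subtopics}
    selected = []
    while candidates and len(selected) < n_terms:
        def score(t):
            # relevance part: best importance across subtopics
            rel = max(term_scores[s].get(t, 0.0) for s in subtopics)
            # diversity part: weighted coverage of unsatisfied subtopics
            div = sum(subtopic_weights[s]
                      * term_scores[s].get(t, 0.0) * novelty[s]
                      for s in subtopics)
            return (1 - lam) * rel + lam * div
        best = max(candidates, key=score)
        candidates.remove(best)
        selected.append(best)
        for s in subtopics:
            novelty[s] *= 1.0 - term_scores[s].get(best, 0.0)
    return selected

# Toy "voyager" example with two subtopics and made-up importances:
scores = {"s2": {"spacecraft": 0.95, "saturn": 0.8},
          "s3": {"trek": 0.9, "maqui": 0.7}}
weights = {"s2": 0.5, "s3": 0.5}
print(ts_xquad(scores, weights, 3))
```

With λ = 1.0 (as in the experiments later), the selection is driven purely by the diversity term, which is why the picks alternate between subtopics instead of exhausting the dominant one first.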
Selection of Expansion Terms (III)
● Going back to the "voyager" example:
q*_xQuAD = "voyager trek spacecraft maqui saturn nasa"
● The expanded query contains terms from both subtopics.
● The subtopic coverage is improved:

                  nrel@20(s2)   nrel@20(s3)
  q*_xQuAD             6             4
Research Questions
● RQ1: What is the effect of state-of-the-art query expansion from pseudo-relevance feedback in terms of diversity metrics?
● RQ2: How does tsxQuAD perform in terms of ad-hoc retrieval and diversity compared to existing query expansion approaches?
Experimental Setup
● Context: diversity task of the TREC 2009, 2010 and 2011 Web Tracks.
– Corpus: ClueWeb09 Category B.
– 150 queries with 3 to 8 subtopics.
● Terrier for indexing and retrieval:
– Retrieval models: BM25, DPH, TF-IDF, PL2.
– Query Expansion techniques: Bo1, Bo2 and KL.
– Ad-hoc metrics: MAP, nDCG.
– Diversity metrics: α-nDCG, ERR-IA, S-recall.
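To make the diversity metrics concrete, here is a minimal sketch of α-DCG, the core of α-nDCG: a document's gain for a subtopic decays by (1 − α) each time that subtopic has already been covered higher in the ranking, so redundant rankings score lower. α-nDCG additionally normalizes by the α-DCG of a greedily built ideal ranking, which is omitted here for brevity.

```python
from math import log2

def alpha_dcg(ranking, alpha=0.5, k=20):
    """alpha-DCG@k for a ranking given as a list of sets, each set
    holding the subtopics the document at that rank covers.
    The gain for a subtopic seen c times before is (1 - alpha)^c."""
    seen = {}        # subtopic -> times covered so far
    total = 0.0
    for i, subtopics in enumerate(ranking[:k]):
        gain = sum((1 - alpha) ** seen.get(s, 0) for s in subtopics)
        total += gain / log2(i + 2)   # rank discount, ranks are 1-based
        for s in subtopics:
            seen[s] = seen.get(s, 0) + 1
    return total

# A diverse top 2 beats a redundant one covering the same subtopic twice:
print(alpha_dcg([{"s2"}, {"s3"}]), alpha_dcg([{"s2"}, {"s2"}]))
```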
RQ1: Experiment
What is the effect of state-of-the-art query expansion from PRF in terms of diversity metrics?
● We evaluate query expansion techniques in a pseudo-relevance feedback setting.
● We expand queries using the first 5 and 10 documents retrieved for the original query.
RQ1: Results
[Bar chart: MAP@20, nDCG@20, α-nDCG@20 and S-recall@20 for the original query vs. Bo1 expansion, with BM25.]
RQ2: Experiment (I)
How does tsxQuAD perform in terms of ad-hoc retrieval and diversity compared to existing query expansion approaches?
● We consider a relevance feedback setting where feedback from the assessors for a given query is used to generate an expanded query.
● We simulate a situation where users provide feedback for their interpretation of the query:
– The problem lies in combining different sources referring to possibly more than one subtopic.
– We assume complete information about the subtopics each document covers.
RQ2: Experiment (II)
● We compare our proposed tsxQuAD (λ = 1.0), built on top of different retrieval and query expansion models, with their standard variants.
● Feedback documents are extracted from the TREC relevance judgments with the following constraints:
– Residual evaluation method.
– A similar number of documents for each subtopic in feedback and evaluation.
– We chose subtopics with at least 6 relevant documents.
RQ2: Results
[Bar chart: MAP@20, nDCG@20, α-nDCG@20 and S-recall@20 for the original query, Bo1 and Bo1-tsxQuAD, with BM25.]
Conclusions
● We have analyzed the suitability of query expansion techniques for search result diversification.
● We have proposed a term selection strategy to improve the diversity of expanded queries.
● A thorough evaluation shows that it improves the diversity of the search results at a negligible cost in terms of ad-hoc relevance.
● Future work: apply tsxQuAD to the pseudo-relevance feedback scenario.
Thanks for your attention! Questions?