V.3 Query Processing 1. Term-at-a-Time 2. Document-at-a-Time 3. - PowerPoint PPT Presentation

      V.3 Query Processing 1. Term-at-a-Time 2. Document-at-a-Time 3. WAND 4. Quit & Continue 5. Buckley’s Algorithm 6. Fagin’s Threshold Algorithms 7. Query Processing with Importance Scores 8. Query Processing with Champion Lists   Based on MRS Chapter 7 and RBY Chapter 9 IR&DM ’13/’14 ! 49

      Query Types • Conjunctive   (i.e., all query terms are required) • Disjunctive   (i.e., subset of query terms sufficient) • Phrase or proximity   (i.e., query terms must occur in right order or close enough) • Mixed-mode with negation   (e.g., “ harry potter” review + movie - book ) • Combined with ranking of result documents according to   X score ( q, d ) = score ( t, d ) t ∈ q with score ( t , d ) depending on retrieval model (e.g., tf . idf t , d ) IR&DM ’13/’14 ! 50

Inverted Index alf d 123 , 2, [4, 14] d 133 , 1, [47] d 266 , 3, [1, 9, 20] ben d 123 , 2, [6, 22] d 133 , 1, [66] d 268 , 3, [1, 4, 23] gil d 567 , 2, [7, 99] d 136 , 1, [22] d 233 , 3, [5, 12, 23] willow d 144 , 2, [5, 19] d 177 , 1, [55] d 244 , 3, [7, 11,22] yeast d 234 , 2, [8, 17] d 299 , 1, [26] d 999 , 3, [5, 66, 7] zoo d 888 , 2, [7, 77] d 889 , 1, [23] d 890 , 3, [1, 9, 20] • Document-ordered or score-ordered posting lists • Posting lists with skip pointers allow for faster traversal IR&DM ’13/’14 ! 51

Overview of Query Processing Methods • Holistic query processing methods determine whole query result • Term-at-a-Time • Document-at-a-Time   • Top- k query processing methods determine top- k query result • WAND • Quit & Continue • Fagin’s Threshold Algorithms   • Opportunities for optimization over naïve merge & sort baseline • skipping in document-ordered posting lists • early termination of query processing for score-ordered posting lists IR&DM ’13/’14 ! 52

  1. Term-at-a-Time Query Processing • Term-at-a-Time (T AA T) query processing • reads posting lists for query terms ⟨ t 1 , …, t | q | ⟩ successively • maintains an accumulator for each result document with value   X after the first j posting lists have been read acc ( d ) = score ( t i , d ) i ≤ j Accumulators ! d 1 : 0.0 a d 1 , 1.0 d 4 , 2.0 d 7 , 0.2 d 8 , 0.1 ! d 4 : 0.0 b d 7 : 0.0 d 4 , 1.0 d 7 , 2.0 d 8 , 0.2 d 9 , 0.1 ! d 8 : 0.0 c d 4 , 3.0 d 7 , 1.0 d 9 : 0.0 ! • required memory depends on the number of accumulators maintained • top- k results can be determined by sorting accumulators at the end IR&DM ’13/’14 ! 53

V.3 Query Processing 1. Term-at-a-Time 2. Document-at-a-Time 3. - PowerPoint PPT Presentation

V.3 Query Processing 1. Term-at-a-Time 2. Document-at-a-Time 3. WAND 4. Quit & Continue 5. Buckleys Algorithm 6. Fagins Threshold Algorithms 7. Query Processing with Importance Scores 8. Query Processing with Champion

Improve Query Performance with the Query Log Analyzer Kees Vegter Field Engineer Query Log

CS4224/CS5424 Lecture 9 Distributed Query Processing Query Processing Translates query into a

Query Processing Relevance feedback; query expansion; Web Search 1 Overview Indexes Query

Query Execution 2 and Query Optimization Instructor: Matei Zaharia cs245.stanford.edu Query

Chapter 3: Top-k Query Processing and Indexing 3.1 Top-k Algorithms 3.2 Approximate Top-k Query

Query Understanding: A Manifesto Daniel Tunkelang queryunderstanding.com Overview What is

Perfect Query FORMULA 5 critical sections in every successful query letter (c) 2019

Query Op)miza)on 1 Query op)miza)on Given an SQL query,

Online Query Processing Exposure to online query processing algorithms and fundamentals A

Efficient query processing Efficient scoring, distributed query processing Web Search 1 Ranking

Query Processing Query Processing Steps balance < 2500 ( balance ( account)) balance

A Generic Mapping-based Query Translation A Generic Mapping-based Query Translation from SPARQL

Information Retrieval > Query Us User er Query Words Query Words Search Personalization

Module 13: Optimizing Query Performance Overview Introduction to the Query Optimizer

CAS CS 460/660 Introduction to Database Systems Query Evaluation II 1.1 Cost-based Query

Query Execuon Declarave Query (SQL) We start from

Compilers and computer architecture: A realistic compiler to RISC-V Martin Berger 1 November /

CSE 341: Programming Languages Spring 2007 Lecture 5 Type synonyms, more pattern-matching,

RSA Accumulator Oct 29, 2019 Overview Definitions modulus math RSA Accumulator

Accumulators & Difference Lists Accumulators & Difference Lists York University CSE 3401

loop unrolling (ASM) loop : cmpl %edx, %esi jle endOfLoop addq (%rdi,%rdx,8), %rax incq

Spark Stony Brook University CSE545, Spring 2019 Situations where MapReduce is not efficient

Professor: Alvin Chao CS149 Array Activities int[ ] nums = {10, 3, 7, -5}; nums 10 3

Accumulating results Contents Three ways of computing results during re- cursion Collecting

V.3 Query Processing 1. Term-at-a-Time 2. Document-at-a-Time 3. - PowerPoint PPT Presentation

V.3 Query Processing 1. Term-at-a-Time 2. Document-at-a-Time 3. WAND 4. Quit & Continue 5. Buckleys Algorithm 6. Fagins Threshold Algorithms 7. Query Processing with Importance Scores 8. Query Processing with Champion

Improve Query Performance with the Query Log Analyzer Kees Vegter Field Engineer Query Log

CS4224/CS5424 Lecture 9 Distributed Query Processing Query Processing Translates query into a

Query Processing Relevance feedback; query expansion; Web Search 1 Overview Indexes Query

Query Execution 2 and Query Optimization Instructor: Matei Zaharia cs245.stanford.edu Query

Chapter 3: Top-k Query Processing and Indexing 3.1 Top-k Algorithms 3.2 Approximate Top-k Query

Query Understanding: A Manifesto Daniel Tunkelang queryunderstanding.com Overview What is

Perfect Query FORMULA 5 critical sections in every successful query letter (c) 2019

Query Op)miza)on 1 Query op)miza)on Given an SQL query,

Online Query Processing Exposure to online query processing algorithms and fundamentals A

Efficient query processing Efficient scoring, distributed query processing Web Search 1 Ranking

Query Processing Query Processing Steps balance &lt; 2500 ( balance ( account)) balance

A Generic Mapping-based Query Translation A Generic Mapping-based Query Translation from SPARQL

Information Retrieval &gt; Query Us User er Query Words Query Words Search Personalization

Module 13: Optimizing Query Performance Overview Introduction to the Query Optimizer

CAS CS 460/660 Introduction to Database Systems Query Evaluation II 1.1 Cost-based Query

Query Execu*on Declara*ve Query (SQL) We start from

Compilers and computer architecture: A realistic compiler to RISC-V Martin Berger 1 November /

CSE 341: Programming Languages Spring 2007 Lecture 5 Type synonyms, more pattern-matching,

RSA Accumulator Oct 29, 2019 Overview Definitions modulus math RSA Accumulator

Accumulators &amp; Difference Lists Accumulators &amp; Difference Lists York University CSE 3401

loop unrolling (ASM) loop : cmpl %edx, %esi jle endOfLoop addq (%rdi,%rdx,8), %rax incq

Spark Stony Brook University CSE545, Spring 2019 Situations where MapReduce is not efficient

Professor: Alvin Chao CS149 Array Activities int[ ] nums = {10, 3, 7, -5}; nums 10 3

Accumulating results Contents Three ways of computing results during re- cursion Collecting

Query Processing Query Processing Steps balance < 2500 ( balance ( account)) balance

Information Retrieval > Query Us User er Query Words Query Words Search Personalization

Query Execuon Declarave Query (SQL) We start from

Accumulators & Difference Lists Accumulators & Difference Lists York University CSE 3401