LING 573 D3: Query Expansion with Deep Processing


  1. LING 573 D3: Query Expansion with Deep Processing. Melanie Bolla, Woodley Packard, and T.J. Trimble

  2. System Architecture: Questions -> Input Processing -> Indri IR via Condor -> Output Processing -> Answers

  3. System Architecture: Questions -> Input Processing -> Indri IR via Condor -> Output Processing -> Answers

  4. Input Processing: Questions -> Coreference Resolution, WordNet Query Attributes, Declarative Reformulation

  5. Input Processing: Questions -> Coreference Resolution, WordNet Query Attributes, Declarative Reformulation

  6. Coreference Resolution • Intuition: Replace pronominal or underspecified references with their antecedents • Do some cleanup • System: Stanford CoreNLP dcoref • Rule-based sieve architecture for coreference resolution • Implementation: Parallelization via Condor • Improvements!

  7. Coreference Resolution pipeline: Questions -> “Documents” -> CoreNLP via Condor -> Coreference-resolved results -> Questions
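
A minimal sketch of what the CoreNLP-via-Condor step could look like for one "document" (target plus question series), assuming a local Stanford CoreNLP installation at a hypothetical path; the slides do not give the exact annotator list or the Condor wrapping, so both are guesses here.

```python
import subprocess

CORENLP_HOME = "/path/to/stanford-corenlp"  # hypothetical install location

def run_dcoref(doc_path):
    """Annotate one target + question-series document with Stanford CoreNLP's
    rule-based dcoref sieve; CoreNLP writes an XML file next to the input
    whose coreference section lists the mention chains."""
    subprocess.check_call([
        "java", "-Xmx4g", "-cp", CORENLP_HOME + "/*",
        "edu.stanford.nlp.pipeline.StanfordCoreNLP",
        "-annotators", "tokenize,ssplit,pos,lemma,ner,parse,dcoref",
        "-file", doc_path,
    ])
```

Each such invocation can then be farmed out as one Condor job per target.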

  8. Coreference Resolution • Document = target + question series • Coreference resolution is done over the document
     • Before: Bing Crosby. What was his profession? For which movie did he win an Academy Award? What was his nickname? What is the title of his all-time best-selling record? He is an alumnus of which university? How old was Crosby when he died?
     • After: Bing Crosby. What was Bing Crosby's profession? For which movie did he win an Academy Award? What was Bing Crosby's nickname? What is the title of Bing Crosby's all-time best-selling record? He is an alumnus of which university? How old was Crosby when he died?

  9. Coreference Resolution • Query Formulation: • Get replacements from dcoref • Do replacements over question file, with some additional cleaning (possessives, etc.) • Submit to Indri using #4(q)
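
As a rough illustration of this query-formulation step, the sketch below takes a coreference-resolved question, strips punctuation and possessive markers, and wraps the terms in Indri's ordered-window operator #4(...) as the slide says; the cleanup shown is a simplified stand-in for the system's actual regex replacements.

```python
import re

def indri_query(resolved_question, window=4):
    """Turn a coreference-resolved question into an Indri
    #window(...) ordered-window query string."""
    # crude cleanup: drop possessive 's and punctuation (simplified)
    text = re.sub(r"'s\b", "", resolved_question)
    terms = re.findall(r"[A-Za-z0-9]+", text)
    return "#%d(%s)" % (window, " ".join(terms))

# e.g. indri_query("What was Bing Crosby's profession?")
#   -> "#4(What was Bing Crosby profession)"
```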

  10. Coreference Resolution • Results: • Initial Results: • Baseline: • Lenient: 0.2390; Strict: 0.1525 • Coref: • Lenient: 0.2013; Strict: 0.1339

  11. Coreference Resolution • Results: • Initial Results: • Baseline: • Lenient: 0.2390; Strict: 0.1525 • Coref: • Lenient: 0.2013; Strict: 0.1339 • -_-`

  12. Coreference Resolution • Error Analysis: • Problematic resolutions: • What is Crosby’s nickname? • What is Crosby’s wife’s name? • -> What is What is Crosby’s nickname’s wife’s name? • Due to overzealous resolution in the face of impaired punctuation • Not very good regex replacement

  13. Coreference Resolution • Fixes (post-deadline): • Constrain replacements to only “the best”: • avoid extraneous determiner additions • make sure possessives line up right • enforce only adding content • etc. (sketched below) • On devtest: reduction in replacement candidates from about 160 to 72
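
The constraints above might translate into something like the illustrative filter below; the actual rules are not spelled out on the slide, so the specifics (pronoun list, content check) are assumptions.

```python
PRONOUNS = {"he", "she", "it", "they", "him", "them",
            "his", "her", "its", "their"}

def keep_replacement(mention, antecedent):
    """Illustrative filter in the spirit of 'only the best' replacements:
    reject pronoun antecedents, identical strings, and swaps that would
    not add any new content."""
    m, a = mention.lower().strip(), antecedent.lower().strip()
    if a in PRONOUNS:                      # never replace with a pronoun
        return False
    if a == m:                             # nothing gained
        return False
    m_words = set(m.replace("'s", "").split())
    a_words = set(a.replace("'s", "").split())
    return bool(a_words - m_words)         # must add content

# keep_replacement("his", "he")          -> False
# keep_replacement("his", "Bing Crosby") -> True
```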

  14. Coreference Resolution • Results: • Baseline: Lenient: 0.2390; Strict: 0.1525 • Coref: Lenient: 0.2013; Strict: 0.1339 • Baseline Improved: • Lenient: 0.2618; Strict: 0.1813 • Coref Improved (post-deadline): • Lenient: 0.2780; Strict: 0.1868

  15. Coreference Resolution • Future Work: • What if coreference fed into declaratives? • Where did Moon play in college? • Where did Warren Moon play in college? • Warren Moon played in college.

  16. Input Processing: Questions -> Coreference Resolution, WordNet Query Attributes, Declarative Reformulation

  17. WordNet Related Nouns • Insert “related nouns” of adjectives in WordNet into the bag-of-words query • Intuition: “how tall” -> “height” • Initial drop in score • Baseline: Lenient: 0.2390; Strict: 0.1525 • Initial: Lenient: 0.2278; Strict: 0.1512
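
The "related nouns" here correspond to WordNet's attribute link from adjective senses to noun senses; a minimal sketch using NLTK's WordNet interface (one way to reach that relation, not necessarily the library the system used) is:

```python
from nltk.corpus import wordnet as wn

def related_nouns(adjective):
    """Collect the noun 'attributes' of an adjective's WordNet senses,
    e.g. 'tall' -> includes 'height'. No WSD is done: every sense
    contributes, which is how junk like 'many' -> 'numerousness' gets in."""
    nouns = set()
    for syn in wn.synsets(adjective):
        if syn.pos() not in ("a", "s"):        # keep adjective senses only
            continue
        for attr in syn.attributes():          # adjective -> noun attribute link
            nouns.update(l.name().replace("_", " ") for l in attr.lemmas())
    return nouns

# the returned nouns are then appended to the bag-of-words query
```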

  18. WordNet Related Nouns • Error Analysis: • Some words had terrible attributes: • “current” -> “currentness, currency, up-to-dateness” • “other” -> “otherness, distinctness, separateness” • “many” -> “numerousness, numerosity, multiplicity”

  19. WordNet Related Nouns • Removed “many”: • Baseline: • Lenient: 0.2390; Strict: 0.1525 • Initial: • Lenient: 0.2278; Strict: 0.1512 • Removed “many”: • Lenient: 0.2378; Strict: 0.1563

  20. Input Processing: Questions -> Coreference Resolution, WordNet Query Attributes, Declarative Reformulation

  21. Declarative Reformulation • Intuition: documents have statements, not questions; shallow reformulation stinks • Declarative Reformulation using the ERG • Parse question into flat semantic representation, MRS • Fiddle with MRS • Generate with ERG • Improvements!

  22. Declarative Reformulation • Input: • What position did Moon play in professional football? • Where did Moon play in college? • Output: • A position did moon play in professional football. • Moon played in college.

  23. Declarative Reformulation pipeline: Questions -> Parse with ERG via ACE on Condor -> Reform MRS -> Generate with ERG on Condor -> Reformed Questions
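
A bare-bones sketch of this pipeline, shelling out to ACE, is given below. It assumes ace is on PATH and a compiled ERG grammar image at erg.dat (hypothetical path), leaves the interesting part, the MRS rewriting, to a caller-supplied function, and omits the Condor wrapping; exact output handling depends on ACE version and flags.

```python
# Sketch of the parse -> reform -> generate loop with ACE + ERG.
# Assumptions (not from the slides): 'ace' is on PATH, the ERG image is
# at ./erg.dat, and the caller supplies extract_mrs() (pulls a SimpleMRS
# string out of ACE's parse output) and reform_mrs() (rewrites it into a
# declarative MRS). Condor job splitting is omitted.
import subprocess

GRAMMAR = "erg.dat"  # hypothetical path to the compiled ERG image

def run_ace(extra_args, text):
    """Pipe text through one ACE invocation and return its stdout."""
    proc = subprocess.run(["ace", "-g", GRAMMAR] + extra_args,
                          input=text, capture_output=True, text=True)
    return proc.stdout

def reformulate(question, extract_mrs, reform_mrs):
    parse_out = run_ace([], question + "\n")            # parse mode
    mrs = extract_mrs(parse_out)                        # caller-supplied helper
    if not mrs:
        return question                                 # no parse: keep the question
    gen_out = run_ace(["-e"], reform_mrs(mrs) + "\n")   # generation mode
    realizations = [l for l in gen_out.splitlines() if l.strip()]
    return realizations[0] if realizations else question
```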

  24. Declarative Reformulation • Baseline: • Lenient: 0.2618; Strict: 0.1813 • Declaratives: • Lenient: 0.2695; Strict: 0.1905

  25. System Architecture: Questions -> Input Processing -> Indri IR via Condor -> Output Processing -> Answers

  26. Answer Processing • Choosing better snippets • Starting from the center of the document seemed to work the best • This might be overfitting … • Baseline: • Lenient: 0.2390; Strict: 0.1525 • Improvement: • Lenient: 0.2695; Strict: 0.1905
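
The "start from the center" heuristic amounts to something like the small sketch below; 250 characters is the length mentioned on the Successes slide, and the whitespace handling is an assumption.

```python
def center_snippet(doc_text, length=250):
    """Take a fixed-length snippet centered on the middle of the
    retrieved document text."""
    text = " ".join(doc_text.split())          # collapse whitespace
    if len(text) <= length:
        return text
    mid = len(text) // 2
    start = max(0, mid - length // 2)
    return text[start:start + length]
```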

  27. Answer Processing • Remove HTML • 2 lines of code with NLTK • Baseline: • Lenient: 0.2621; Strict: 0.1835 • Improvement: • Lenient: 0.2642; Strict: 0.1881
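
The two NLTK lines were presumably built on nltk.clean_html(), which existed in NLTK 2.x but was removed in NLTK 3 (which points users at BeautifulSoup instead); both variants are shown, with the second as the modern equivalent.

```python
# Variant 1: the NLTK 2.x one-liner (removed in NLTK 3)
# import nltk
# text = nltk.clean_html(html)

# Variant 2: equivalent with BeautifulSoup, for current environments
from bs4 import BeautifulSoup

def strip_html(html):
    """Drop markup and return the visible text of a retrieved passage."""
    return BeautifulSoup(html, "html.parser").get_text(" ", strip=True)
```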

  28. MRS matching • Match question to answer based on MRS graph structure • Big improvement! • Baseline: • Lenient: 0.2695; Strict: 0.1905 • MRS-matching: Lenient: 0.3263; Strict: 0.2452 • Post-deadline: Lenient: 0.3317; Strict: 0.2564
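
The slides do not detail the matching algorithm, so the sketch below is only the simplest possible stand-in: score each candidate answer by the overlap between the multisets of MRS predicate names from the question and from the candidate (however your MRS toolchain extracts them), and rank candidates by that score.

```python
from collections import Counter

def mrs_overlap(question_preds, answer_preds):
    """Crude stand-in for MRS-based matching: Jaccard-style overlap
    between multisets of predicate names (e.g. '_play_v_1') taken from
    the question MRS and a candidate answer MRS."""
    q, a = Counter(question_preds), Counter(answer_preds)
    shared = sum((q & a).values())
    total = sum((q | a).values())
    return shared / total if total else 0.0

def rank_candidates(question_preds, candidates):
    """candidates: list of (snippet, predicate_list) pairs, best first."""
    return sorted(candidates,
                  key=lambda c: mrs_overlap(question_preds, c[1]),
                  reverse=True)
```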

  29. Results (devtest). Bold: D3 final score; italics: best score.

      Test                          Lenient Score   Strict Score   IR Recall
      Baseline                      0.1319          0.0753         ?
      Baseline Improved (B)         0.2618          0.1813         67.5 / 55.6
      B + Declarative (D)           0.2695          0.1905         68.4 / 57.1
      B + WordNet Attributes (W)    0.2545          0.1743         66.5 / 54.6
      B + Coreference (C)           0.2780          0.1868         ?
      D3: B + D + W                 0.2622          0.1835         67.5 / 56.1
      B + W + C                     0.2706          0.1853         ?
      B + D + W + C                 0.2642          0.1881         ?

  30. Results (devtest) … with MRS matching. Bold: D3 final score; italics: best score.

      Test                            Lenient Score   Strict Score
      Baseline Improved (B)           0.3209          0.2379
      B + Declarative (D)             0.3263          0.2452
      B + WordNet Attributes (W)      0.3216          0.2398
      Baseline + Coreference (C)      0.3343          0.2445
      D3: B + D + W                   0.3269          0.2471
      Post-deadline: B + D + W + C    0.3453          0.2565

  31. Issues • Indri • Finding the best/proper Indri Query Language operators • WordNet • WSD, weird relationships • Coreference • Match-happy system

  32. Successes • Taking 250 characters from the middle of the snippet • Constraining Coreference Resolution • Declarative Reformulation • HTML cleaning • MRS-based matching

  33. Influential Related Reading • ERG and MRS: Copestake 2000, Copestake 2002, Flickinger 2003, Copestake 2005 • WordNet: ? class 10 slide 6 • Coreference Resolution: Raghunathan et al., 2010, etc. • Class reading on Indri: http://sourceforge.net/p/lemur/wiki/Home/
