Informa(on Retrieval as Sta(s(cal Transla(on Presented - PowerPoint PPT Presentation

Informa(on ¡Retrieval ¡as ¡ ¡ Sta(s(cal ¡Transla(on ¡ Presented ¡by: ¡Lin ¡Gong ¡

Introduc(on ¡ How ¡do ¡people ¡search ¡a ¡query? ¡ Ideal ¡ Informa(on ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ Query ¡ document ¡ need ¡ segment ¡ Query ¡genera(on! ¡ -‑> ¡Find ¡the ¡most ¡likely ¡documents ¡given ¡the ¡query. ¡

A ¡Closer ¡Look ¡ maximize ¡ query ¡ ¡ By ¡Baye’s ¡law: ¡ ¡

Main ¡Idea ¡ The ¡language ¡modeling ¡approach ¡is ¡novel ¡and ¡mo(vated. ¡ However, ¡it ¡has ¡two ¡problems: ¡ -‑ ¡Can ¡not ¡model ¡different ¡forms ¡or ¡styles ¡of ¡queries. ¡ -‑ ¡Can ¡not ¡address ¡the ¡important ¡issues ¡of ¡synonymy ¡and ¡ polysemy. ¡ High-‑performance ¡document ¡retrieval ¡systems ¡must ¡be ¡ sophis(cated ¡enough ¡to ¡handle ¡all ¡these ¡problems. ¡ The ¡paper ¡proposes ¡a ¡new ¡probabilis(c ¡approach ¡based ¡on ¡ sta(s(cal ¡machine ¡transla(on ¡and ¡aims ¡to ¡develop ¡a ¡general ¡ sta(s(cal ¡framework ¡for ¡handling ¡these ¡issues. ¡

What ¡is ¡Sta(s(cal ¡Machine ¡Transla(on? ¡ Machine ¡transla(on: ¡ Sta(s(cal ¡transla(on ¡system: ¡

Document-‑Query ¡Transla(on ¡ Model ¡1: ¡A ¡mixture ¡model ¡ Query: ¡m ¡ Document: ¡n ¡ q1 ¡ q2 ¡ q3 ¡ …. ¡ q m ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ Document ¡: ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡d1, ¡d2, ¡d3………….. ¡dn ¡

Model ¡1: ¡A ¡Mixture ¡Model ¡ Model ¡0 ¡

Model ¡1’: ¡A ¡Binomial ¡Model ¡ Possion ¡Distribu(on: ¡

Building ¡a ¡Transla(on-‑Based ¡IR ¡System ¡ Ø Use ¡mutual ¡informa(on ¡sta(s(c ¡to ¡construct ¡an ¡ ar(ficial ¡cumula(ve ¡distribu(on ¡func(on ¡over ¡words ¡ in ¡each ¡document. ¡ Ø Use ¡EM ¡algorithm ¡of ¡three ¡itera(ons ¡to ¡fit ¡the ¡ transla(on ¡probabili(es ¡of ¡Model ¡1 ¡and ¡Model ¡1’. ¡ Ø Do ¡experiments ¡on ¡TREC ¡data. ¡

Sample ¡Transla(on ¡Probabili(es ¡ ¡ A]er ¡EM ¡Algorithm ¡

Experimental ¡Results ¡ Precision ¡and ¡recall ¡curve ¡on ¡AP. ¡ Average ¡precision: ¡19.4% ¡ ¡ Average ¡recall: ¡10% ¡ ¡ Precision ¡and ¡recall ¡curve ¡on ¡SJMN. ¡ Average ¡precision: ¡27.3% ¡ ¡ Average ¡recall: ¡22.8% ¡ ¡

Experimental ¡Results ¡ Comparison ¡between ¡two ¡and ¡three ¡itera(ons ¡of ¡EM. ¡ ¡ Documents ¡with ¡shorter ¡query ¡length. ¡ Decrease ¡in ¡performance! ¡

Experimental ¡Results ¡ Precision ¡and ¡recall ¡curve ¡on ¡SDR. ¡ Average ¡precision: ¡22.2% ¡ ¡ Average ¡recall: ¡18.4% ¡ ¡ Comparison ¡between ¡Model ¡0 ¡and ¡LM. ¡ Performance ¡is ¡similar! ¡

Conclusion ¡ Ø Propose ¡an ¡approach ¡to ¡informa(on ¡retrieval ¡with ¡ sta(s(cal ¡machine ¡transla(on. ¡ ¡ Ø Present ¡two ¡models ¡for ¡document ¡query ¡genera(on ¡ process. ¡ Ø Train ¡the ¡parameters ¡with ¡EM ¡algorithm ¡and ¡do ¡ experiments ¡on ¡TREC ¡dataset. ¡

Thanks! ¡

Informa(on Retrieval as Sta(s(cal Transla(on Presented - PowerPoint PPT Presentation

Informa(on Retrieval as Sta(s(cal Transla(on Presented by: Lin Gong Introduc(on How do people search a query? Ideal Informa(on

Address Transla+on Main Points Address Transla+on Concept

Address Transla+on Main Points Address Transla+on Concept

Sta$s$cs Sta$s$cs Fourth Dimension of a Sta$s$cal Programmer

XML Retrieval XML Retrieval XML Retrieval XML Retrieval DB/IR in DB/IR in Theory Theory Web

Informa Half Year Results Presentation 24th July 2019 Informa Stephen A. Carter, Group Chief

Cheap transla,on Automated ques,on answering Visualizing

Competence-based Curriculum Learning for Neural Machine Transla:on Anthony Platanios

Simultaneous Transla/on for Hiero Simon Fraser University

Informa(on Retrieval Introduc(on Debapriyo Majumdar Information Retrieval Spring

Retrieval by Content Part 2: Text Retrieval Term Frequency and Inverse Document Frequency

Retrieval by Content Image Retrieval Image Retrieval Problem Large Image and video data sets

Information Retrieval Introducing Information Retrieval and Web Search Information Retrieval

CS54701: Information Retrieval CS-54701 Information Retrieval Retrieval Models: Language models

Retrieval Models: Outline CS490W: Web I nformation Search & Management Retrieval Models

Model Divergence Retrieval LM, session 10 CS6200: Information Retrieval Slides by: Jesse

F orwa rd L ooking Sta te me nt Ce rta in o f the sta te me nts ma de in this Pre se nta tio

Document Vectors in the Wild: Building a Content Recommendation System for Reuters.com James

THE RESEARCH PRACTICE GAP: A COMPARATIVE PERSPECTIVE Ileana Steccolini Newcastle University

on FPGA Shuyi Chen Lizi George Kelly Ran Outline Motivation System Architecture

Attorney Generals D Direct ctive 2 2015-1 Police Body W Worn Cameras a and Stored B Body

Imagine a language learning platform that helps you remember every new word http://lexicum.net

Mobile Libraries & Information Needs in Refugee Camps Allison Easton & Katherine Wells

CULTURAL CONNECTIVITY THROUGH FILM LITERACY ( DEMONSTRATION ON THE USE OF VIRTUAL SENTRO RIZAL )

Academic Writing Digital Media, Culture and Politics MKAD01 Autumn 2010 September 14th, 10:15

Sambuz

Useful Links

Newsletter

Mail Us

Informa(on Retrieval as Sta(s(cal Transla(on Presented - PowerPoint PPT Presentation

Informa(on Retrieval as Sta(s(cal Transla(on Presented by: Lin Gong Introduc(on How do people search a query? Ideal Informa(on

Address Transla+on Main Points Address Transla+on Concept

Address Transla+on Main Points Address Transla+on Concept

Sta$s$cs Sta$s$cs Fourth Dimension of a Sta$s$cal Programmer

XML Retrieval XML Retrieval XML Retrieval XML Retrieval DB/IR in DB/IR in Theory Theory Web

Informa Half Year Results Presentation 24th July 2019 Informa Stephen A. Carter, Group Chief

Cheap transla,on Automated ques,on answering Visualizing

Competence-based Curriculum Learning for Neural Machine Transla:on Anthony Platanios

Simultaneous Transla/on for Hiero Simon Fraser University

Informa(on Retrieval Introduc(on Debapriyo Majumdar Information Retrieval Spring

Retrieval by Content Part 2: Text Retrieval Term Frequency and Inverse Document Frequency

Retrieval by Content Image Retrieval Image Retrieval Problem Large Image and video data sets

Information Retrieval Introducing Information Retrieval and Web Search Information Retrieval

CS54701: Information Retrieval CS-54701 Information Retrieval Retrieval Models: Language models

Retrieval Models: Outline CS490W: Web I nformation Search &amp; Management Retrieval Models

Model Divergence Retrieval LM, session 10 CS6200: Information Retrieval Slides by: Jesse

F orwa rd L ooking Sta te me nt Ce rta in o f the sta te me nts ma de in this Pre se nta tio

Document Vectors in the Wild: Building a Content Recommendation System for Reuters.com James

THE RESEARCH PRACTICE GAP: A COMPARATIVE PERSPECTIVE Ileana Steccolini Newcastle University

on FPGA Shuyi Chen Lizi George Kelly Ran Outline Motivation System Architecture

Attorney Generals D Direct ctive 2 2015-1 Police Body W Worn Cameras a and Stored B Body

Imagine a language learning platform that helps you remember every new word http://lexicum.net

Mobile Libraries &amp; Information Needs in Refugee Camps Allison Easton &amp; Katherine Wells

CULTURAL CONNECTIVITY THROUGH FILM LITERACY ( DEMONSTRATION ON THE USE OF VIRTUAL SENTRO RIZAL )

Academic Writing Digital Media, Culture and Politics MKAD01 Autumn 2010 September 14th, 10:15

Sambuz

Useful Links

Newsletter

Mail Us

Retrieval Models: Outline CS490W: Web I nformation Search & Management Retrieval Models

Mobile Libraries & Information Needs in Refugee Camps Allison Easton & Katherine Wells