Use of Click Data for Web Search
Tao Yang, UCSB 290N
Table of Contents
• Search engine logs
• Eye-tracking data on position bias
• Click data for ranker training [Joachims, KDD 02]
• Case study: use of click data for search ranking [Agichtein et al., SIGIR 06]
Search Logs
• Query logs recorded by search engines
• Huge amount of data: e.g., 10 TB/day at Bing
Query Session (example)
• Queries in one session: mustang … ford mustang
• Clicked results: www.fordvehicles.com/cars/mustang, en.wikipedia.org/wiki/Ford_Mustang
• "Also Try" suggestion: www.mustang.com
Query Sessions and Analysis
A session decomposes hierarchically:
• Mission level: one or more missions per session
• Query level: the queries issued within each mission
• Click level: the clicks on results for each query
• Eye-tracking level: gaze fixations around each click
Query–URL correlations of interest:
• Query-to-pick
• Query-to-query
• Pick-to-pick
Examples of Behavior Analysis with Search Logs
• Query–pick (click) analysis
• Session detection
• Classification: given x_1, x_2, …, x_N, predict a label y — e.g., whether the session has a commercial intent
• Sequence labeling: given x_1, x_2, …, x_N, predict labels y_1, y_2, …, y_N — e.g., segment a search sequence into missions and goals
• Prediction: given x_1, x_2, …, x_{N-1}, predict y_N
• Similarity: Similarity(S_1, S_2) between two sessions
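Session detection is commonly implemented with a simple inactivity timeout; the 30-minute threshold below is a widely used heuristic, not a value from these slides. A minimal sketch:

```python
from datetime import datetime, timedelta

# Inactivity threshold for splitting sessions (a common heuristic;
# the 30-minute value is an assumption, not from the slides).
SESSION_TIMEOUT = timedelta(minutes=30)

def split_sessions(events):
    """Group one user's time-ordered (timestamp, query) events into sessions.

    A new session starts whenever the gap since the previous event
    exceeds SESSION_TIMEOUT.
    """
    sessions, current, last_time = [], [], None
    for ts, query in events:
        if last_time is not None and ts - last_time > SESSION_TIMEOUT:
            sessions.append(current)
            current = []
        current.append((ts, query))
        last_time = ts
    if current:
        sessions.append(current)
    return sessions
```

For example, two queries five minutes apart followed by one 45 minutes later would yield two sessions.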
Query–Pick (Click) Analysis
• [Figure: search results for the query "CIKM", annotated with the number of clicks each result received]
(Slide footer: 2/23/2015, CIKM'09 Tutorial, Hong Kong, China)
Interpreting Clicks: an Example
• Clicks are good… but are these two clicks equally "good"?
• Non-clicks may have excuses: not relevant, or simply not examined
Use of Behavior Data
• Can we adapt ranking to user clicks?
• [Figure: search results annotated with the number of clicks received]
Non-trivial Cases
• Tools are needed for non-trivial cases
• [Figure: search results annotated with the number of clicks received]
Eye-tracking User Study
Eye Tracking Across Different Web Sites
• [Figure: gaze-pattern heatmaps of Google users]
Click Position Bias
• Higher positions receive more user attention (eye fixations) and clicks than lower positions.
• This holds even in the extreme setting where the order of results is reversed.
• "Clicks are informative but biased." [Joachims+ 07]
• [Figure: percentage of fixations and clicks per position, under the normal and the reversed result ordering]
Clicks as Relative Judgments for Rank Training
• "Clicked > Skipped Above" [Joachims, KDD 02]
• Example: result #5 is clicked while #2, #3, #4 above it are skipped, giving preference pairs #5 > #2, #5 > #3, #5 > #4.
• Use a Rank SVM to optimize the retrieval function over these pairs.
• Limitations: confidence of the derived judgments; little implication for user modeling
Additional Relations for Relative Relevance Judgments
• Click > skip above
• Last click > click above
• Click > earlier click
• Last click > click previous
• Click > no-click next
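The basic "Clicked > Skipped Above" strategy can be sketched as follows (function and variable names are mine, not from the paper):

```python
def click_skip_above_pairs(ranked_results, clicked):
    """Derive preference pairs via 'Clicked > Skipped Above' [Joachims, KDD 02]:
    a clicked result is preferred over every unclicked result ranked above it.
    Pairs like these feed the Rank SVM training objective."""
    clicked = set(clicked)
    pairs = []
    for i, doc in enumerate(ranked_results):
        if doc in clicked:
            for above in ranked_results[:i]:
                if above not in clicked:
                    pairs.append((doc, above))  # doc preferred over `above`
    return pairs
```

On the slide's example (results #1–#5 with clicks on #1 and #5), this yields exactly the pairs #5 > #2, #5 > #3, #5 > #4.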
Case Study: Improving Web Search Ranking by Incorporating User Behavior Information
• Eugene Agichtein, Eric Brill, Susan Dumais. SIGIR 2006. Goal: rank pages relevant for a query.
• Categories of features (signals) for web search ranking:
  – Content match: e.g., page terms, anchor text, term weights, term span
  – Document quality: e.g., web topology, spam features
• This work adds one more category: implicit user feedback from click data
Rich User Behavior Feature Space
• Observed and distributional features
  – Aggregate observed values over all user interactions for each query–result pair
  – Distributional features: deviations from the "expected" behavior for the query
• Represent user interactions as vectors in a user behavior space
  – Presentation: what a user sees before a click
  – Clickthrough: frequency and timing of clicks
  – Browsing: what users do after a click
Ranking Features (Signals)
Presentation:
• ResultPosition — position of the URL in the current ranking
• QueryTitleOverlap — fraction of query terms in the result title
Clickthrough:
• DeliberationTime — seconds between the query and the first click
• ClickFrequency — fraction of all clicks landing on the page
• ClickDeviation — deviation from the expected click frequency
Browsing:
• DwellTime — result page dwell time
• DwellTimeDeviation — deviation from the expected dwell time for the query
More Presentation Features
More Clickthrough Features
More Browsing Features
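The observed and distributional clickthrough features can be sketched roughly as below. The position-based click prior used for the "expected" frequency, and all names, are my assumptions; the paper's exact estimator is not reproduced on these slides.

```python
from collections import Counter

def click_features(click_log, query, result):
    """ClickFrequency and a rough ClickDeviation for one (query, result) pair.

    click_log: iterable of (query, result, position) click records.
    The 'expected' frequency is approximated from a global position-based
    click prior -- an assumption, not the paper's exact model.
    """
    position_clicks = Counter(pos for _, _, pos in click_log)
    total_clicks = sum(position_clicks.values())

    query_clicks = [(r, pos) for q, r, pos in click_log if q == query]
    if not query_clicks:
        return None  # features only defined for queries with click history

    # Observed: fraction of this query's clicks that land on `result`.
    click_frequency = sum(r == result for r, _ in query_clicks) / len(query_clicks)

    # Expected: average global click share of the positions at which
    # `result` received its clicks for this query.
    positions = [pos for r, pos in query_clicks if r == result]
    expected = (sum(position_clicks[p] for p in positions)
                / (len(positions) * total_clicks)) if positions else 0.0

    return {"ClickFrequency": click_frequency,
            "ClickDeviation": click_frequency - expected}
```

A result that attracts more clicks than its display position would predict gets a positive ClickDeviation, which is the intuition behind the distributional features.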
User Behavior Models for Ranking
• Use interactions from previous instances of a query
  – General-purpose (not personalized)
  – Only available for queries with past user interactions
• Three models:
  – Rerank results by number of clicks (clickthrough rate)
  – Rerank with all user behavior features
  – Integrate directly into the ranker: combine user behavior features with the other categories of ranking features (e.g., text matching)
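The first model amounts to a stable re-sort of the original ranking by historical click count; a minimal sketch (names are mine, not from the paper):

```python
def rerank_by_clicks(results, click_counts):
    """Rerank results by historical click count (Rerank-CT style):
    most-clicked first. Ties keep the original ranker's order,
    because Python's sort is stable."""
    return sorted(results, key=lambda url: -click_counts.get(url, 0))
```

Results with no recorded clicks keep their relative order at the bottom, so the original ranking still acts as the fallback signal.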
Evaluation Metrics
• Precision at K: fraction of relevant results in the top K
• NDCG at K: normalized discounted cumulative gain; top-ranked results matter most:
  N_q = M_q · Σ_{j=1..K} (2^{r(j)} − 1) / log(1 + j)
  where r(j) is the relevance grade at rank j and M_q is a per-query normalization constant
• MAP: mean average precision — for each query, the mean of the precision-at-K values computed after each relevant document is retrieved, averaged over queries
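Both measures can be implemented directly from their definitions; the log base 2 and the normalization by the ideal ordering's DCG are assumptions consistent with common NDCG formulations.

```python
import math

def ndcg_at_k(grades, k):
    """NDCG at K for a ranked list of relevance grades r(j):
    DCG = sum_{j=1..K} (2^r(j) - 1) / log2(1 + j),
    normalized by the DCG of the ideal (grade-sorted) ordering."""
    def dcg(gs):
        return sum((2 ** g - 1) / math.log2(1 + j)
                   for j, g in enumerate(gs[:k], start=1))
    ideal = dcg(sorted(grades, reverse=True))
    return dcg(grades) / ideal if ideal > 0 else 0.0

def average_precision(relevant_flags):
    """Average precision for one query: the mean of precision@j taken
    at each rank j where a relevant document appears.
    MAP is this value averaged over all queries."""
    hits, precisions = 0, []
    for j, rel in enumerate(relevant_flags, start=1):
        if rel:
            hits += 1
            precisions.append(hits / j)
    return sum(precisions) / len(precisions) if precisions else 0.0
```

A perfectly ordered list scores NDCG 1.0; a list with relevant documents at ranks 1 and 3 has AP = (1/1 + 2/3) / 2 = 5/6.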
Datasets
• 8 weeks of user behavior data from anonymized, opt-in client instrumentation
• Millions of unique queries and interaction traces
• Random sample of 3,000 queries, gathered independently of user behavior: 1,500 training, 500 validation, 1,000 test
• Explicit relevance assessments for the top 10 results of each query in the sample
Methods Compared
• Baseline search engine: content-match feature uses BM25F, a variant of the TF-IDF model
• Four ranking models compared:
  – BM25F only
  – Rerank-CT: rerank by clickthrough, for queries with sufficient historic click data
  – Rerank-All: rerank by the full user behavior model's predictions
  – BM25F+All: integrate all user behavior features directly with content match
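BM25F extends BM25 with per-field term weighting (body, title, anchor text). As background, here is a minimal sketch of the plain BM25 score it builds on; the parameter values k1 = 1.2 and b = 0.75 are common defaults, assumed rather than taken from the paper:

```python
import math

def bm25_score(query_terms, doc_terms, doc_freq, num_docs, avg_doc_len,
               k1=1.2, b=0.75):
    """Plain BM25 score of one document (a list of terms) for a query.
    doc_freq: map from term to the number of documents containing it."""
    doc_len = len(doc_terms)
    score = 0.0
    for term in query_terms:
        tf = doc_terms.count(term)
        if tf == 0:
            continue
        df = doc_freq.get(term, 0)
        idf = math.log((num_docs - df + 0.5) / (df + 0.5) + 1)  # smoothed IDF
        # Term-frequency saturation with document-length normalization.
        norm = tf * (k1 + 1) / (tf + k1 * (1 - b + b * doc_len / avg_doc_len))
        score += idf * norm
    return score
```

BM25F computes the saturated term frequency over a weighted combination of fields instead of the whole document, which lets title and anchor-text matches count more than body matches.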
Content + User Behavior: Precision at K (queries with interactions)
• [Figure: precision at K = 1, 3, 5, 10 for BM25, Rerank-CT, Rerank-All, and BM25+All; precision ranges roughly from 0.38 to 0.63]
• Result: BM25 < Rerank-CT < Rerank-All < BM25+All
Content + User Behavior: NDCG
• [Figure: NDCG at K = 1 through 10 for BM25, Rerank-CT, Rerank-All, and BM25+All; NDCG ranges roughly from 0.50 to 0.68]
• Result: BM25 < Rerank-CT < Rerank-All < BM25+All
Which Queries Benefit Most
• [Figure: histogram of query frequency and average NDCG gain, bucketed by baseline ranking quality from 0.1 to 0.6]
• Most gains are for queries with poor original ranking
Conclusions
• Incorporating user behavior into web search ranking dramatically improves relevance
• Providing rich user-interaction features directly to the ranker is the most effective strategy
• Large improvements shown for up to 50% of test queries
Full Search Engine + User Behavior: NDCG, MAP
• [Figure: NDCG at K = 1 through 10 for RN, Rerank-All, and RN+All]
• MAP results:
  – RN: 0.270; RN+All: 0.321 (gain 0.052, +19.13%)
  – BM25: 0.236; BM25+All: 0.292 (gain 0.056, +23.71%)