Automatic Search Engine Evaluation with Click-through Data Analysis
Yiqun Liu
State Key Lab of Intelligent Tech. & Sys.
Jun. 3rd, 2007
Yiqun Liu
Ph.D. student, Tsinghua University, Beijing, China. liuyiqun03@gmail.com
Recent work:
• Using query log and click-through data analysis to:
  – identify search engine users' information need types
  – evaluate search engine performance automatically
  – separate key resource pages from other pages
  – estimate Web page quality
Our Lab:
• A joint lab
• R&D support for Sogou.com, a widely-used Chinese search engine, which serves as a platform for putting research results into practice
Yiqun Liu
Ph.D. student, Tsinghua University, Beijing, China. liuyiqun03@gmail.com
• Web data cleansing
  – Using query-independent features and ML algorithms
  – 5% of Web pages can meet over 90% of users' search needs
• Query type identification
  – Identify the type of the user's information need
  – Over 80% of queries are correctly classified
• Search engine performance evaluation
  – Construct the query topic set and answer set automatically
  – Obtains evaluation results similar to manual methods, with far less time and labor
Introduction
• Lots of search engines offer services on the Web
• Search engine performance evaluation matters to:
  – Web users: over 120 million users in mainland China
  – Search advertisers: spending 5.6 billion RMB in 2007
  – Search engineers and researchers
Introduction
• Evaluation is a key issue in IR research
  – "Evaluation became central to R&D in IR to such an extent that new designs and proposals and their evaluation became one." (Saracevic, 1995)
• Cranfield-like evaluation methodology
  – Proposed by Cleverdon et al. in 1966
  – A set of query topics, their corresponding answers (usually called qrels) and evaluation metrics
  – Adopted by IR workshops such as TREC and NTCIR
Introduction
• Problems with Web IR evaluation
  – 9 person-months are required to judge one topic for a collection of 8 million documents (Voorhees, 2001)
  – Search engines (Yahoo!, Google) index over 10 billion Web documents
  – It is almost impossible to use human-assessed query and qrel sets in Web IR system evaluation
Related works
• Efforts in automatic search engine performance evaluation (Cranfield-like)
  – Considering pseudo-feedback documents as correct answers (Soboroff, 2001; Nuray, 2003)
  – Adopting query topics and qrels extracted from Web page directories such as the Open Directory Project (ODP) (Chowdhury, 2002; Beitzel, 2003)
Related works
• Efforts in automatic search engine performance evaluation (other evaluation approaches)
  – Term Relevance Sets (Trels) method: define a pre-specified list of terms relevant and irrelevant to each query (Amitay, 2004)
  – The use of click-through data: construct a unified meta-search interface to collect users' behaviour information (Joachims, 2002)
Our method
• A Cranfield-like approach
  – Accepted by major IR research efforts
  – Difficulty: annotating all correct answers automatically
• Click-through behavior analysis
  – A single user may be cheated by search spam or SEO pages
  – A user group's behavior information is more reliable
Automatic Evaluation Process
• Information need behind user queries
  – Proposed by Broder (2003)
  – Navigational type: one query has only one correct answer
  – Informational type: one query may have several correct answers
• Users behave differently depending on the type of information need
Information needs and Evaluation
• Informational queries cannot be annotated
  – People click different answers while using different search engines
[Figure: click distribution over results 1-27 for Baidu, Google, Yahoo and Sogou]
Automatic Evaluation Process
Query Set Classification
• Less Effort Assumption & n Clicks Satisfied (nCS) Evidence
  – While performing a navigational type search request, users tend to click a small number of URLs in the result list
[Figure: percentage of navigational vs. informational & transactional queries at nCS values 0.9-0.4]
Query Set Classification
• Cover Page Assumption and Top n Results Satisfied (nRS) Evidence
  – While performing a navigational type search request, users tend to click only the first few URLs in the result list
[Figure: percentage of navigational vs. informational & transactional queries at nRS values 0.9-0.5]
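To make the nCS and nRS evidences concrete, here is a minimal Python sketch of how both could be computed from a click-through log. The session representation (a list of (clicked URL, clicked rank) pairs per query session), the threshold argument n and the function names are illustrative assumptions, not the exact implementation behind these slides.

    # Each session is assumed to be a list of (clicked_url, clicked_rank) pairs
    # recorded for one user submitting the query once.

    def n_clicks_satisfied(sessions, n=2):
        """nCS: fraction of sessions in which the user clicked at most n results."""
        satisfied = sum(1 for clicks in sessions if len(clicks) <= n)
        return satisfied / len(sessions) if sessions else 0.0

    def n_results_satisfied(sessions, n=5):
        """nRS: fraction of sessions in which every click fell within the top n ranks."""
        satisfied = sum(1 for clicks in sessions
                        if clicks and all(rank <= n for _, rank in clicks))
        return satisfied / len(sessions) if sessions else 0.0

A query whose sessions mostly involve one or two clicks on top-ranked results scores high on both features, which is the behavior the slides associate with navigational queries.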
Query Set Classification
• Click Distribution (CD) Evidence
  – Proposed by Lee (Lee, 2005); also based on click-through information
  – Users tend to click the same result when issuing the same navigational type query
  – Less than 5% of informational/transactional queries have a CD value over 1/2, while 51% of navigational queries do
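A minimal sketch of one plausible way to compute the CD evidence, assuming the same session format as in the previous sketch; the exact definition in Lee (2005) may differ in detail (e.g. sessions vs. raw clicks), so treat this only as an illustration:

    from collections import Counter

    def click_distribution(sessions):
        """CD: share of all clicks for a query that fall on its single most-clicked result."""
        counts = Counter(url for clicks in sessions for url, _ in clicks)
        total = sum(counts.values())
        return max(counts.values()) / total if total else 0.0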
Query Set Classification
• A decision tree algorithm combines the nCS, nRS and CD evidences to separate navigational queries from informational/transactional ones
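A rough sketch of how a decision tree over the three evidences might look, using scikit-learn; the toy feature vectors, labels and tree depth are illustrative assumptions, not the training setup reported in this work:

    from sklearn.tree import DecisionTreeClassifier

    # Toy feature vectors [nCS, nRS, CD], computed as in the sketches above,
    # with labels 1 = navigational, 0 = informational/transactional.
    X_train = [[0.90, 0.95, 0.80], [0.85, 0.90, 0.70],
               [0.40, 0.50, 0.20], [0.30, 0.45, 0.15]]
    y_train = [1, 1, 0, 0]

    clf = DecisionTreeClassifier(max_depth=3)
    clf.fit(X_train, y_train)

    # A query with high nCS, nRS and CD values is predicted to be navigational.
    print(clf.predict([[0.88, 0.92, 0.75]]))   # -> [1]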
Answer Annotation
• Navigational type query annotation
  – Define Click Focus:
      ClickFocus(Query q, Result r) = #(Sessions of q that click r) / #(Sessions of q)
  – Annotate q with the result r whose ClickFocus value is the largest
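A minimal Python sketch of the ClickFocus computation, assuming the same per-query session format as in the earlier sketches; the function names are illustrative:

    def click_focus(sessions, result):
        """ClickFocus(q, r): fraction of q's sessions that contain a click on r."""
        hits = sum(1 for clicks in sessions if any(url == result for url, _ in clicks))
        return hits / len(sessions) if sessions else 0.0

    def best_result(sessions):
        """The clicked result with the largest ClickFocus value for this query."""
        urls = {url for clicks in sessions for url, _ in clicks}
        return max(urls, key=lambda r: click_focus(sessions, r)) if urls else None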
Answer Annotation
• Annotation Algorithm

For a given query Q in the query set and its clicked result list r1, r2, ..., rM:
  IF Q is navigational
    Find R in r1, r2, ..., rM such that ClickFocus(Q, R) = ClickDistribution(Q);
    IF CD(Q) > T1
      Annotate Q with R;
      EXIT;
    ELSE
      Q cannot be annotated;
    END IF
  ELSE   // Q is informational
    Q cannot be annotated;
  END IF
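The same rule expressed as a short Python sketch, reusing the click_focus and best_result helpers above; here CD(Q) is taken as the maximum ClickFocus over Q's clicked results, as the pseudocode implies, and the threshold value t1 = 0.5 is only an assumed placeholder:

    def annotate(query, sessions, is_navigational, t1=0.5):
        """Return the annotated answer for a navigational query, or None."""
        if not is_navigational:
            return None                       # informational queries are not annotated
        r = best_result(sessions)             # result with the largest ClickFocus
        if r is not None and click_focus(sessions, r) > t1:
            return r                          # CD(Q) exceeds the threshold T1
        return None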
Experiment Results
• Experiment data
  – Collected by Sogou.com from Jun. 2006 to Jan. 2007
  – Over 700 million query and click events in total
• Annotation experiment results
  – 5% of all annotated results were checked manually

  Period               #(Annotated queries)   #(Checked sample set)   Accuracy
  Jun. 06 - Aug. 06    13,902                 695                     98.13%
  Sept. 06 - Nov. 06   13,884                 694                     97.41%
  Dec. 06 - Jan. 07    11,296                 565                     96.64%
Experiment Results
• Performance evaluation experiment
  – 320 manually-developed queries and their corresponding answers are used in the evaluation experiment
  – The correlation between the MRR scores of the manual method and the automatic method is 0.965
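For reference, the MRR (mean reciprocal rank) scores being compared can be computed as in the generic sketch below; the variable names are illustrative and this is not the evaluation code used in the experiment:

    def mean_reciprocal_rank(result_lists, answers):
        """result_lists: one ranked URL list per query; answers: the annotated answer per query."""
        total = 0.0
        for results, answer in zip(result_lists, answers):
            rank = next((i + 1 for i, url in enumerate(results) if url == answer), None)
            total += 1.0 / rank if rank else 0.0
        return total / len(result_lists) if result_lists else 0.0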
Applications and Future works
• Choosing the correct search portal
  – Overall performance
  – Performance for queries in a certain field
• Search engine monitoring
  – Complicated computer cluster systems are used in modern search engines
  – Notify the engineers when the search engine fails (performance goes down)
Questions or comments? Thank you!