Web Search Using Mobile Cores Quantifying and Mitigating the Price - PowerPoint PPT Presentation

Web Search Using Mobile Cores Quantifying and Mitigating the Price of Efficiency Vijay Janapa Reddi Benjamin Lee Trishul Chilimbi Kushagra Vaid Engineering & Applied Science Electrical Engineering Runtime Analysis & Design Global Foundation Services Microsoft Research Harvard University Stanford University Microsoft Corporation International Symposium on Computer Architecture 22 June 2010 1

Conventional Wisdom ◦ Moore’s Law provides transistors ◦ Simple cores improve energy efficiency ◦ Parallelism recovers lost performance 2

Simple Cores ◦ Pursue aggregate throughput, energy efficiency ◦ Assume task parallelism ◦ Assume latency tolerance 3

Applications in Transition • Conventional Enterprise ◦ Process independent requests ◦ Exhibit high memory, I/O intensity ◦ Ex: web, database, Java, mail, file servers • Emerging Cloud ◦ Extract information, value from data ◦ Exhibit high compute intensity ◦ Ex: analytics, machine learning 4

Computational Intensity ◦ Microsoft Bing ranks pages with neural network ◦ RMS foreshadows future analytic workloads 5

Cloud Efficiency • Challenges ◦ Migrate computation, data to cloud ◦ Choose efficient components ◦ Understand application, component interaction • Case Study ◦ Mobile cores for efficiency, parallelism for performance? ◦ Achieve efficiency with mobile cores (Intel Atom) ◦ Quantify price of efficiency (Microsoft Bing) 6

Efficiency Atom is more energy, cost efficient than Xeon Price of Efficiency Atom limitations impact latency, relevance, flexibility Mitigating Price of Efficiency Atom over-provisioning should consider platform overheads 7

Search Architecture ◦ Rank pages using neural network ◦ Deploy on server (Xeon), mobile (Atom) processors 8

Processor Activity ◦ Compare Xeon (4-issue, OOO) and Atom (2-issue, IO) ◦ Measure µ arch activity with hardware counters 9

Processor Power ◦ Compare Xeon (15W per core) and Atom (1.5W per core) ◦ Measure processor power at voltage regulator 10

Processor Efficiency ◦ Demonstrate energy, cost efficiency with Atom ◦ Measure max QPS within QoS target 11

Efficiency Atom is more energy, cost efficient than Xeon Price of Efficiency Atom limitations impact latency, relevance, flexibility Mitigating Price of Efficiency Atom over-provisioning should consider platform overheads 12

Price of Efficiency • Latency ◦ Cut-off latency limits refinement opportunities ◦ Per query latency impacts quality-of-service • Relevance ◦ Search rank orders documents ◦ Choice, ordering of results impact relevance • Flexibility ◦ Query activity, complexity increase load ◦ Processor resources impact flexibility 13

Latency ◦ Atom increases latency average ( µ ) by 3 × ◦ Atom increases latency variance ( σ 2 ) 14

Relevance ◦ Consider choice, ordering of top N documents ◦ Atom impacts relevance under all query loads 15

Flexibility ◦ Consider activity, complexity of queries ◦ Atom harms QoS for more complex queries 16

Mitigating Price of Efficiency Efficiency Atom is more energy, cost efficient than Xeon Price of Efficiency Atom limitations impact latency, relevance, flexibility Mitigating Price of Efficiency Atom over-provisioning should consider platform overheads 17

Mitigating Price of Efficiency Mitigating Price of Efficiency • Addressing Latency & Relevance ◦ Address µ architectural limitations ◦ Integrate application-specific accelerators ◦ Manage heterogeneous servers • Addressing Flexibility ◦ Over-provision Atoms ◦ Mitigate platform overheads ◦ Integrate more cores per chip 18

Mitigating Price of Efficiency Platform Overheads ◦ Xeon: 4-core, 2-socket ◦ Atom: 2-core, 1-socket ⇒ Hyp-Atom: 8-core, 2-socket 19

Mitigating Price of Efficiency Total Cost of Ownership (TCO) ◦ Pie slice shows breakdown of TCO $ ◦ Pie size shows throughput per TCO $ 20

Mitigating Price of Efficiency Case for Integration ◦ Hyp-Atom attributes more per TCO $ to servers ◦ Hyp-Atom achieves greater throughput per TCO $ 21

Conclusion Efficiency Atom is more energy, cost efficient than Xeon Price of Efficiency Atom limitations impact latency, relevance, flexibility Mitigating Price of Efficiency Atom over-provisioning should consider platform overheads 22

Conclusion Also in the paper ... • µ architecture ◦ Processor activity from hardware counters ◦ µ architectural bottlenecks • Search ◦ Application phases in computation ◦ Execution time breakdown • Mitigating Price of Efficiency ◦ µ architectural enhancements ◦ Heterogeneous, accelerated processors 23

Conclusion Conclusion • Emerging Cloud Applications ◦ Extract value from data ◦ Increase compute intensity • Energy Efficiency ◦ Improve efficiency by 5 × with mobile processors ◦ Exact price in latency, relevance, flexiblity • Future Challenges ◦ Pursue efficiency given compute intensity ◦ Consider heterogeneous, accelerated processors 24

Web Search Using Mobile Cores Quantifying and Mitigating the Price of Efficiency Vijay Janapa Reddi Benjamin Lee Trishul Chilimbi Kushagra Vaid Engineering & Applied Science Electrical Engineering Runtime Analysis & Design Global Foundation Services Microsoft Research Harvard University Stanford University Microsoft Corporation International Symposium on Computer Architecture 22 June 2010 25

Web Search Using Mobile Cores Quantifying and Mitigating the Price - PowerPoint PPT Presentation

Web Search Using Mobile Cores Quantifying and Mitigating the Price of Efficiency Vijay Janapa Reddi Benjamin Lee Trishul Chilimbi Kushagra Vaid Engineering & Applied Science Electrical Engineering Runtime Analysis & Design Global

TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES

MOBILE ADVERTISING Agenda Get off to a mobile start with Media Impact! Why mobile? MI

PROGRAMMING TENSOR CORES: NATIVE VOLTA TENSOR CORES WITH CUTLASS Andrew Kerr, Timmy Liu, Mostafa

Fused and Composable Heterogeneous Cores Roshan Nair and Anirudh Krishna Villivalam Single cores

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

Mobile Web Applications using HTML5 L. Cotfas 14 Dec. 2011 Reasons for mobile web development

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

EE 6882 Visual Search Engine Lec. 1: Introduction tinyeye, photo copy search Web image search

MOBILE HTML5 Max Firtman @firt mobile+web developer Wednesday, October 12, 11 who am I? mobile +

Web CS490W: Web I nformation Search & Management Web opened the door for many important

Web Data Representation Web Graph, Text, Images, Metadata, Search spaces Web Search 1 The Web

Mobile Capabilities And Credentials Contents Mobile Landscape Mobile Functionalities

Tabu Search Search Tabu Page 1 Part I Part I Tabu Search Principles Search Principles Tabu

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Informed search algorithms Outline Best-first search Greedy best-first search A *

Foundations of Artificial Intelligence 9. State-Space Search: Tree Search and Graph Search Malte

ON THE EQUIVALENCE BETWEEN GRAPHICAL AND TABULAR REPRESENTATIONS FOR SECURITY RISK ASSESSMENT

Methodology Adapted from Menasc & Almeida. 1 Learning Objectives Discuss the concept

A Dynamic Approach to Scaling in Bundle Methods for Convex Optimization Christoph Helmberg joint

= LEITER f Students 2884 : 4- RADES = 12 DOMAIN = Z CODOWUAIN = 9. } 16,25 4

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension Mandar

A Reference Architecture for Web Servers

CISC 322 Software Architecture Lecture 11: Reference Architecture Emad Shihab Paper by: Ahmed

Performance Metrics for Web Browsing draft fan ippm web metrics 00 Peng Fan

Web Search Using Mobile Cores Quantifying and Mitigating the Price - PowerPoint PPT Presentation

Web Search Using Mobile Cores Quantifying and Mitigating the Price of Efficiency Vijay Janapa Reddi Benjamin Lee Trishul Chilimbi Kushagra Vaid Engineering & Applied Science Electrical Engineering Runtime Analysis & Design Global

TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES

MOBILE ADVERTISING Agenda Get off to a mobile start with Media Impact! Why mobile? MI

PROGRAMMING TENSOR CORES: NATIVE VOLTA TENSOR CORES WITH CUTLASS Andrew Kerr, Timmy Liu, Mostafa

Fused and Composable Heterogeneous Cores Roshan Nair and Anirudh Krishna Villivalam Single cores

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

Mobile Web Applications using HTML5 L. Cotfas 14 Dec. 2011 Reasons for mobile web development

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

EE 6882 Visual Search Engine Lec. 1: Introduction tinyeye, photo copy search Web image search

MOBILE HTML5 Max Firtman @firt mobile+web developer Wednesday, October 12, 11 who am I? mobile +

Web CS490W: Web I nformation Search &amp; Management Web opened the door for many important

Web Data Representation Web Graph, Text, Images, Metadata, Search spaces Web Search 1 The Web

Mobile Capabilities And Credentials Contents Mobile Landscape Mobile Functionalities

Tabu Search Search Tabu Page 1 Part I Part I Tabu Search Principles Search Principles Tabu

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Informed search algorithms Outline Best-first search Greedy best-first search A *

Foundations of Artificial Intelligence 9. State-Space Search: Tree Search and Graph Search Malte

ON THE EQUIVALENCE BETWEEN GRAPHICAL AND TABULAR REPRESENTATIONS FOR SECURITY RISK ASSESSMENT

Methodology Adapted from Menasc &amp; Almeida. 1 Learning Objectives Discuss the concept

A Dynamic Approach to Scaling in Bundle Methods for Convex Optimization Christoph Helmberg joint

= LEITER f Students 2884 : 4- RADES = 12 DOMAIN = Z CODOWUAIN = 9. } 16,25 4

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension Mandar

A Reference Architecture for Web Servers

CISC 322 Software Architecture Lecture 11: Reference Architecture Emad Shihab Paper by: Ahmed

Performance Metrics for Web Browsing draft fan ippm web metrics 00 Peng Fan

Web CS490W: Web I nformation Search & Management Web opened the door for many important

Methodology Adapted from Menasc & Almeida. 1 Learning Objectives Discuss the concept