Scalability! But at what COST? Abhinav Garg CS 744 - Fall 2018

Oct 30, 2022

Scalability! But at what COST? Abhinav Garg   CS 744 - Fall 2018
Outline • Motivation • Goal • COST • Methodology • Baseline Measurements • Better Baselines • Applying COST to prior work • Take-aways
Which system is better? [Figure: scaling of System A and System B]
Which one would you use? [Figure: scaling and performance of a Naiad computation before (System A) and after (System B) a performance optimization is applied]
Motivation • Scalability is often considered the most important feature • Big data systems may scale well, often because they introduce a lot of overhead • Are systems truly improving performance?
Goal • A new performance metric for big data platforms • Distinguish scalability from efficient use of resources • Weigh a system's scalability against its overheads • Do not reward systems with substantial but parallelizable overheads
COST • Configuration that Outperforms a Single Thread • The hardware configuration required before a platform outperforms a competent single-threaded implementation
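As a rough illustration of the definition above, COST can be read off a set of scaling measurements as the smallest configuration that beats the single-threaded baseline. The sketch below assumes configurations are identified by core count; the function name, the input shape, and all numbers are made up for illustration and are not taken from the slides.

```python
from typing import Mapping, Optional


def cost(system_times: Mapping[int, float], single_thread_time: float) -> Optional[int]:
    """Return the smallest core count at which the system beats the baseline.

    system_times maps a core count to elapsed seconds (hypothetical input shape).
    Returns None when no measured configuration outperforms the single thread,
    meaning the COST is unbounded over the measured range.
    """
    for cores in sorted(system_times):
        if system_times[cores] < single_thread_time:
            return cores
    return None


# Made-up measurements, for illustration only.
measurements = {1: 900.0, 8: 400.0, 16: 250.0, 64: 120.0, 128: 90.0}
print(cost(measurements, single_thread_time=300.0))  # 16
print(cost(measurements, single_thread_time=50.0))   # None
```

A system can scale smoothly from 1 to 128 cores and still report `None` here, which is the distinction between scalability and efficiency that the metric is meant to expose.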
Methodology • Take measurements from recent graph processing publications • Compare against simple single-threaded implementations running on a laptop • Write competent, but not overly fancy algorithms. • Evaluate Page Rank and Graph Connectivity on twitter_rv and uk_2007_05 graphs (GraphX)
Baseline Measurements • Elapsed time for 20 Page Rank iterations
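For a sense of how small a "competent, but not overly fancy" single-threaded baseline can be, here is a minimal sketch of 20 Page Rank iterations over an edge list. The function name, the damping factor of 0.85, and the unnormalized rank convention are assumptions chosen for illustration; this is not the code that produced the measurements.

```python
from typing import List, Sequence, Tuple


def pagerank(num_nodes: int, edges: Sequence[Tuple[int, int]],
             iterations: int = 20, alpha: float = 0.85) -> List[float]:
    """Single-threaded Page Rank over a (src, dst) edge list."""
    out_degree = [0] * num_nodes
    for src, _ in edges:
        out_degree[src] += 1

    rank = [1.0] * num_nodes
    for _ in range(iterations):
        # Each vertex splits its damped rank evenly across its out-edges.
        share = [alpha * rank[v] / out_degree[v] if out_degree[v] else 0.0
                 for v in range(num_nodes)]
        # Every vertex starts the round with the teleport term.
        rank = [1.0 - alpha] * num_nodes
        for src, dst in edges:
            rank[dst] += share[src]
    return rank


# Tiny example: vertices 1 and 2 both point at vertex 0.
print(pagerank(3, [(1, 0), (2, 0)]))
```

The inner loop is one sequential pass over the edges per iteration, so its speed depends mostly on memory access patterns rather than on coordination.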
Baseline Measurements • Elapsed time for Graph Connectivity (using label propagation)
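Label propagation for graph connectivity can be sketched as follows: every vertex starts with its own id as a label, and labels are repeatedly lowered to the minimum seen across each edge until nothing changes. This version updates labels in place during a pass, which is one of several possible variants; the function name and return shape are hypothetical.

```python
from typing import List, Sequence, Tuple


def label_propagation(num_nodes: int,
                      edges: Sequence[Tuple[int, int]]) -> Tuple[List[int], int]:
    """Return (labels, passes); vertices in one component share the smallest id."""
    labels = list(range(num_nodes))
    passes = 0
    changed = True
    while changed:
        changed = False
        passes += 1
        for u, v in edges:
            low = min(labels[u], labels[v])
            if labels[u] != low:
                labels[u] = low
                changed = True
            if labels[v] != low:
                labels[v] = low
                changed = True
    return labels, passes


labels, passes = label_propagation(6, [(0, 1), (1, 2), (3, 4)])
print(labels, passes)  # [0, 0, 0, 3, 3, 5] 2
```

The number of passes over the whole edge list grows with how far labels have to travel, which is the repeated work a more direct algorithm avoids.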
Better Baselines • Improve graph layout • Hilbert Order instead of Vertex Order • (good, good) locality instead of (great, poor) • Reduces TLB misses and page walks
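One way to picture the Hilbert-order layout: treat each edge (src, dst) as a point in a 2D grid and sort the edges by their position along a Hilbert curve, so that consecutive edges are close in both source and destination. The sketch below uses the standard coordinate-to-index conversion for a Hilbert curve; the function names and the `order` parameter (number of bits per vertex id) are assumptions for illustration.

```python
from typing import List, Sequence, Tuple


def hilbert_index(order: int, x: int, y: int) -> int:
    """Position of point (x, y) along a Hilbert curve over a 2^order x 2^order grid."""
    n = 1 << order
    d = 0
    s = n >> 1
    while s > 0:
        rx = 1 if x & s else 0
        ry = 1 if y & s else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate or flip the quadrant so the sub-curve has the right orientation.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s >>= 1
    return d


def hilbert_sort(edges: Sequence[Tuple[int, int]], order: int) -> List[Tuple[int, int]]:
    """Return the edges sorted by Hilbert index of (src, dst)."""
    return sorted(edges, key=lambda e: hilbert_index(order, e[0], e[1]))


print(hilbert_sort([(1, 0), (0, 0), (1, 1), (0, 1)], order=1))
# [(0, 0), (0, 1), (1, 1), (1, 0)]
```

Vertex order would instead sort by src alone, which gives great locality on one endpoint and poor locality on the other.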
Better Baselines • Improve algorithms • Label propagation scales well because of the algorithm's sub-optimality • Label propagation does more work than better algorithms • Use the Union-Find algorithm
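A minimal sketch of connectivity via Union-Find, which handles each edge once instead of repeatedly propagating labels. Union by size and path halving are choices made here for illustration; the slides do not say which variant was used, and the function name is hypothetical.

```python
from typing import List, Sequence, Tuple


def connected_components(num_nodes: int,
                         edges: Sequence[Tuple[int, int]]) -> List[int]:
    """Return a representative vertex per node; equal values mean same component."""
    parent = list(range(num_nodes))
    size = [1] * num_nodes

    def find(v: int) -> int:
        # Path halving: point each visited vertex at its grandparent.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            continue
        # Union by size: attach the smaller tree under the larger one.
        if size[ru] < size[rv]:
            ru, rv = rv, ru
        parent[rv] = ru
        size[ru] += size[rv]

    return [find(v) for v in range(num_nodes)]


roots = connected_components(6, [(0, 1), (1, 2), (3, 4)])
print(len(set(roots)))  # 3
```

A single pass over the edge list suffices, whereas label propagation may need many passes before labels stop changing.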
Better Baselines • Page Rank: 179 sec to convert • Graph Connectivity: does not 'think like a vertex', but parallelizable
Applying COST to prior work • [Figure: scaling measurements for Page Rank on the Twitter graph; panels: time per warm iteration, time for 10 iterations from a cold start]
Applying COST to prior work • 1 - Hash table based • 2 - Array based • Makes the trade-off clearer • [Figure: two Naiad implementations of parallel union-find for graph connectivity]
Reasons to tolerate high COST • Integration with an existing ecosystem • Targeting a variety of problems • High availability, fault tolerance, or security • Technical expertise of the team • Think: Do you really need the high-COST system?
Take-aways • Understanding overheads is important • The most scalable systems might not be the most efficient • Consider alternative hardware and algorithms • It is important to evaluate COST: to explain whether a high COST is intrinsic, and to highlight avoidable inefficiencies
Questions?
References • Frank McSherry, Michael Isard, Derek Murray. Scalability! But at what COST? HotOS, 2015 • http://www.frankmcsherry.org/graph/scalability/cost/2015/01/15/COST.html • https://www.youtube.com/watch?v=6bWBEJBMNG0
