Applied Performance Theory @kavya719
kavya
applying performance theory to practice
performance
• What's the additional load the system can support without degrading response time?
• What are the system's utilization bottlenecks?
• What's the impact of a change on response time and maximum throughput?

capacity
• How many additional servers are needed to support 10x the load?
• Is the system over-provisioned?
#YOLO method

load simulation
Stressing the system to empirically determine its actual performance characteristics and bottlenecks. Can be incredibly powerful.

performance modeling
performance modeling
real-world system → (model as) → theoretical model → (analyze) → results → (translate back)
The model makes assumptions about the system: request arrival rate, service order, service times. You cannot apply the results if your system does not satisfy them!
a single server
• open and closed queueing systems
• the utilization law, Little's law, the P-K formula
• CoDel, adaptive LIFO

a cluster of many servers
• the USL, scaling bottlenecks

stepping back
• the role of performance modeling
a single server
model I
web clients → server
[graph: response time (ms) vs. throughput (requests / second), with a response time threshold marked]
"What's the maximum throughput of this server, given a response time target?"
"How can we improve the mean response time?"
model the web server as a queueing system.
request → queue → web server → response
queueing delay + service time = response time

assumptions
1. requests are independent and random, arriving at some "arrival rate".
2. requests are processed one at a time, in FIFO order; requests queue if the server is busy ("queueing delay").
3. the "service time" of a request is constant.
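To make the model concrete, here is a minimal simulation sketch of it (the helper and all parameter values are mine, not from the talk; it assumes Poisson arrivals, a constant service time, and one FIFO server):

    import random

    def mean_queueing_delay(arrival_rate, service_time, n_requests=100_000):
        # Single FIFO server: a request waits if it arrives before the
        # server finishes the previous one.
        random.seed(1)
        clock = 0.0            # arrival time of the current request
        server_free_at = 0.0   # when the server finishes its current work
        total_delay = 0.0
        for _ in range(n_requests):
            clock += random.expovariate(arrival_rate)  # next Poisson arrival
            start = max(clock, server_free_at)         # queue if server is busy
            total_delay += start - clock               # queueing delay only
            server_free_at = start + service_time
        return total_delay / n_requests

    # 10 ms service time; compare utilization 0.5 vs 0.9:
    print(mean_queueing_delay(arrival_rate=50, service_time=0.010))  # ~5 ms
    print(mean_queueing_delay(arrival_rate=90, service_time=0.010))  # ~45 ms

The queueing delay at U = 0.9 is roughly 9x the delay at U = 0.5: a first hint of the non-linearity the P-K formula makes precise below.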
"What's the maximum throughput of this server?" i.e. given a response time target

Utilization law ("busyness"): utilization = arrival rate * service time
e.g. 50 requests/second * 10 ms per request = 0.5, i.e. the server is busy half the time.
As the arrival rate increases, server utilization increases linearly.

P-K formula: as utilization increases, P(request has to queue) increases, so mean queue length increases, so mean queueing delay increases.
Pollaczek-Khinchine (P-K) formula

mean queueing delay = (U / (1 - U)) * linear fn(mean service time) * quadratic fn(service time variability)

assuming constant service time, and so constant request sizes:

mean queueing delay ∝ U / (1 - U)

since response time ∝ queueing delay, response time grows the same way.
[graphs: queueing delay vs. utilization (U); response time vs. utilization (U)]
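The formula is easy to evaluate directly. A sketch (the helper name and parameters are mine; for an M/G/1 queue the "quadratic fn" term works out to (1 + C²) / 2, where C² is the squared coefficient of variation of the service time):

    def pk_mean_queueing_delay(utilization, mean_service_time, service_cv2=0.0):
        # P-K formula for an M/G/1 queue.
        # service_cv2 = 0 models constant service times (the slide's assumption).
        assert 0 <= utilization < 1, "the queue is unstable at U >= 1"
        return (utilization / (1 - utilization)) * mean_service_time * (1 + service_cv2) / 2

    # 10 ms service time; delay blows up non-linearly as U approaches 1:
    for u in (0.5, 0.8, 0.9, 0.99):
        print(u, pk_mean_queueing_delay(u, mean_service_time=0.010))
    # 0.5 -> 5 ms, 0.8 -> 20 ms, 0.9 -> 45 ms, 0.99 -> 495 ms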
"What's the maximum throughput of this server?" i.e. given a response time target

[graph: response time (ms) vs. throughput (requests / second), with the max throughput marked at the response time target]

low utilization regime: server utilization increases linearly with arrival rate (utilization law); response time stays roughly flat.
high utilization regime: mean queueing delay increases non-linearly (P-K formula); so does response time.
The maximum throughput is the throughput just before the high utilization regime pushes response time past the target.
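Under the same assumptions (Poisson arrivals, constant service time, FIFO), the model can be inverted to answer the slide's question in closed form. A sketch with illustrative numbers (the derivation and helper are mine, not from the talk):

    def max_throughput(service_time, target_response_time):
        # response time R(U) = S + U * S / (2 * (1 - U)); solve R(U) = target
        # for U, then convert U back to throughput via the utilization law.
        r = target_response_time / service_time
        assert r > 1, "the target must exceed the bare service time"
        u = 2 * (r - 1) / (2 * r - 1)   # closed-form solution of R(U) = target
        return u / service_time          # max arrival rate, requests/second

    # 10 ms service time, 50 ms response time target:
    print(max_throughput(0.010, 0.050))  # ~88.9 requests/second, at U ~ 0.89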
"How can we improve the mean response time?"

1. response time ∝ queueing delay: prevent requests from queueing too long.
key insight: queues are typically empty.

• Controlled Delay (CoDel), in Facebook's Thrift framework: allows short bursts, prevents standing queues. Helps when the system is overloaded; makes no difference when it's not. (Pseudocode below.)
• adaptive or always LIFO, in Facebook's PHP runtime and Dropbox's Bandaid reverse proxy: serve the newest requests first, not old requests that are likely to expire.
• set a max queue length.
• client-side concurrency control.

onNewRequest(req, queue):
  if (queue.lastEmptyTime() < (now - N ms)) {
    // Queue was last empty more than N ms ago;
    // set timeout to M << N ms.
    timeout = M ms
  } else {
    // Else, set timeout to N ms.
    timeout = N ms
  }
  queue.enqueue(req, timeout)
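A runnable version of that pseudocode, as a sketch (the class name and the concrete N and M values are assumptions; the talk gives only the rule):

    import time
    from collections import deque

    N_SEC = 0.100   # standing-queue threshold ("N ms")
    M_SEC = 0.005   # aggressive timeout under overload ("M << N ms")

    class CoDelQueue:
        def __init__(self):
            self.queue = deque()                  # (request, deadline) pairs
            self.last_empty = time.monotonic()

        def enqueue(self, req):
            now = time.monotonic()
            if not self.queue:
                self.last_empty = now             # queue was empty until now
            if self.last_empty < now - N_SEC:
                timeout = M_SEC   # standing queue for > N ms: shed load fast
            else:
                timeout = N_SEC   # short burst: let it through
            self.queue.append((req, now + timeout))

        def dequeue(self):
            while self.queue:
                req, deadline = self.queue.popleft()
                if not self.queue:
                    self.last_empty = time.monotonic()
                if time.monotonic() <= deadline:
                    return req
                # else: the request expired while queued; drop it and continue
            return None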
"How can we improve the mean response time?"

2. response time ∝ queueing delay (P-K formula):
mean queueing delay = (U / (1 - U)) * linear fn(mean service time) * quadratic fn(service time variability)

• decrease the mean service time, by optimizing application code.
• decrease request / service size variability, for example by batching requests.
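The variability term is worth seeing with numbers. A self-contained sketch (illustrative values, mine; C² = 0 is constant service time, C² = 1 is exponential):

    def pk_delay(utilization, mean_service_time, service_cv2):
        # P-K formula for an M/G/1 queue.
        return (utilization / (1 - utilization)) * mean_service_time * (1 + service_cv2) / 2

    # Same 10 ms mean service time and U = 0.8; only the variability differs:
    print(pk_delay(0.8, 0.010, service_cv2=0.0))  # constant sizes:   20 ms queueing delay
    print(pk_delay(0.8, 0.010, service_cv2=1.0))  # high variability: 40 ms, i.e. 2x worse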
model II
sensors at an industry site upload data to a server in the cloud; the server processes data from N sensors.

while true:
    // upload synchronously.
    ack = upload(data)
    // update state.
    deleteUploaded(ack)
    // sleep for Z seconds.
    sleep(Z seconds)
[diagram: N clients → server, request/response loop]

This is called a closed system, super different from the previous web server model (an open system):
• requests are synchronized, so throughput depends on response time!
• fixed number of clients, so queue length is bounded (<= N), and so response time is bounded!
response time vs. load for closed systems

assumptions
1. sleep time ("think time") is constant.
2. requests are processed one at a time, in FIFO order.
3. service time is constant.

[graph: throughput vs. number of clients, with low and high utilization regimes]
Like earlier, as the number of clients (N) increases, throughput increases up to a point, i.e. until utilization is high. After that, increasing N only increases queueing. What happens to response time in this regime?
Little's law for closed systems

[diagram: N clients → server; a request is sleeping, waiting, or being processed]
The system in this case is the entire loop, i.e. a request can be in one of three states in the system: sleeping (on the device), waiting (in the server queue), or being processed (in the server). The total number of requests in the system includes requests across all three states.
Little's law for closed systems

# requests in system = throughput * round-trip time of a request across the whole system
round-trip time = sleep time + response time (where response time = queueing delay + service time)

so: N = throughput * (sleep time + response time)

Applying it in the high utilization regime (constant throughput) and assuming constant sleep time:

N = constant * response time + constant

So, response time only grows linearly with N!
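Rearranging gives response time = N / throughput - sleep time, which is easy to sanity-check with numbers (all values here are illustrative, not from the talk):

    def closed_system_response_time(n_clients, throughput, sleep_time):
        # Little's law rearranged: N = X * (sleep + response).
        return n_clients / throughput - sleep_time

    # Server saturated at 100 uploads/second, sensors sleeping 9.5 s between uploads:
    print(closed_system_response_time(1000, 100, 9.5))  # 0.5 s
    print(closed_system_response_time(2000, 100, 9.5))  # 10.5 s

Doubling N in the high utilization regime (throughput pinned at the server's capacity) grows response time linearly, exactly as the formula predicts.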
response time vs. load for closed systems

Like earlier, as the number of clients (N) increases, throughput increases up to a point, i.e. until utilization is high; after that, increasing N only increases queueing.

So, response time for a closed system:
• low utilization regime: stays ~the same.
• high utilization regime: grows linearly with N.

[graph: response time vs. number of clients]