AdaptSize: Orchestrating the Hot Object Memory Cache in a CDN - PowerPoint PPT Presentation

AdaptSize: Orchestrating the Hot Object Memory Cache in a CDN Daniel S. Mor Ramesh K. Berger Harchol-Balter Sitaraman USENIX NSDI. Boston, March 28, 2017.

CDN Caching Architecture Content providers 1% 1% 1% 1% DC HOC CDN 100% 100% 100% 100% Users 1

Optimizing CDN Caches Two caching levels: 1% ❏ Disk Cache (DC) DC ❏ Hot Object Cache (HOC) 40% # reqs served HOC performance metric HOC by HOC object hit ratio = OHR = total # reqs 100% Goal: maximize OHR 2

Prior Approaches to Cache Management Frequent decisions required DC What to admit What to evict Today in practice historically everything LRU a few GBs capacity e.g., Nginx, Varnish HOC mixtures of 2000s in academia everything LRU/LFU e.g., Modha, Zhang, Kumar 500 GB concurrent 2010s in academia per hour everything LRU e.g., Kaminsky, Lim, Andersen 3

We Are Missing a Key Issue 9 orders of magnitude Not all objects are the same ❏ Should we admit every object? (no, we should favor small objects) ❏ A few key companies know this (but don’t know how to it well) ❏ Academia has not been helpful (almost all theoretical work assumes equal-sized objects) 4

What’s Hard About Size-Aware Admission Fixed Size Threshold: How to pick c: admit if size < Threshold c pick c to maximize OHR 9pm 2pm m a 8 t a c t s e b Threshold c The best threshold changes with traffic mix 5

Can we avoid picking a threshold c Probabilistic admission: Unfortunately, many curves example: exp(c) family high admission low admission Which curve makes big difference probability probability We need to adapt c 6

What to admit What to evict concurrent LRU AdaptSize adaptive size-aware The AdaptSize Caching System adapt adapt First system that continuously adapts with with the parameter of size-aware admission time traffic Enforce Calculate Take traffic Calculate admission measurements the best c the best c control 7 Incorporated into high-throughput production caching system (Varnish)

Δ interval Δ interval Δ interval … time How to Find Best c Within Each Δ Interval Traditional approach AdaptSize approach Hill climbing Markov model Enables speedy Local optima on global optimization OHR-vs-c curve 8

How AdaptSize Gets the OHR-vs-c curve hit miss Markov chain OUT IN ➢ track IN/OUT for each object Algorithm request request For every Δ interval and for every value of c use Markov chain to solve for OHR( c ) ❏ find c to maximize OHR ❏ Why hasn’t this been done? Too slow: exponential state space New technique: approximation with linear state space 9

Implementing AdaptSize Incorporated into Varnish highly concurrent HOC system, 40+ Gbit/s DC Enforce Take traffic Calculate admission measurements the best c control HOC A dapt S ize 10 Goal: low overhead on request path

Implementing AdaptSize Incorporated into Varnish highly concurrent HOC system, 40+ Gbit/s DC Enforce Take traffic Calculate admission measurements the best c control HOC Challenges A dapt 40% 1% 1) Concurrent write conflicts requests objects S ize 2) Locks too slow [NSDI’13 & 14] AdaptSize: producer/consumer + ring buffer Lock-free implementation 11

Implementing AdaptSize Incorporated into Varnish highly concurrent HOC system, 40+ Gbit/s DC Enforce Take traffic Calculate admission measurements the best c control HOC AdaptSize: A dapt admission is really simple S ize given c, and the object size ❏ admit with P(c, size) ❏ Enables lock free & low overhead implementation 12

AdaptSize Evaluation Testbed Origin 40 GBit / Origin : emulates 100s of web servers 100ms RTT 55 million / 8.9 TB unique objects DC DC : unmodified Varnish 4x 1TB/ 7200 Rpm HOC unmodified Varnish A dapt HOC systems : ❏ S ize 1.2 GB NGINX cache ❏ 16 threads AdaptSize ❏ 40 GBit / 30ms RTT Clients : replay Akamai requests trace 440 million / 152 TB total requests 13

Comparison to Production Systems what to admit what to evict Varnish everything concurrent LRU Nginx frequency filter LRU AdaptSize concurrent LRU adaptive size-aware +92% +48% 14

Comparison to Research-Based Systems manually tuned parameters recency and manually tuned parameters frequency combinations +67% manually tuned parameters 15

Robustness of AdaptSize Size-Aware OPT: offline parameter tuning AdaptSize: our Markovian tuning model HillClimb: local-search using shadow queues 16

Conclusion # reqs Goal: maximize OHR of the Hot Object Cache served by HOC OHR= total Approach: size-based admission control # reqs 17

Conclusion # reqs Goal: maximize OHR of the Hot Object Cache served by HOC OHR= total Approach: size-based admission control # reqs Key insight: need to adapt parameter c AdaptSize: adapts c via a Markov chain Result: 48-92% higher OHRs 18

Conclusion # reqs Goal: maximize OHR of the Hot Object Cache served by HOC OHR= total Approach: size-based admission control # reqs Key insight: need to adapt parameter c AdaptSize: adapts c via a Markov chain Result: 48-92% higher OHRs Throughput ❏ In our paper /dasebe/AdaptSize Disk utilization ❏ Byte hit ratio ❏ Request latency ❏ 19

AdaptSize: Orchestrating the Hot Object Memory Cache in a CDN - PowerPoint PPT Presentation

AdaptSize: Orchestrating the Hot Object Memory Cache in a CDN Daniel S. Mor Ramesh K. Berger Harchol-Balter Sitaraman USENIX NSDI. Boston, March 28, 2017. CDN Caching Architecture Content providers 1% 1% 1% 1% DC HOC CDN 100% 100%

What Is Memory Hierarchy A typical memory hierarchy today: Lecture 13: Cache Basics and Cache

Memory Hierarchy: Cache Memory hierarchy Cache basics Locality Cache organization Cache-aware

1 Classifying cache misses Cache Organization Classifying misses by causes (3Cs) Cache size,

Cache Systems CPU Main Main CPU Memory Memory 400MHz 10MHz Cache 10MHz Memory Hierarchy

General Cache Mechanics CPU Block: unit of data in cache and memory. (a.k.a. line) Memory

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Chapter 4 Cache Memory Contents Computer memory system overview Characteristics of

L09: Cache Name: ID: Question: Direct Mapping Cache Hit Rate Consider a 4-block empty Cache,

Virtual Memory 1 Memory Hierarchy Memory 4GB Cache 1M Registers 1K Question: What if

Lecture 23: Cache, Memory, Virtual Memory Todays topics: Cache examples, caching

Caches Electronic Computers M Caches 1 Cache LOCALITY PRINCIPLE (SPATIAL AND TEMPORAL)

Web Cache Consistency Web Cache Consistency Web Cache Consistency Web Cache Consistency

Cache Example Main memory: Byte addressable memory of size 4GB = 2 32 bytes Cache size: 64KB = 2 16

Generations of Cache 1980: no cache in proc; 1989 first Intel proc with a cache on chip.

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Keep your Data Close and your Caches Hotter using Apache Kafka, Connect and KSQL @gamussa |

Webinar Employee Self Service (ESS) November 13, 2014 Gavin Scott, QSS Mark Bixby, QSS Agenda

Resource Management Challenges in the Era of Extreme Heterogeneity Ron Brightwell, R&D

Annua l Gra nts Ma na g e me nt Surve y Re sults a nd Ana lysis FEBRUARY, 2020 RE I Syste ms,

Towards Network Aware Recommendations Savvas Kastanakis Postgraduate Student @ CSD UOC

The Page Cache Don Porter CSE 506 Recap Last time we talked about optimizing disk I/O

Navigating Working Relationships with Site Supervisors Dial : 877-853-5257 Webinar ID : 953 0278

Bill Boroski LQCD-ext Contractor Project Manager USQCD All-Hands Meeting Fermi National

AdaptSize: Orchestrating the Hot Object Memory Cache in a CDN - PowerPoint PPT Presentation

AdaptSize: Orchestrating the Hot Object Memory Cache in a CDN Daniel S. Mor Ramesh K. Berger Harchol-Balter Sitaraman USENIX NSDI. Boston, March 28, 2017. CDN Caching Architecture Content providers 1% 1% 1% 1% DC HOC CDN 100% 100%

What Is Memory Hierarchy A typical memory hierarchy today: Lecture 13: Cache Basics and Cache

Memory Hierarchy: Cache Memory hierarchy Cache basics Locality Cache organization Cache-aware

1 Classifying cache misses Cache Organization Classifying misses by causes (3Cs) Cache size,

Cache Systems CPU Main Main CPU Memory Memory 400MHz 10MHz Cache 10MHz Memory Hierarchy

General Cache Mechanics CPU Block: unit of data in cache and memory. (a.k.a. line) Memory

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Cache Memory Chapter 17 S. Dandamudi Outline Introduction Types of cache misses

Chapter 4 Cache Memory Contents Computer memory system overview Characteristics of

L09: Cache Name: ID: Question: Direct Mapping Cache Hit Rate Consider a 4-block empty Cache,

Virtual Memory 1 Memory Hierarchy Memory 4GB Cache 1M Registers 1K Question: What if

Lecture 23: Cache, Memory, Virtual Memory Todays topics: Cache examples, caching

Caches Electronic Computers M Caches 1 Cache LOCALITY PRINCIPLE (SPATIAL AND TEMPORAL)

Web Cache Consistency Web Cache Consistency Web Cache Consistency Web Cache Consistency

Cache Example Main memory: Byte addressable memory of size 4GB = 2 32 bytes Cache size: 64KB = 2 16

Generations of Cache 1980: no cache in proc; 1989 first Intel proc with a cache on chip.

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Keep your Data Close and your Caches Hotter using Apache Kafka, Connect and KSQL @gamussa |

Webinar Employee Self Service (ESS) November 13, 2014 Gavin Scott, QSS Mark Bixby, QSS Agenda

Resource Management Challenges in the Era of Extreme Heterogeneity Ron Brightwell, R&amp;D

Annua l Gra nts Ma na g e me nt Surve y Re sults a nd Ana lysis FEBRUARY, 2020 RE I Syste ms,

Towards Network Aware Recommendations Savvas Kastanakis Postgraduate Student @ CSD UOC

The Page Cache Don Porter CSE 506 Recap Last time we talked about optimizing disk I/O

Navigating Working Relationships with Site Supervisors Dial : 877-853-5257 Webinar ID : 953 0278

Bill Boroski LQCD-ext Contractor Project Manager USQCD All-Hands Meeting Fermi National

Resource Management Challenges in the Era of Extreme Heterogeneity Ron Brightwell, R&D