Efficient Miss Ratio Curve Computation for Heterogeneous Content - PowerPoint PPT Presentation

Efficient Miss Ratio Curve Computation for Heterogeneous Content Popularity Damiano Carra Giovanni Neglia Computer Science Dept. Université Côte d’Azur University of Verona Inria Verona, Italy Sophia Antipolis, France

Context q Caches are fundamental components of computing architectures – Fast access to the most used items q Shared resource – Used by different processes or applications, with different requirements and access patterns q Main issue – How to divide and assign dynamically cache portions to applications? 2

Miss Ratio Curves – MRCs q Profiling – Hit ratio vs amount of cache space for each application q Use – Compute the MRCs for each application for a given interval – Assign cache space that maximize some objective function q Main concern: Computational complexity 3

Current approaches q MRC are computed from the trace q Exact computation requires: – O(M) memory – O(logM) computational complexity q Approximate computation – Trade precision with space and speed • O(1) memory and O(1) operation per request – Most of the solutions are based on sampling 4

Spatial sampling q Approach: – Observe a randomly selected fraction R of the items – Build the MRC and scale the X-axis by 1/R q R can be adaptive – This allows to obtain O(1) memory and O(1) computational complexity 5

Sampling bias q Spatial sampling biased if requests rates are highly heterogenous across objects – The sample may or may not include some objects that are crucial for the MRC q Solutions to such bias focus on the MRC tail – A correction factor accounts for the difference between the expected and actual average observed requests 6

Experiments on synthetic traces q IRM traces, with Zipf-distributed object popularity – Various ⍺ (Zipf parameter) α = 0.8 α = 1.2 1.2 1.2 MRC 1 1 sample1 sample2 0.8 0.8 Miss ratio Miss ratio sample3 sample4 0.6 0.6 MRC sample1 0.4 0.4 sample2 sample3 0.2 0.2 sample4 0 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 Cache size (num. of items) Cache size (num. of items) 7

[Detour] How to measure the accuracy? q Mean Absolute Error (MAE) – Average of the absolute difference between the exact and approximate MRC q Main issue à All sizes have the same weight ms-ex ms-ex 1 1 0.8 0.8 Miss ratio Miss ratio 0.6 0.6 0.4 0.4 0.2 0.2 approximate MRC approximate MRC exact MRC exact MRC 0 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 0 0.5 1 1.5 2 2.5 Cache size (num. of items x 10 6 ) Cache size (num. of items) 8

[Detour] MAE per Quantile à MAEQ ms-ex q Divide the Y-axis into intervals 1 0.8 – And determine the corresponding intervals Miss ratio on the X-axis 0.6 q Compute the MAE i within each interval i 0.4 0.2 approximate MRC q MAEQ = average MAE over all intervals exact MRC 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 q In this example: Cache size (num. of items) – MAE = 0.025 – MAEQ = 0.09 9

Accuracy with synthetic traces q As ⍺ increases (high variability in the popularities) à the error increases q Does the error depend on the tail of the popularity distribution? 10 0 R = 0.1 Average error (MAEQ) R = 0.01 10 -1 R = 0.001 10 -2 10 -3 10 -4 0.6 0.8 1.0 1.2 0.6p Parameter α of the Zipf 10

The role of popular objects q Case with ⍺ = 0.6 à We add 20 popular objects IRM α = 0.6, with popular items 10 0 1.4 Zipf, 1.2 Zipf, 0.6 10 -1 1.2 0.6p 10 -2 1 request freq. Miss ratio 10 -3 0.8 10 -4 0.6 10 -5 0.4 MRC sample1 10 -6 sample2 0.2 sample3 sample4 10 -7 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 0 10 1 10 2 10 3 10 4 10 5 Item ID Cache size (num. of items) q The results are confirmed by a model (see the paper) 11

Our solution: Key idea A B C B D q Small cache sizes depend Requests mostly on the highly popular Reuse Distance Histogram items Exact From samples hits hits q Large cache sizes can be built from sample 1 size size B R Exact MRC Approximate MRC 1 1 q Our approach à Combine 1 R exact and approximate MRCs B size B size Full MRC – Can adopt a “constant 1 sampling rate” or “constant complexity” approach B size 12

Experimental methodology q Comparison with the state-of-the-art solutions – We set the parameters aiming at fair comparison • Use the same amount of memory q CPU overheads – With our scheme, the CPU usage is on average 10% higher than state-of-the-art solutions 13

Results: IRM α = 0.6, with popular items α = 1.2 10 0 1 1 R = 0.1 MRC Average error (MAEQ) R = 0.01 sample1 0.8 0.8 10 -1 R = 0.001 sample2 Miss ratio Miss ratio sample3 0.6 0.6 sample4 10 -2 MRC 0.4 0.4 sample1 10 -3 sample2 0.2 0.2 sample3 sample4 10 -4 0 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 0.6 0.8 1 1.2 0.6p Parameter α of the Zipf Cache size (num. of items) Cache size (num. of items) q Accurate for any size – MAEQ always below 1% 14

Results: IRM, sensitivity to B α = 0.6, with popular items 1 10 0 B = 0 (SHARDS adj ) Average error (MAEQ) 0.95 B = 32 10 -1 B = 64 0.9 Miss ratio B = 125 B = 250 MRC 0.85 10 -2 B = 500 B = 32 B = 64 0.8 B = 125 10 -3 B = 250 0.75 B = 500 0.7 10 -4 10 0 10 1 10 2 10 3 10 4 10 5 1.2 0.6p Cache size (num. of items) Parameter α of the Zipf q Error mainly where the curves join 15

Real-world traces q Traces from different sources, with different characteristics systor CDN 10 -1 10 -1 10 -2 10 -2 request freq. request freq. 10 -3 10 -3 α = 0.7 α = 1.3 10 -4 10 -4 α = 1.1 10 -5 10 -5 10 -6 10 -6 10 -7 10 -7 10 0 10 1 10 2 10 3 10 4 10 5 10 0 10 1 10 2 10 3 10 4 10 5 Item ID Item ID 16

Real-world traces: results systor systor 10 0 1 1 SHARDS adj Average error (MAEQ) our mixed approach 0.8 0.8 10 -1 Miss ratio Miss ratio 0.6 0.6 MRC MRC 10 -2 0.4 0.4 sample1 sample1 sample2 sample2 0.2 0.2 sample3 sample3 10 -3 sample4 sample4 0 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 -4 Cache size (num. of items) Cache size (num. of items) ms-exms-dev systor CDN fiu CDN CDN 1 1 Trace ID 0.8 0.8 Miss ratio Miss ratio 0.6 0.6 MRC MRC 0.4 0.4 sample1 sample1 sample2 sample2 0.2 0.2 sample3 sample3 sample4 sample4 0 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 0 10 1 10 2 10 3 10 4 10 5 10 6 Cache size (num. of items) Cache size (num. of items) 17

Extension (1/2) q “Non-stack” algorithms – Eviction policies that do not satisfy the inclusion property – Need to compute the MRC by points – Our approach: • use high R with small cache sizes, decrease R as the cache size increases 18

Extension (2/2) q Heterogenous item size – Web caches deal with items with heterogenous size – Can we build the MRC in such a case? à Order statistics tree – Does sampling work in this case? CDN, het. sizes CDN, het. sizes 1 1 0.8 0.8 Miss ratio Miss ratio 0.6 0.6 MRC MRC 0.4 0.4 sample1 sample1 sample2 sample2 0.2 0.2 sample3 sample3 sample4 sample4 0 0 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 0 10 1 10 2 10 3 10 4 10 5 10 6 19 Cache size (MB) Cache size (MB)

Conclusions and perspectives q Build a MRC from samples requires a careful design – Highly popular items have a significant impact – Instead of the tail of the distribution, the head is important q In our approach we combine an exact MRC with an approximate one – Improving the accuracy of the final result q Future works – Online adaptation of the scheme parameters 20

Thanks! For any question, you can reach me at

Efficient Miss Ratio Curve Computation for Heterogeneous Content - PowerPoint PPT Presentation

Efficient Miss Ratio Curve Computation for Heterogeneous Content Popularity Damiano Carra Giovanni Neglia Computer Science Dept. Universit Cte dAzur University of Verona Inria Verona, Italy Sophia Antipolis, France Context q Caches

Curve Curve Ninjas December 19, 2012 Curve Ninjas Curve Overview Using Curve Implementation

Welcome! Introductions Year 1 Year 2 Miss Fowler Miss Mackintosh Mrs McEwan Miss Williams

Elliptic Curve Cryptography Applications of Elliptic Curve Cryptography Elliptic Curve

Header Header Supported by: Miss Elwell Mrs Barlow Miss Kent Miss Abrahams 7RX 7DX 7EX

Welcome to the Year 5 Curriculum Meeting Miss Ainsley and Miss Keen Adults in Year 5 Miss

THE GOLDEN RATIO AND THE FIBONACCI NUMBERS Common Measures 1 foot 2 feet 3 feet 3 2 Ratio

Bending the Cost Curve and Improving Bending the Cost Curve and Improving Bending the Cost Curve

Welcome Year 2 Parents Two Year 2 classes: Chive Class Miss Adams & Miss Angela Basil

Introductions Miss Tankaria (3NT) Miss Bunting (3SB) Miss Darkes (3DD)

GCSE DRAMA HEAD OF PERFORMING ARTS MISS PARKER GCSE DRAMA TEACHERS: MISS PARKER AND MISS

P4 & 5 Meet-The-Parents 2018 P4 Form Teachers Class Form Teachers Miss Sharon Sin 4

Improving Cache Performance AMAT: Average Memory Access Time AMAT = T hit + Miss Rate x Miss

Ope ratio ns Ope ratio ns Wo rksho p Wo rksho p 2005 2005 USCG Auxiliary Ope ratio ns De

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Local Analysis of 2D Curve Patches Local Analysis of 2D Curve Patches Topic 4.2: Topic 4.2:

Maths Summary Intro: Zero coupon yield curve (Ex 13) TV concept Yield curve

Efficient MRC Construc0on with SHARDS Carl Waldspurger

Adaptive Wavelet Methods for the Efficient Approximation of Images Gerlind Plonka Institute for

Constraint Programming Sudoku, Schach und Stundenplanung Thom Fr uhwirth Faculty of

Grid Enabled Neurosurgical Grid Enabled Neurosurgical Imaging Using Simulation g g g

ERO Enterprise Performance Metrics Metric 1: Reliability Results Measure Determine

Counting Triangles and Modeling MapReduce Siddharth Suri Yahoo! Research Outline 2 Modeling

Tutorials 1) Kinked Helix at Low Resolution Download DireX and Tutorial files from: Simple toy

INTRO TO JQUERY JAVASCRIPT MADE MORE ACCESSIBLE Created by Brian Duffey WHO AM I? Brian Duffey

Efficient Miss Ratio Curve Computation for Heterogeneous Content - PowerPoint PPT Presentation

Efficient Miss Ratio Curve Computation for Heterogeneous Content Popularity Damiano Carra Giovanni Neglia Computer Science Dept. Universit Cte dAzur University of Verona Inria Verona, Italy Sophia Antipolis, France Context q Caches

Curve Curve Ninjas December 19, 2012 Curve Ninjas Curve Overview Using Curve Implementation

Welcome! Introductions Year 1 Year 2 Miss Fowler Miss Mackintosh Mrs McEwan Miss Williams

Elliptic Curve Cryptography Applications of Elliptic Curve Cryptography Elliptic Curve

Header Header Supported by: Miss Elwell Mrs Barlow Miss Kent Miss Abrahams 7RX 7DX 7EX

Welcome to the Year 5 Curriculum Meeting Miss Ainsley and Miss Keen Adults in Year 5 Miss

THE GOLDEN RATIO AND THE FIBONACCI NUMBERS Common Measures 1 foot 2 feet 3 feet 3 2 Ratio

Bending the Cost Curve and Improving Bending the Cost Curve and Improving Bending the Cost Curve

Welcome Year 2 Parents Two Year 2 classes: Chive Class Miss Adams &amp; Miss Angela Basil

Introductions Miss Tankaria (3NT) Miss Bunting (3SB) Miss Darkes (3DD)

GCSE DRAMA HEAD OF PERFORMING ARTS MISS PARKER GCSE DRAMA TEACHERS: MISS PARKER AND MISS

P4 &amp; 5 Meet-The-Parents 2018 P4 Form Teachers Class Form Teachers Miss Sharon Sin 4

Improving Cache Performance AMAT: Average Memory Access Time AMAT = T hit + Miss Rate x Miss

Ope ratio ns Ope ratio ns Wo rksho p Wo rksho p 2005 2005 USCG Auxiliary Ope ratio ns De

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Local Analysis of 2D Curve Patches Local Analysis of 2D Curve Patches Topic 4.2: Topic 4.2:

Maths Summary Intro: Zero coupon yield curve (Ex 13) TV concept Yield curve

Efficient MRC Construc0on with SHARDS Carl Waldspurger

Adaptive Wavelet Methods for the Efficient Approximation of Images Gerlind Plonka Institute for

Constraint Programming Sudoku, Schach und Stundenplanung Thom Fr uhwirth Faculty of

Grid Enabled Neurosurgical Grid Enabled Neurosurgical Imaging Using Simulation g g g

ERO Enterprise Performance Metrics Metric 1: Reliability Results Measure Determine

Counting Triangles and Modeling MapReduce Siddharth Suri Yahoo! Research Outline 2 Modeling

Tutorials 1) Kinked Helix at Low Resolution Download DireX and Tutorial files from: Simple toy

INTRO TO JQUERY JAVASCRIPT MADE MORE ACCESSIBLE Created by Brian Duffey WHO AM I? Brian Duffey

Welcome Year 2 Parents Two Year 2 classes: Chive Class Miss Adams & Miss Angela Basil

P4 & 5 Meet-The-Parents 2018 P4 Form Teachers Class Form Teachers Miss Sharon Sin 4