Capacity Planning and Headroom Analysis for Taming Database - PowerPoint PPT Presentation

Capacity Planning and Headroom Analysis for Taming Database Replication Latency - Experiences with LinkedIn Internet Traffic Zhenyun Zhuang , Haricharan Ramachandra, Cuong Tran, Subbu Subramaniam, Chavdar Botev, Chaoyue Xiong, Badri Sridharan zhenyun@gmail.com LinkedIn Corp. 1

Outlines } Introduction } Problem definition } Observations of LinkedIn Internet traffic } Solutions } Evaluation 2

Introduction - Database replication } Why replicating database events? } Source database protection } Inter-datacenter synchronization } Dataflow } Source database (Espresso database) } Database replication component (Databus) } Clients (Downstream products) Database Replication Web Source Events Downstream User Internet Database Updates pages Database Replicator Consumers Traffic Events 3

Introduction – Capacity planning } Importance } Determine SLA } Capacity planning (e.g., cluster size, replication capacity) } Reduce operation cost } Questions in capacity planning } Future traffic rate forecasting } Replication latency prediction } Replication capacity determination } Replication headroom determination } SLA determination 4

Problem Definition - Terminology } Replication latency } Time difference between: } The event is inserted into source database } The event (after replication) is ready for downstream consumption } Replication SLA } Service level agreements } E.g., Largest replication latency < 60 seconds } Incoming traffic rate } Number of incoming web events per second } Replication capacity } Number of events processed by replication component per second } Aka, Relay Capacity 5

Problem Definition } Forecast future traffic rate } Given historical traffic rate of T i,j , what is the future rate? } Determine the replication latency } Given the traffic rate of T i,j and relay capacity of R i,j , what is the replication latency L i,j ? } Determine SLA } What is the largest replication latency? P99 value? } Determine required replication capacity } Given SLA of L sla and traffic rate of T i,j , what is the required replay capacity of R i,j ? } Determine replication headroom } Given L sla and R i,j , what is highest traffic rate T i,j it can sustain? } What is the expected data of d k of that traffic rate? 6

Observations of LinkedIn Internet traffic } A weekday traffic across time } Weekday vs weekend } Traffic volume is growing 7

Observations of LinkedIn Internet traffic } Strong periodical patterns at day, week, month level 8

Design – Forecasting future traffic } Two models } Time series model (ARIMA) } Regression analysis model } Challenges } Goal: forecast per-hour (or per-minute, per-second) rate } ARIMA: not suitable for long period seasonality (e.g., 168 ) } Regression analysis: works well on weekly (or monthly) traffic } Two step approach } Forecasting future Daily / weekly traffic } Both ARIMA and Regression analysis } Converting daily/weekly traffic to hourly traffic } Seasonal index (hourly) 9

Design – Seasonal Index 10

Design – Forecasting with ARIMA } ARIMA(p,d,q) } P=7, d=1, q=0 } Historical traffic is aggregated on a daily/weekly basis } E.g., 42 days or 6 weeks } Forecasting into daily/weekly traffic } E.g., 21 days or 3 weeks } Computing hourly seasonal index } Totally 168 values (for a week) } Converting daily traffic to hourly traffic 11

Design – Forecasting with Regression Analysis } Linear fitting } Y = a W + b } Traffic is aggregated on a weekly basis } E.g., 6 weeks } Forecasting into weekly traffic } E.g., 3 weeks } Using hourly seasonal index } Totally 168 values (for a week) } Converting weekly traffic to hourly traffic 12

Design – Predicting replication latency } Iterating each hour of a day } Starting from the lowest traffic rate } If traffic rate > relay capacity: Accumulated latency } If traffic rate < relay capacity: Decreased latency 13

Design – Determining replication capacity } Input: } SLA and Traffic rate } Output: } Required replication capacity } Binary searching } Starting with a (very) small capacity and a (very) large capacity } Get the middle capacity, determine the corresponding replication latency } Reset small or large capacity 14

Evaluation - Forecasting } Regression Analysis and ARIMA } Forecasted traffic rates have similar accuracies } Reasons } Little dependency between neighboring data points (hourly) } Regression analysis works on weekly data, even less dependency 15

Evaluation – Determining replication latency } Methodology } Choosing the busiest server; Reset offset } Comparing the calculated relay lag } Shape is almost identical; peak value is 1.6X (376 vs 240 sec) 16

Evaluation - Others } Replication capacity determination } Traffic rate of 2386 event/s; SLA 60 seconds } Takes 12 steps to get capacity of 3374 event/s } Replication headroom determination } Capacity of 5000 event/s; SLA 60 seconds } Takes 9 steps to find it can sustain 8000 event/s traffic rate } Or taking 13 months to reach } SLA determination } Capacity of 6000 event/s } Finds the maximum replication latency of 1135 seconds } P99 of replication latency is 850 seconds 17

Thanks! } Questions ? } zhenyun@gmail.com 18

Capacity Planning and Headroom Analysis for Taming Database - PowerPoint PPT Presentation

Capacity Planning and Headroom Analysis for Taming Database Replication Latency - Experiences with LinkedIn Internet Traffic Zhenyun Zhuang , Haricharan Ramachandra, Cuong Tran, Subbu Subramaniam, Chavdar Botev, Chaoyue Xiong, Badri Sridharan

Chapter 4 Planning Capacity Capacity Strategies Determining Capacity Requirements

A FRAMEWORK FOR CAPACITY ANALYSIS D E B B I E S H E E T Z P R I N C I P A L C O N S U L T A N

Software Tool Seminar WS1516 - Taming the Snake November 4, 2015 1 Taming the Snake 1.1

The Illusion of Capacity in Central Planning The Challenge of Incorporating the Complexity of

Automatic Differentiation: History and Headroom Barak A. Pearlmutter Department of Computer

Elementary School Program Capacity Planning Team Facilities Planning, Design & Construction

T&TA Planning and Resources NASCSP provides research, analysis, training and technical

Overview Background to Planning for Growth Development Capacity modelling outcomes for

powering, CMS upgrade phase 2 Agathe Nidriche 26/07/2019 Agathe Nidriche Optimization for the

TOR of Capacity Building Working Group A needs analysis at Municipal Level (Capacity Building

Linux Systems Capacity Planning Rodrigo Campos camposr@gmail.com - @xinu USENIX LISA 11 -

IT-Capacity Analysis and Forecasting p y y g with KNIME and R Markus Schmid Markus

Taming Pointers A Symbolic Approach Jianwen Zhu jzhu@eecg.toronto.edu Electrical and Computer

Taming Effects in a Dependent World Pierre-Marie Pdrot Max Planck Institute for Software

Long Range Facility Planning Long Range Facility Planning Phase 1 Facility recommendation to

Taming Your Dragon: From No QA to Fully Integrated QA

Further Properties of Wireless Channel Capacity Fengyou Sun and Yuming Jiang Norwegian University

Capacity Cost Allocation Analysis Work Group Meeting #2 Derek Olmstead, Economic Advisor October

UK AND IRISH SHORT SEA FREIGHT RORO AND LOLO MARKETS Analysis of Capacity developments 14 th

Utility Engineering & Operations TOPICS Existing Sanitary Sewer Capacity Capacity for

Methodology Adapted from Menasc & Almeida. 1 Learning Objectives Discuss the concept

Planning for the Future Jekyll Island Carrying Capacity & Infrastructure Assessment OUR

WELCOME! Agenda Recap Define Consensus Bonding Capacity Building Project Costs 101

TAMING NG T THE C CAVEMAN: STRESS MANAGEMENT FOR THE NEW AGE Diana F. Hott, LCSW CEAP

Capacity Planning and Headroom Analysis for Taming Database - PowerPoint PPT Presentation

Capacity Planning and Headroom Analysis for Taming Database Replication Latency - Experiences with LinkedIn Internet Traffic Zhenyun Zhuang , Haricharan Ramachandra, Cuong Tran, Subbu Subramaniam, Chavdar Botev, Chaoyue Xiong, Badri Sridharan

Chapter 4 Planning Capacity Capacity Strategies Determining Capacity Requirements

A FRAMEWORK FOR CAPACITY ANALYSIS D E B B I E S H E E T Z P R I N C I P A L C O N S U L T A N

Software Tool Seminar WS1516 - Taming the Snake November 4, 2015 1 Taming the Snake 1.1

The Illusion of Capacity in Central Planning The Challenge of Incorporating the Complexity of

Automatic Differentiation: History and Headroom Barak A. Pearlmutter Department of Computer

Elementary School Program Capacity Planning Team Facilities Planning, Design &amp; Construction

T&amp;TA Planning and Resources NASCSP provides research, analysis, training and technical

Overview Background to Planning for Growth Development Capacity modelling outcomes for

powering, CMS upgrade phase 2 Agathe Nidriche 26/07/2019 Agathe Nidriche Optimization for the

TOR of Capacity Building Working Group A needs analysis at Municipal Level (Capacity Building

Linux Systems Capacity Planning Rodrigo Campos camposr@gmail.com - @xinu USENIX LISA 11 -

IT-Capacity Analysis and Forecasting p y y g with KNIME and R Markus Schmid Markus

Taming Pointers A Symbolic Approach Jianwen Zhu jzhu@eecg.toronto.edu Electrical and Computer

Taming Effects in a Dependent World Pierre-Marie Pdrot Max Planck Institute for Software

Long Range Facility Planning Long Range Facility Planning Phase 1 Facility recommendation to

Taming Your Dragon: From No QA to Fully Integrated QA

Further Properties of Wireless Channel Capacity Fengyou Sun and Yuming Jiang Norwegian University

Capacity Cost Allocation Analysis Work Group Meeting #2 Derek Olmstead, Economic Advisor October

UK AND IRISH SHORT SEA FREIGHT RORO AND LOLO MARKETS Analysis of Capacity developments 14 th

Utility Engineering &amp; Operations TOPICS Existing Sanitary Sewer Capacity Capacity for

Methodology Adapted from Menasc &amp; Almeida. 1 Learning Objectives Discuss the concept

Planning for the Future Jekyll Island Carrying Capacity &amp; Infrastructure Assessment OUR

WELCOME! Agenda Recap Define Consensus Bonding Capacity Building Project Costs 101

TAMING NG T THE C CAVEMAN: STRESS MANAGEMENT FOR THE NEW AGE Diana F. Hott, LCSW CEAP

Elementary School Program Capacity Planning Team Facilities Planning, Design & Construction

T&TA Planning and Resources NASCSP provides research, analysis, training and technical

Utility Engineering & Operations TOPICS Existing Sanitary Sewer Capacity Capacity for

Methodology Adapted from Menasc & Almeida. 1 Learning Objectives Discuss the concept

Planning for the Future Jekyll Island Carrying Capacity & Infrastructure Assessment OUR