Overload Control for Scaling WeChat Microservices WeChat The new - PowerPoint PPT Presentation

Feb 07, 2024 •523 likes •675 views

Overload Control for Scaling WeChat Microservices WeChat The new way to connect Chat Moments Contacts Search Pay 1 Billion monthly active users WeChats Microservice Architecture Service DAG Vertex: a distinct service; Edge: call

Overload Control for Scaling WeChat Microservices
WeChat The new way to connect Chat Moments Contacts Search Pay 1 Billion monthly active users
WeChat’s Microservice Architecture • Service DAG – Vertex: a distinct service; Edge: call path – Basic service : out-degree = 0 – Leap service : out-degree ≠ 0 o Entry service : in-degree = 0
Dealing with Overload • It’s usually hard to estimate the dynamics of workload during the development of microservices. Subsequent Overload How about random load shedding?
Dynamic Workload Relative Statistics of WeChat Service Requests
DAGOR • Overload detection • Service admission control • Requirements – Service agnostic o Benefit the ever evolving microservice system o Decouple overload control from the business logic of services – Independent but collaborative o Decentralized overload control o Service-oriented collaboration among nodes – Efficient and fair o Sustain best-effort success rate of service when load shedding becomes inevitable o Bias-free overload control
Overload Detection • Load indicator of a node: Queuing time – Rationale: to manage queue length for SLA • Why not response time? • Why not CPU utilization?
Service Admission Control Shuffling on an hourly basis Exploit histogram for real-time adjustment Static
DAGOR Workflow Service agnostic Independent but collaborative Efficient and fair Collaborative Admission Control
Overload Detection Queuing Time vs. Response Time
Scalability Overload Control Overload Control with Different Types of Workload with Increasing Workload (M 2 ) Optimal Success Rate = 𝒈 𝒕𝒃𝒖 𝒈
Fairness CoDel DAGOR
Takeaways: DAGOR Design Principles 1. Must be decentralized and autonomous in each service/node – Essential for the overload control framework to scale with the ever evolving microservice system 2. Employ feedback mechanism for adaptive load shedding – Essential for adjusting thresholds automatically 3. Prioritize user experience
Thank You ALL!

Recommend

Margins Overload = Balance Margins is the gap between overload and your limits. Overload

Margins Overload = Balance Margins is the gap between overload and your limits. Overload happens when you do not respect those limits. Balance is the humility to know, acknowledge, and accept you have limits. The Myths of Well being

653 views • 51 slides

Outline Scaling Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large

Scaling Outline Scaling Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large Scaling-at-large Principles of Complex Systems Allometry Allometry Definitions Definitions Course 300, Fall, 2008 Examples Examples

568 views • 27 slides

UP UP AND OUT: SCALING SOFTWARE WITH AKKA Jonas Bonr CTO Typesafe @jboner Scaling software

UP UP AND OUT: SCALING SOFTWARE WITH AKKA Jonas Bonr CTO Typesafe @jboner Scaling software with Jonas Bonr CTO Typesafe @jboner Scaling Scaling software with software with Scaling Scaling software with software with Akka

1.98k views • 174 slides

Operator Overload Ch 11.1 Highlights - operator overload Basic point class Suppose we wanted

Operator Overload Ch 11.1 Highlights - operator overload Basic point class Suppose we wanted to make a simple class to represent an (x,y) coordinate point (See: pointClass.cpp) Basic point class Now let's extend the class and make a

467 views • 11 slides

Analysis of Scaling Algorithms for Matrix & Operator Scaling Contents Scaling Algorithms

Rafael Oliveira University of Toronto Analysis of Scaling Algorithms for Matrix & Operator Scaling Contents Scaling Algorithms Three Step Analysis Generalization One More Application of Scaling Non-Negative Matrices &

604 views • 15 slides

Diameter Overload Control Jus3fica3on and Use Cases Mar3n Dolly

Diameter Overload Control Jus3fica3on and Use Cases Mar3n Dolly AT&T Labs Reason for Overload (Use Cases) Inadequate capacity Network element

396 views • 5 slides

A Mechanism for Session Initiation Protocol (SIP) Avalanche Restart Overload Control

A Mechanism for Session Initiation Protocol (SIP) Avalanche Restart Overload Control draft-shen-soc-avalanche-restart-overload-00 Charles Shen and Henning Schulzrinne, Columbia University Arata Koike, NTT IETF 80, Prague, Czech Republic

443 views • 10 slides

Effectively Scaling Effectively Scaling up/universalizing exclusive up/universalizing exclusive

Effectively Scaling Effectively Scaling up/universalizing exclusive up/universalizing exclusive breastfeeding breastfeeding Creating Distt. Level Model Creating Distt. Level Model Effectively scaling up /universalizing Effectively scaling

574 views • 12 slides

Scaling From simple models to rich strategies PPPLab Day, November 30th Scaling: recent

Scaling From simple models to rich strategies PPPLab Day, November 30th Scaling: recent publications Insight Series Tool Animation The rationale for scaling As the SDGs require transformational change, scaling can provide: Reaching more

609 views • 37 slides

Outline Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large Principles of

Scaling Scaling Outline Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large Principles of Complex Systems Allometry Allometry CSYS/MATH 300, Fall, 2010 Definitions Definitions Examples Examples History: Metabolism

251 views • 20 slides

Science is in trouble Information overload Built-in bias Reproducibility issues Access issues

Science is in trouble Information overload Built-in bias Reproducibility issues Access issues Incentives Iris.ai is helping Information overload Built-in bias Reproducibility issues Access issues Incentives Iris.ai 4.0 works with the

381 views • 16 slides

Conformal Finite Size Scaling of Conformal Finite Size Scaling of Flavors Chik Him Wong Twelve

Conformal Finite Size Scaling of Twelve Fermion Conformal Finite Size Scaling of Conformal Finite Size Scaling of Flavors Chik Him Wong Twelve Fermion Flavors Twelve Fermion Flavors Outline Background Conformality Controversy Simulation

955 views • 58 slides

Chapter 11: Scaling and Round-off Noise Keshab K. Parhi Outline Introduction Scaling

Chapter 11: Scaling and Round-off Noise Keshab K. Parhi Outline Introduction Scaling and Round-off Noise State Variable Description of Digital Filters Scaling and Round-off Noise Computation Round-off Noise Computation

577 views • 46 slides

So#ware Scaling Mo/va/on & Goals HW Configura/on & Scale Out So#ware Scaling

So#ware Scaling Mo/va/on & Goals HW Configura/on & Scale Out So#ware Scaling Efforts System management Opera/ng system Programming environment PreAcceptance Work HW stabiliza/on & early scaling

390 views • 19 slides

ADAPTIVE RADIO OUTPUT SCALING FOR POWER AND BANDWIDTH SAVING Koen Zandberg 1 ADAPTIVE RADIO

ADAPTIVE RADIO OUTPUT SCALING FOR POWER AND BANDWIDTH SAVING Koen Zandberg 1 ADAPTIVE RADIO OUTPUT SCALING FOR POWER AND BANDWIDTH SAVING 2 ADAPTIVE RADIO OUTPUT SCALING FOR POWER AND BANDWIDTH SAVING 3 ADAPTIVE RADIO OUTPUT SCALING

418 views • 27 slides

Industrial Robots Industrial Robots Control Control Part 1 Control Control Part 1 Part 1

Industrial Robots Industrial Robots Control Control Part 1 Control Control Part 1 Part 1 Part 1 Introduction to robot control The motion control problem motion control problem consists in the design of control algorithms for the robot

822 views • 46 slides

Vulnerability Analysis Of Optimal Power Flow Problem Under Cyber-Physical Security Attacks

Vulnerability Analysis Of Optimal Power Flow Problem Under Cyber-Physical Security Attacks Devendra Shelar and Saurabh Amin Massachusetts Institute of Technology INFORMS November 15 th , 2016 Vulnerable Electricity Networks: Key issues Two

391 views • 7 slides

A MAZON S3 Simple storage service Launched: March 14, 2006 Simple key/value storage

A MAZON S3: A RCHITECTING FOR R ESILIENCY IN THE F ACE OF M ASSIVE L OAD Jason McHugh S ETTING THE S TAGE Architecting for Resiliency in the Face of Massive Load Resiliency > High availability Massive load 1. Many requests 2.

845 views • 46 slides

Load Shedding in Network Monitoring Applications . Barlet-Ros 1 G. Iannaccone 2 J. Sanjus-Cuxart

Introduction Prediction Method Load Shedding Evaluation Conclusions Load Shedding in Network Monitoring Applications . Barlet-Ros 1 G. Iannaccone 2 J. Sanjus-Cuxart 1 P D. Amores-Lpez 1 J. Sol-Pareta 1 1 Technical University of

374 views • 34 slides

Instruction encoding The ISA defines The format of an instruction (syntax) The

Instruction encoding The ISA defines The format of an instruction (syntax) The meaning of the instruction (semantics) Format = Encoding Each instruction format has various fields Opcode field gives the semantics (Add,

316 views • 8 slides

Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2012/13 Data Stream

Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2012/13 Data Stream Processing Todays Topic Stream Processing Model Issues System Issues Distributed Processing Issues Uni Freiburg, WS2012/13 Systems

705 views • 31 slides

Confusion in the land of the serverless Sam Newman Building Microservices DESIGNING FINE -

Confusion in the land of the serverless Sam Newman Building Microservices DESIGNING FINE - GRAINED SYSTEMS Sam Newman #gotoams @samnewman Sam Newman & Associates #gotoams @samnewman #gotoams @samnewman #gotoams @samnewman

1.38k views • 137 slides

Scripts for Sensor Network Seminar Data Management Section Lectured by George Kollios,

Scripts for Sensor Network Seminar Data Management Section Lectured by George Kollios, Scribed by Feifei Li Boston University Computer Science Department { gkollios,lifeifei } @cs.bu.edu Abstract In this section of the seminar, our focus

312 views • 3 slides

Real-Time Databases Meghan Russ Miriam Speert Pete Dempsey Sedat Behar Yevgeny Ioffe Zachi

Real-Time Databases Meghan Russ Miriam Speert Pete Dempsey Sedat Behar Yevgeny Ioffe Zachi Klopman Timeline 1:40 - 1:50: Introduction 1:50 - 3:00: Real-Time Databases/Scheduling 3:00 - 3:10: Break 3:10 - 4:00: Operator Scheduling in

1.12k views • 69 slides