steering the edgecast cdn
play

Steering the Edgecast CDN Marcel Flores 13 June 2018 - PowerPoint PPT Presentation

Steering the Edgecast CDN Marcel Flores 13 June 2018 Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or distribution of this material is not permitted to any unauthorized


  1. Steering the Edgecast CDN Marcel Flores 13 June 2018 Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  2. The Edgecast CDN | 2

  3. What does it do? ● The CDN moves content closer to end users. ● Reduces latency, increases capacity. Clients PoP Customer origin PoP | 3

  4. How does it work? ● Uses anycast for PoP Client selection. ● Relies on BGP to get to the BGP right PoP. The Internet MAGIC ● We get lots for free: ○ Network spreads load PoP more-or-less automatically. ○ We can achieve failover by PoP retracting announcements. | 4

  5. Anycast Challenges Client ● Overall, it can be unpredictable: PoP ○ No information on latency or load. BGP ● What’s going to happen if we The Internet MAGIC? change announcements? ● Can make many traditional Traffic PoP Engineering/Management problems hard. PoP | 5

  6. Pulling blocks can be destructive ● Breaks TCP connections ○ Unpredictable behavior for a period of time ● Long running downloads may get ruined ● But it can be even worse... | 6

  7. Pulling blocks can be destructive New Dallas Chicago York | 7

  8. A smoother way... CH1 CH1 CH2 DNS Magic | 8

  9. DNS To Different Anycast Announcements ● Edgecast has a few networks . ● Each has a (potentially) overlapping set of servers that it addresses. ○ Kind of like Microsoft’s FastRoute ● Using DNS can steer clients to a particular network. | 9

  10. Working with Humans Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 10 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  11. In the old days... ● An experienced Human would look at signals: ○ Current load at PoPs ○ Available capacity at PoPs ● Human would write and deploy a DNS rule to effect this change. | 11

  12. In the old days... Try it Deploy Measure | 12

  13. Challenges with humans... ● Humans make mistakes: ○ It’s hard to look at a lot of numbers at once, ○ As the CDN grows this gets more serious. ● Humans have to sleep sometimes! | 13

  14. How do we make a robot do it? Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 14 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  15. Heteractis DATA Network Information DNS Capacity Decision Action Voice Making Manager Usage Chat-Ops ??? | 15

  16. Wow that’s complicated. Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  17. Heteractis DATA Network Information DNS Capacity Decision Action Voice Making Manager Usage Chat-Ops ??? | 17

  18. Data Models ● Collect Many source of data. ○ Combine into meaningful representations. ○ Capacity, usage, the current state. ● Keep those up to date. | 18

  19. Decision Making Models D ● Considers a set of possible Complexity Model A actions (i.e. DNS rules). T ○ “For traffic A, send Z% to Red 1. Option A .99 A Capacity Model 2. Option B .89 Network” 3. Option C .74 1. Option A .96 ... ● Asks: According to each model, 2. Option B .85 what would happen if I did this? 3. Option C .74 ... ○ Each model generates a score. | 19

  20. Decision Making ● For each action... Capacity Model Complexity Model ○ Compute a weighted linear combination. 90% 10% ● Rank all the actions by combined score. ● Pick the action with the highest score. Final Score | 20

  21. Action Manager Try it ● Applies some of that delicate touch that a human would do: ○ Smooths actions out over several minutes. Deploy Measure ○ Prevents overlapping changes from firing at once. | 21

  22. Voice ● Interacts with humans: ○ Human gating mechanism: As a deployment strategy, a human says what is OK. ● Integrates with other systems ○ Slack, chatops, etc. | 22

  23. Making sure Humans Can understand. ● Why did it do that? ○ Can a human validate that it was a good idea? ● Can ask it questions: ○ What does Heteractis think about X? ○ Why did it just recommend Y? | 23

  24. Why didn’t we get really fancy? ● Humans need to feel good about why it’s making decisions. ● Each decision: ○ Can be made based on current data alone. ■ Easy debugging ○ Can be recreated based on snapshot. | 24

  25. Score Models: Capacity Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 25 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  26. Basic Idea : X ● For some proposed action, what would happen? ● Low Utilization: High Score ✔ ● High Utilization: Low Score | 26

  27. Predicting Load Client ● Because we are using PoP anycast, not totally clear what will happen. BGP ● So how do we know what The Internet MAGIC? score to give it? PoP PoP | 27

  28. Predicting Load ● Think about it like fluid: ○ Sum total amount to move. ○ Distribute it evenly over destinations. | 28

  29. Predicting Load On the other hand: ● If we know we already have X% at Blue, just scale it! At the end: we have an estimate % utilization at each PoP. | 29

  30. Computing Scores ● Use a logistic curve to smooth out the edges. ● Take a harmonic mean of all PoPs to test for outliers below. | 30

  31. Heteractis In Action Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 31 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  32. Customer X Throughput Hour Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 32 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  33. Customer X + Daily Traffic Throughput Hour Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 33 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  34. Heteractis’s Changes % Moved Hour Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 34 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  35. Red Network PoPs Throughput Hour Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 35 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  36. Blue Network PoPs Throughput Hour Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 36 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  37. PoP Capacity (%) PoP Capacity (%) Hour Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or | 37 distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

  38. Heteractis in Production ● Heteractis has been live for almost 2 years. ○ Making automatic traffic moves nearly daily. ● Moved from human gated to full auto. ○ Built confidence of the humans. ○ Widely used as a view into CDN health. ● Significantly reduced manual human interactions. | 38

  39. Heteractis ● We built an automated system for managing traffic. ● Implemented in a way that: ○ Provides visibility into decision making process. ○ Builds trust with humans. | 39

  40. Thank you. Confidential and proprietary materials for authorized Verizon personnel and outside agencies only. Use, disclosure or distribution of this material is not permitted to any unauthorized persons or third parties except by written agreement.

Recommend


More recommend