CS244 Advanced Topics in Networking, Lecture 10: Buffer Sizing (Nick McKeown), lecture slides




SLIDE 1

Lecture 10: Buffer Sizing

Nick McKeown

CS244

Advanced Topics in Networking

Spring 2020

“Sizing Router Buffers”

[Appenzeller, et al. 2004]

SLIDE 2

Context

Guido Appenzeller

▪ At the time: CS PhD student
▪ Founded Big Switch Networks
▪ CTO at VMware for networking
▪ Sigcomm Test of Time Award, 2015


At the time of writing

▪ Challenging to build ISP routers with big buffers
▪ 80% of world’s SRAMs used for router and switch buffers/counters
▪ ISP routers sold with ~90% profit margin

SLIDE 3

Why should we care?

Background

▪ No universal agreement on how big a router buffer should be, or why
▪ Yet buffers are a major cause of variation in packet delay
▪ Big buffers require large, slow DRAM memories…
▪ …which complicate the design of large routers
▪ It would be nice if we could use single-chip switches and routers

SLIDE 4

Simple model of FCFS router buffer

[Figure: flows 1…N arriving into an FCFS buffer of size B, drained at line rate C]

Q: Why does the router have a buffer (or queue)?
Q: What factors determine the buffer’s size?

SLIDE 5

Simple model of a router

[Figure: router with inputs 1…N and outputs 1…N at line rate C, sharing a buffer of size B]

SLIDE 6

Simple Internet queueing model

SLIDE 7

Early Internet Models

Leonard Kleinrock, Professor at UCLA
▪ 1963: First theoretical study of packet switching using queueing theory
▪ 1969: First message sent over the Internet, from UCLA’s IMP (Interface Message Processor)

SLIDE 8


Example model of single packet queue

Poisson Traffic, fixed size packets

M/D/1 queue: Poisson arrivals at rate λ, fixed-size packets served at rate C, buffer of B packets.

Define load ρ = λ/C. Then: drop rate ≈ ρ^B

e.g. ρ = 0.8, B = 20 pkts → loss ≈ 1%

Observations:
▪ Packet drop rate is small, even with a modest buffer
▪ Independent of C, RTT, number of flows, etc.: only the load ρ and buffer B matter
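The slide’s loss estimate can be checked numerically. A minimal sketch, assuming the geometric approximation drop ≈ ρ^B from the slide (the function name is ours; this is the slide’s approximation, not an exact M/D/1 formula):

```python
# Tail-drop estimate for a single packet queue with B packet buffers.
# Slide's approximation: drop rate ~ rho^B, where rho = lambda / C is the load.

def drop_rate(rho: float, buffer_pkts: int) -> float:
    """Geometric estimate of the packet drop rate."""
    return rho ** buffer_pkts

# Slide's example: rho = 0.8, B = 20 pkts -> loss of roughly 1%
print(f"loss ~ {drop_rate(0.8, 20):.4f}")  # ~0.0115
```

Note how fast the estimate falls with B: doubling the buffer squares the (sub-unity) loss rate.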

SLIDE 9

Q: How well does this model fit today’s Internet?

The model assumes traffic is generated “open loop” by the source. Today’s Internet carries mostly TCP traffic, which uses a closed-loop congestion control algorithm.

SLIDE 10

[Figure: many TCP flows from A to B through a router buffer; ingress link rates > C, bottleneck egress link rate = C]

Observations:
▪ An arriving packet sees a usually-full queue
▪ RTT is pretty much constant (very different from the single-flow case)
▪ Drops are random events

With multiple flows, RTT is less variable

SLIDE 11

One flow vs. multiple flows

▪ One flow: buffer occupancy and RTT track the cwnd sawtooth, so Throughput = W(t)/RTT(t) = constant (100%)
▪ Multiple flows: RTT is roughly constant, so each flow’s Throughput = W(t)/RTT ∝ W(t)

[Figure: cwnd sawtooth between Wmax/2 and Wmax, with buffer occupancy and RTT over time; one flow vs. one of many flows (zoomed in)]

SLIDE 12

Geometric intuition for the throughput equation

[Figure: cwnd sawtooth over time; a drop at W2 halves the window to W1 = W2/2; areas A1, A2, A3 under successive sawtooth cycles, each cycle lasting (W2 − W1) RTTs]

Area under one cycle (segments sent between drops):
A = (W2 − W1)·W1 + ½(W2 − W1)² = (3/2)(W2 − W1)²   (since W1 = W2/2)

But E[W1] = ½·E[W2]. Therefore the expected area is E[A] = (3/8)·E[W]².

Throughput over one cycle:
T = A / ((W2 − W1)·RTT) = (3/2)(W2 − W1) / RTT
so in expectation E[T] = (3/4)·E[W] / RTT   segments/sec.

Drop rate: one drop per cycle, so p ≈ 1/E[A] = 1 / ((3/8)·E[W]²).

Combining: E[T] = k / (√p · RTT)   segments/sec, with k = √(3/2).

Note: To convert from packets/second to bits/second, multiply throughput by the packet size (e.g. MSS).

SLIDE 13

Interpreting the throughput equation

E[T] = k / (√p · RTT)   segments/sec

▪ If RTT doubles, throughput halves.
▪ If the packet drop rate p increases from 1% to 4%, throughput halves.
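Both bullets follow directly from the form of the equation. A quick numeric check, using k = √(3/2), the constant implied by the sawtooth derivation (the ratios below do not depend on k):

```python
import math

def tcp_throughput(rtt_s: float, p: float) -> float:
    """E[T] = k / (sqrt(p) * RTT) in segments/sec, with k = sqrt(3/2)."""
    return math.sqrt(1.5) / (math.sqrt(p) * rtt_s)

base = tcp_throughput(rtt_s=0.1, p=0.01)
print(tcp_throughput(0.2, 0.01) / base)  # 0.5: doubling RTT halves throughput
print(tcp_throughput(0.1, 0.04) / base)  # 0.5: 1% -> 4% drop rate halves it
```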

SLIDE 14

One reason to care about buffer size

With on-chip buffers we can build higher capacity switch ASICs

SLIDE 15

Switch Chips are Limited by Serial I/O Capacity to the outside world

[Figure: single-chip switch ASIC with ports 1…N at rate R, vs. a switch ASIC attached to external DRAM]

▪ Switch capacity C = N × R, e.g. 12.8 Tb/s = 128 × 100 Gb/s
▪ Single-chip switch ASIC (I/O capacity C): small on-chip buffering, e.g. 64 Mbytes
▪ Switch ASIC with external memory (usable I/O capacity C/2 = N/2 × R): large off-chip buffering, e.g. 8 Gbytes of DRAM
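The capacity penalty of external memory can be made concrete. A sketch, assuming a fixed serial-I/O budget per chip; `dram_share` is a hypothetical parameter of ours for the fraction of that budget consumed by reading and writing external packet memory:

```python
def port_capacity_tbps(serial_io_tbps: float, dram_share: float) -> float:
    """Front-panel capacity left after devoting part of the serial I/O to DRAM."""
    return serial_io_tbps * (1.0 - dram_share)

# Single-chip switch: all 12.8 Tb/s of serial I/O faces the network
# (e.g. 128 ports x 100 Gb/s), with only small on-chip buffers.
print(port_capacity_tbps(12.8, dram_share=0.0))  # 12.8 Tb/s

# With external DRAM, every packet is also written to and read from memory,
# so half the serial I/O serves the DRAM and only C/2 remains for ports.
print(port_capacity_tbps(12.8, dram_share=0.5))  # 6.4 Tb/s
```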

SLIDE 16

Switch Chips are Limited by Serial I/O Capacity

[Figure: ASIC with full I/O capacity C facing the network, vs. ASIC whose DRAM consumes half its I/O, leaving C/2]

SLIDE 17

How many switch chips with capacity C/2 do we need to make a router with capacity C?

SLIDE 18

We need 6 ASICs with capacity C/2

[Figure: four line ASICs, each with I/O capacity C/2 and C/4 of it devoted to DRAM, interconnected by two fabric ASICs (also C/2, with DRAM) over C/8 links. Total router capacity: C = N × R]

SLIDE 19

It is worth understanding where and when small on-chip buffers suffice

SLIDE 20

A brief history of buffer size

SLIDE 21

1988 → 2019

▪ 1988: “Congestion Avoidance and Control”, VJ & MK: B = 2T × C
▪ 1994: “High Performance TCP in ANSNET”, CV & CS

[Figure: source A, destination B; bottleneck link of rate C with buffer B; round-trip propagation delay 2T]

Max RTT = 2T + B/C = 4T;  Min RTT = 2T
“Buffer size should equal the bandwidth-delay product”

SLIDE 22

A B C B 2T

cwnd time

^ 𝑋 2 ^ 𝑋 cwnd time

t

SLIDE 23

[Figure: same topology; cwnd sawtooth between Ŵ/2 and Ŵ]

While the queue builds, the RTT grows with it: 2T + 1/C, 2T + 2/C, 2T + 3/C, …

(1) At the peak, the sending rate just fills the pipe plus the buffer: C = Ŵ / (2T + B/C)
(2) Just after the window halves, the link must still be busy: C = (Ŵ/2) / 2T

Combining (1) and (2): B = 2T × C
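Solving the two rate equations for B gives the rule of thumb. A quick numeric sketch, with the link speed and RTT chosen to match the examples later in the deck:

```python
def rule_of_thumb_bits(two_t_ms: float, link_gbps: float) -> float:
    """Rule-of-thumb buffer B = 2T x C, in bits (2T in ms, C in Gb/s)."""
    return two_t_ms * link_gbps * 1e6  # ms * Gb/s = 1e6 bits

# 2T = 100 ms, C = 10 Gb/s  ->  B = 1 Gbit, i.e. 125 MB of packet memory
b = rule_of_thumb_bits(100, 10)
print(b / 1e9, "Gbit =", b / 8e6, "MB")  # 1.0 Gbit = 125.0 MB
```

125 MB is far more than the ~64 MB of on-chip SRAM a single switch ASIC can hold, which is why the rule of thumb forces off-chip DRAM.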

SLIDE 24

Time Evolution of a Single TCP Flow

B = 2T × C  vs.  B < 2T × C

[Figure: time evolution of cwnd, queue occupancy, and link utilization for the two cases; topology A → B with bottleneck C and delay 2T]

SLIDE 25

Single TCP New Reno flow: 100% Throughput

1. If Ŵ → Ŵ/2, then B ≥ 2T × C
   Example: 2T = 100 ms, C = 10 Gb/s → B ≥ 1 Gbit

2. If Ŵ → Ŵ/k, then B ≥ 2T(k − 1) × C
   Example: k = 1.5 → B ≥ 500 Mbits
   Example: k = 1.14 → B ≥ 140 Mbits

3. If k = 1 + a/2T, then B ≥ aC, i.e. if the end host knows 2T, the buffer size is independent of RTT
   Example: a = 5 ms → B ≥ 50 Mbits
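The cases collapse into one formula, B ≥ 2T(k − 1) × C, with k = 2 recovering the full bandwidth-delay product. A sketch reproducing the slide’s examples (units as above: 2T in ms, C in Gb/s; "W -> W/k" is the multiplicative decrease on loss):

```python
def required_buffer_mbits(two_t_ms: float, link_gbps: float, k: float) -> float:
    """Buffer for 100% throughput with decrease W -> W/k: B >= 2T(k-1) x C.
    Returns Mbit (ms * Gb/s = Mbit)."""
    return two_t_ms * (k - 1.0) * link_gbps

print(required_buffer_mbits(100, 10, k=2.0))   # 1000.0 Mbit (1 Gbit)
print(required_buffer_mbits(100, 10, k=1.5))   # 500.0 Mbit
print(required_buffer_mbits(100, 10, k=1.14))  # ~140 Mbit
```

A gentler decrease (k closer to 1) needs a much smaller buffer to keep the link busy, at the cost of slower reaction to congestion.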

SLIDE 26

1988 → 1994 → 2004 → 2020

▪ 1988: “Congestion Avoidance and Control”, VJ & MK: B = 2T × C
▪ 1994: “High Performance TCP in ANSNET”, CV & CS: Min RTT = 2T
▪ 2004: “Sizing Router Buffers”, GA, IK, NM: B ≥ (2T × C)/√N, where N is the number of long-lived flows

[Figure: topology A → B, bottleneck C, buffer B]

SLIDE 27


Synchronized Flows

▪ Aggregate window has same dynamics
▪ Therefore buffer occupancy has same dynamics
▪ Rule-of-thumb still holds

[Figure: synchronized cwnd sawtooths between Wmax/2 and Wmax; the aggregate follows the same sawtooth]

SLIDE 28


Many TCP Flows

[Figure: the aggregate window ΣW fluctuates around its mean; buffer size B ~ the width of its probability distribution]

SLIDE 29

Many AIMD flows: 100% Throughput

B ≥ (2T × C)/√N

Example: 2T = 100 ms, C = 10 Gb/s, N = 1 → B ≥ 1 Gbit
Example: 2T = 100 ms, C = 10 Gb/s, N = 10,000 → B ≥ 10 Mbit

Q: Do you think B ≥ (2T × C)/√N (i.e. B ∝ 1/√N) will hold for other congestion control algorithms…?
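Dividing the rule of thumb by √N, the examples above check out. A sketch, same units as before (2T in ms, C in Gb/s):

```python
import math

def small_buffer_mbits(two_t_ms: float, link_gbps: float, n_flows: int) -> float:
    """Sizing Router Buffers (2004): B >= 2T x C / sqrt(N), in Mbit."""
    return two_t_ms * link_gbps / math.sqrt(n_flows)

# 2T = 100 ms, C = 10 Gb/s
print(small_buffer_mbits(100, 10, n_flows=1))       # 1000.0 Mbit (1 Gbit)
print(small_buffer_mbits(100, 10, n_flows=10_000))  # 10.0 Mbit
```

With 10,000 desynchronized flows the requirement drops 100×, from 1 Gbit of DRAM to 10 Mbit, small enough for on-chip SRAM.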

SLIDE 30

You said

Nikhil Athreya

I think the assumptions made in the original paper are beginning to change. While they acknowledge that their work is mostly based off of TCP, they argue that single-packet sources (e.g. DNS) and constant-rate UDP sources (e.g. online games) can be modeled with short flows. However, a quick Google search (https://www.caida.org/research/traffic-analysis/tcpudpratio/) tells me that the amount of UDP flows either has surpassed TCP flows or is at least comparable to it. I’m unsure how this would affect the results in this paper; perhaps because the results for short flows show that average queue length is only dependent on link load and flow length, this shouldn’t be a problem, but I’d be interested to hear more about it.


SLIDE 31

You said

Kevin Baichoo
It seems like the backbone router vendors will disavow these smaller buffer sizes, as otherwise they can’t justify the prices for their equipment.

Esther Goldstein
Do buffer sizes today go by the rule-of-thumb equation, or has the equation been refined in a different way than what was proposed in the paper?

Isabel Victoria Papadimitriou
It seems that this is working under an assumption that TCP is TCP, and router manufacturers have to deal with that regarding buffer size. Why isn’t it the other way around (“this is the buffer size, make congestion control work”), or is it possible to think about some sort of joint optimization of buffer size and paradigm?


SLIDE 32

1988 → 1994 → 2004 → 2006 → 2020

▪ 1988: “Congestion Avoidance and Control”, VJ & MK: B = 2T × C
▪ 1994: “High Performance TCP in ANSNET”, CV & CS: Min RTT = 2T
▪ 2004: “Sizing Router Buffers”, GA, IK, NM
▪ 2006: “Routers with Very Small Buffers”, ME, YG, AG, NM, TR: B = O(log W)

Assumptions
1. Paced traffic
2. Link utilization < 80%

Consequences
Only 20-50 packet buffers.

[Figure: topology A → B, bottleneck C, buffer B]

SLIDE 33

1988 → 1994 → 2004 → 2006 → 2008 → 2020

▪ 1988: “Congestion Avoidance and Control”, VJ & MK: B = 2T × C
▪ 1994: “High Performance TCP in ANSNET”, CV & CS: Min RTT = 2T
▪ 2004: “Sizing Router Buffers”, GA, IK, NM
▪ 2006: “Routers with Very Small Buffers”, ME, YG, AG, NM, TR
▪ 2008: “Experimental Study of Router Buffers”, NB, YG, MG, NM, GS

[Figure: topology A → B, bottleneck C, buffer B]

SLIDE 34

Buffer Sizing Experiments

Small Buffers
▪ Stanford University dorm network
▪ University of Wisconsin
▪ Internet2
▪ Level 3 Communications

Tiny Buffers
▪ Internet2
▪ Sprint Advanced Technology Lab
▪ University of Toronto

SLIDE 35

Level 3 Communications Experiments

▪ High link utilization
▪ Long duration (about two weeks)
▪ Buffer sizes: 190 ms (250K packets), 10 ms (10K packets), 2.5 ms (2500 packets), 1 ms (1000 packets)
▪ Load balancing over 3 links (2.5 Gb/s each)

SLIDE 36

Drop vs. Load, Buffer = 190ms, 10ms

Max Util = 96%

SLIDE 37

Drop vs. Load, Buffer = 1ms

SLIDE 38

Internet2 Experiments


Neda Beheshti, 2010

SLIDE 39

Summary of throughput and buffer size

[Figure: throughput vs. buffer size. Throughput is 100% for B ≥ 2T × C (rule of thumb); it remains ≈100% down to B = (2T × C)/√N; with tiny buffers of O(log W), i.e. 50-100 packets, throughput drops to ~90%]

SLIDE 40

End.