Sincronia: Near-Optimal Network Design for Coflows Shijin - PowerPoint PPT Presentation

Sincronia: Near-Optimal Network Design for Coflows Shijin Rajakrishnan Joint work with Saksham Agarwal Akshay Narayan Rachit Agarwal David Shmoys Amin Vahdat

The Flow Abstraction … FTP Email HTTP Traditional Applications : Care about performance of individual flows Good Match Optimized for Flow-level performance

Is Flow Still the Right Abstraction? … … FTP Email HTTP Distributed Applications : Traditional Applications : Care about performance Care about performance for a group of flows of individual flows Optimized for Flow-level performance

The Coflow abstraction Co llection of semantically related flows [Chowdhury & Stoica, 2012] Coflow 1 … Coflow 3 Coflow 2 … … … Allows applications to more precisely express their performance goals

Network and Coflow Model Egress ports • Big-switch model Ingress ports • Clairvoyant scheduler 1 DC Fabric 1’ Coflow details known at arrival time: ▪ ➢ Source-destination for each flow 2 2’ ➢ Size of each flow ➢ Coflow weight • Metric – coflow completion time: Ti me when all flows complete Goal : Minimize Average Weighted Coflow Completion Time (CCT)

Prior Results Impossibility Results • NP-hard • <2x approximation hard Runs on Systems/ Performance Work Starvation State-of-the-art Existing Theory Guarantees Conserving Avoiding Transport Varys Systems [SIGCOMM ‘14] On Scheduling Theory Coflows (4-apx) [IPCO ‘17] Practical, Near-Optimal Network Design for Coflows?

Sincronia: Two key results Guarantees 4-approximation for (weighted) average CCT Given a set of coflows and a “right” ordering, ANY per-flow rate allocation mechanism that is work-conserving & order-preserving produces average CCT within 4x of optimal • Per-flow rate allocation irrelevant • Transport layer agnostic

Sincronia – Near-Optimal Network Design Runs on Systems/ Performance Work Starvation Name Existing Theory Guarantees Conserving Avoiding Transport Systems Varys On Scheduling Theory Coflows (4-apx) Systems Sincronia (4-apx) Also outperforms state-of-the-art across evaluated workloads

Sincronia Design Ordered Flow Coflow Set of Priorities set of Scheduling ordering coflows on flows coflows • Priorities set from order • Algorithm – BSSI • Flows offloaded to transport layer ▪ Bottleneck, Select, Scale, Iterate ▪ SRPT-first style algorithm • No explicit per-flow rate allocation

Bottleneck-Select-Scale-Iterate (BSSI) • Find BOTTLENECK port 1 • SELECT (weighted) largest job 2 ▪ Ordered last 1’ 2’ • SCALE weights of remaining jobs • ITERATE on unscheduled jobs Ordering not important

BSSI in Action #packets = 4 1 • Bottleneck 1 1’ Size 3 Size Size Weight = 8 ൗ ൗ #packets = 8 Weight = 3 2 ൗ • Select Weight 1 Weight ← Weight ← Weight(1 – × (1 – Weight ) ) Size 4 ൗ #packets = 5 1’ ▪ Ordered Last Size Weight = 4 ൗ 2 2’ Size Size ൗ Weight = 4 ൗ Weight • Scale Weight ← Weight(1 – Weight ) #packets = 7 2’ Size ൗ Size Weight = 1 ൗ • Iterate Weights: Scale weight of each coflow Select coflow with Find port handling Iterate on largest size-to-weight ratio largest number of packets unscheduled coflows (at bottleneck port) Order:

End-to-End Design(Offline) BSSI Host 1 Host 2 1 1’ Order: 2 2’ Transport Transport • Each host knows ordering • Flows get priority of coflow • Offloads to priority enabled transport layer

Per-flow Rate Allocation is Irrelevant • Intuition: Sharing bandwidth does not help CCT • Order-preserving schedule : Flow blocked iff ingress or egress port serving higher-ordered flow Given the BSSI ordering, ANY per-flow rate allocation mechanism that is work conserving & order-preserving produces average CCT within 4x of optimal

Avoiding per-flow rate allocation: Implications • Implement on top of any transport layer ▪ E.g. pFabric, pHost, TCP • Design and implementation independent of ▪ Network Topology ▪ Location of Congestion ▪ Paths of Coflows Details in paper • More scalable ▪ No reallocations upon coflow arrivals/departures

Handling Arbitrary Arrival Times • Framework: Khuller, Li, Sturmfels , Sun, Venkat, ‘18 • Time divided into epochs 0 1 2 4 8 • In each epoch ▪ Choose subset of unscheduled jobs 0 1 2 4 8 ▪ Schedule in next epoch using offline alg. Provides 12-competitive performance (details in paper)

Evaluation Overview • Testbed implementation on top of TCP ▪ Evaluate impact of in-network congestion, and hardware constraints • Simulations ▪ Coflows arrive at time 0 ▪ Coflows arrive at arbitrary times ▪ Sensitivity analysis ➢ Coflow sizes, structure, # of coflows ➢ Network topologies, Oversubscription ratios, Network load ➢ … All simulations, workloads, and implementations are open- sourced on Sincronia website

Simulation Results Offl fline 526 coflow trace [Varys] 9 8 7 Facebook trace 6 1000 coflow trace 5 4 2000 coflow trace 3 2 OCT : Completion 1 time of a coflow 0 in an unloaded 90 th 99 th Average network percentile percentile Sincronia not only provides near-optimal guarantees, but also improves upon state-of-the-art design in practice

Simulation Results Online 4 Network Load = 0.9 3.5 3 1000 coflow trace 2.5 Slowdown 2000 coflow trace 2 1.5 1 0.5 0 90 th 99 th Average percentile percentile Even at such high network loads, Sincronia achieves CCT close to that of an unloaded network

Implementation Results Implemented on top of TCP • 16-server Fat tree topology ▪ Full bisection bandwidth ▪ 20 PICA8 switches ➢ Supports 8 priority levels • DiffServ for priority scheduling

Implementation Results 160 - Unfair Evaluation 140 • TCP not designed for coflows 120 • TCP not designed to minimize CT 100 + Compare against existing designs 80 • E.g. Varys reports 1.85x improvement 60 at mean and at tails 40 20 0 90 th percentile 99 th percentile Average Sincronia achieves significant improvements over existing network designs even with a small number of priority levels

Summary • Sincronia – a network design for coflows • 4x within optimal • No per-flow rate allocation Performance Run on existing Work Starvation Name Guarantees Transport Conserving Avoiding Varys On Scheduling Coflows (4-apx) Sincronia (4-apx) • Paper discusses number of open problems

Thanks!

Sincronia: Near-Optimal Network Design for Coflows Shijin - PowerPoint PPT Presentation

Sincronia: Near-Optimal Network Design for Coflows Shijin Rajakrishnan Joint work with Saksham Agarwal Akshay Narayan Rachit Agarwal David Shmoys Amin Vahdat The Flow Abstraction FTP Email HTTP Traditional Applications : Care about

Markov Decision Processes and Exact Solu6on Methods: Value

C++ support for better hardware/software co-design in C# with SME Kenneth Skovhede FSP 2017

GOVERNMENT PROMOTION OF LEARNING AND INNOVATION IN SMEs OF INDUSTRIALIZING ECONOMIES Subsidies,

The Role of Small and Medium Enterprises in Structural transformation and Economic Development

Anno unc e me nt Anno unc e me nt FIT100 FIT100 FIT100 No: Ye s: Midte rms Midte rms

Defining Classes, Part 1 Rose-Hulman Institute of Technology Computer Science and Software

Darrell Bethea June 3, 2011 Lab 8 solution posted Program 4 due in 1 week Final exam

CHARTER STUDY BACKGROUND On 7/16/12, the Plainfield City Council passed an ordinance

Welcome Columbus Investor Forum May 12, 2016 Columbus Investor Forum Market of Focus:

Spectrum Painting Richard Shipman MW0RCZ ADARS 6th Jan 2020 Introduction Introduction

Towards a Framework for Responsible Timing Dr. Joo Geada CLK Design Automation CLK

USI NG T HE SOL UT I ON F OCUSE D APPROACH I N NE W ZE AL AND POL I CE K E Y OUT

Causes of The Great Depression The Great Depression is one of the most misunderstood events

CS 403X Mobile and Ubiquitous Computing Lecture 9: Face Detection, Widget Catalog, SQLite

Multimodal Biometrics with Auxiliary Information Quality, Userspecific, Cohort information

Digit al Makin g Pr ogr am m e Lesson 1: Basi c Funct i ons Sim il ar ity Bet ween Scr act ch

Escaping Large Deceptive Basins of Attraction with Heavy-Tailed Mutation Operators Tobias

International Income Taxation Chapter 1: I NTRODUCTION Professors Wells January 13, 2014 Chapter

Assuring Meaningful Learning Using Evidence based Strategies Cary Kreutzer, EdD, MPH, RD July

Outline Common Errors & Tax Obligations Correct Treatment Offences & Filing of Income

Normal Forms and Physical Database Design Ramakrishnan & Gehrke, Chapter 17 & 18 320302

TERENA Certificate Service Milan Sova CESNET Agenda History TCS Structure Procedures

Certificate Directory for SIP Cullen Jennings fluffy@cisco.com SIP Security & SMIME SIP

User-centric Security Models Manuel Ziegler Advisor: Heiko Niedermayer Seminar Future Internet

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Sincronia: Near-Optimal Network Design for Coflows Shijin - PowerPoint PPT Presentation

Sincronia: Near-Optimal Network Design for Coflows Shijin Rajakrishnan Joint work with Saksham Agarwal Akshay Narayan Rachit Agarwal David Shmoys Amin Vahdat The Flow Abstraction FTP Email HTTP Traditional Applications : Care about

Markov Decision Processes and Exact Solu6on Methods: Value

C++ support for better hardware/software co-design in C# with SME Kenneth Skovhede FSP 2017

GOVERNMENT PROMOTION OF LEARNING AND INNOVATION IN SMEs OF INDUSTRIALIZING ECONOMIES Subsidies,

The Role of Small and Medium Enterprises in Structural transformation and Economic Development

Anno unc e me nt Anno unc e me nt FIT100 FIT100 FIT100 No: Ye s: Midte rms Midte rms

Defining Classes, Part 1 Rose-Hulman Institute of Technology Computer Science and Software

Darrell Bethea June 3, 2011 Lab 8 solution posted Program 4 due in 1 week Final exam

CHARTER STUDY BACKGROUND On 7/16/12, the Plainfield City Council passed an ordinance

Welcome Columbus Investor Forum May 12, 2016 Columbus Investor Forum Market of Focus:

Spectrum Painting Richard Shipman MW0RCZ ADARS 6th Jan 2020 Introduction Introduction

Towards a Framework for Responsible Timing Dr. Joo Geada CLK Design Automation CLK

USI NG T HE SOL UT I ON F OCUSE D APPROACH I N NE W ZE AL AND POL I CE K E Y OUT

Causes of The Great Depression The Great Depression is one of the most misunderstood events

CS 403X Mobile and Ubiquitous Computing Lecture 9: Face Detection, Widget Catalog, SQLite

Multimodal Biometrics with Auxiliary Information Quality, Userspecific, Cohort information

Digit al Makin g Pr ogr am m e Lesson 1: Basi c Funct i ons Sim il ar ity Bet ween Scr act ch

Escaping Large Deceptive Basins of Attraction with Heavy-Tailed Mutation Operators Tobias

International Income Taxation Chapter 1: I NTRODUCTION Professors Wells January 13, 2014 Chapter

Assuring Meaningful Learning Using Evidence based Strategies Cary Kreutzer, EdD, MPH, RD July

Outline Common Errors &amp; Tax Obligations Correct Treatment Offences &amp; Filing of Income

Normal Forms and Physical Database Design Ramakrishnan &amp; Gehrke, Chapter 17 &amp; 18 320302

TERENA Certificate Service Milan Sova CESNET Agenda History TCS Structure Procedures

Certificate Directory for SIP Cullen Jennings fluffy@cisco.com SIP Security &amp; SMIME SIP

User-centric Security Models Manuel Ziegler Advisor: Heiko Niedermayer Seminar Future Internet

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Outline Common Errors & Tax Obligations Correct Treatment Offences & Filing of Income

Normal Forms and Physical Database Design Ramakrishnan & Gehrke, Chapter 17 & 18 320302

Certificate Directory for SIP Cullen Jennings fluffy@cisco.com SIP Security & SMIME SIP