DONAR: Decentralized Server Selection for Cloud Services. Patrick Wendell, Princeton University. Joint work with Joe Wenjie Jiang, Michael J. Freedman, and Jennifer Rexford.
Outline • Server selection background • Constraint-based policy interface • Scalable optimization algorithm • Production deployment
User-Facing Services are Geo-Replicated
Reasoning About Server Selection: client requests flow to mapping nodes, which direct each request to one of the service replicas.
Example: Distributed DNS. Clients (Client 1 … Client C) are DNS resolvers; mapping nodes are authoritative nameservers (DNS 1 … DNS 10); service replicas are the servers themselves.
Example: HTTP Redir/Proxying. Clients (Client 1 … Client C) are HTTP clients; mapping nodes are HTTP proxies (Proxy 1 … Proxy 500); service replicas are datacenters.
Reasoning About Server Selection: outsource the mapping decision to DONAR.
Outline • Server selection background • Constraint-based policy interface • Scalable optimization algorithm • Production deployment
Naïve Policy Choices: load-aware “Round Robin”.
Naïve Policy Choices: location-aware “Closest Node”. Goal: support complex policies across many nodes.
Policies as Constraints: each customer declares per-replica constraints to DONAR, e.g. bandwidth_cap = 10,000 req/m, split_ratio = 10%, allowed_dev = ± 5%.
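As a rough illustration of such a policy record (a sketch only; the class and field names are hypothetical, not DONAR's actual API), the constraints on this slide could be written as:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class ReplicaPolicy:
    """Hypothetical per-replica policy record (illustrative, not DONAR's API)."""
    replica_id: str
    bandwidth_cap: Optional[int] = None  # max requests per minute; None = uncapped
    split_ratio: Optional[float] = None  # desired share of total traffic
    allowed_dev: float = 0.0             # tolerated deviation from split_ratio

# The constraints shown on this slide:
policy = ReplicaPolicy(replica_id="replica-1",
                       bandwidth_cap=10_000,   # 10,000 req/m
                       split_ratio=0.10,       # 10% of traffic
                       allowed_dev=0.05)       # +/- 5%
```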
E.g., a 10-server deployment: how do we describe policy with constraints?
No Constraints: equivalent to “Closest Node”. Requests per replica: 35%, 28%, 10%, 9%, 7%, 6%, 2%, 2%, 1%, 1%.
Now impose a 20% cap per replica on that same distribution.
Cap as Overload Protection. Requests per replica: 20%, 20%, 20%, 14%, 10%, 7%, 6%, 2%, 1%, 1%.
12 Hours Later… Requests per replica: 29%, 16%, 16%, 12%, 10%, 5%, 4%, 3%, 3%, 3%.
“Load Balance” (split = 10%, tolerance = 5%). Requests per replica: 15%, 15%, 15%, 15%, 15%, 5%, 5%, 5%, 5%, 5%. This trades off network proximity against load distribution.
12 Hours Later… Requests per replica: 15%, 15%, 15%, 13%, 10%, 10%, 7%, 5%, 5%, 5%. A large range of policies can be expressed by varying the cap and weight.
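As a toy illustration of what these policies mean (assumptions: a plain list of per-replica shares and hypothetical helper names, not DONAR's code), one can check whether a measured split like the one above respects a declared cap / weight / tolerance policy:

```python
def satisfies_policy(split, total_requests, caps, weights, tolerances):
    """Toy check: does a traffic split respect per-replica bandwidth caps
    and split-ratio +/- tolerance constraints? (Illustrative only.)"""
    for i, share in enumerate(split):
        if caps[i] is not None and share * total_requests > caps[i]:
            return False                      # bandwidth cap exceeded
        if weights[i] is not None and abs(share - weights[i]) > tolerances[i]:
            return False                      # outside allowed deviation
    return True

# "Load balance" example: split = 10% each, tolerance = 5%
split = [0.15] * 5 + [0.05] * 5               # the distribution shown above
ok = satisfies_policy(split, total_requests=1_000_000,
                      caps=[None] * 10, weights=[0.10] * 10,
                      tolerances=[0.05] * 10)
print(ok)  # True: every replica stays within 10% +/- 5%
```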
Outline • Server selection background • Constraint-based policy interface • Scalable optimization algorithm • Production deployment
Optimization: Policy Realization. Clients: c ∈ C; Nodes: n ∈ N; Replica Instances: i ∈ I.
A global LP describes the “optimal” pairing, minimizing network cost:
    min Σ_{c∈C} Σ_{i∈I} α_{ci} · R_c · cost(c, i)
s.t.
    |P_i − ω_i| ≤ ε_i   (server loads within tolerance)
    B · P_i < B_i        (bandwidth caps met; B is total traffic, B_i is replica i's cap)
Here α_{ci} is the probability of mapping client c to instance i, R_c is the traffic from c, and P_i is the resulting share of traffic at replica i.
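For concreteness, here is a minimal sketch of this LP on a toy instance using scipy's linprog; every number, variable name, and the exact constraint encoding are illustrative assumptions, not DONAR's implementation.

```python
import numpy as np
from scipy.optimize import linprog

# Toy instance: 3 client groups, 2 replica instances (all numbers illustrative).
R    = np.array([100., 200., 50.])            # request volume per client group
cost = np.array([[1., 5.],                    # cost (e.g. latency) of mapping
                 [4., 2.],                    #   client c to replica i
                 [3., 3.]])
w, eps = np.array([0.5, 0.5]), np.array([0.1, 0.1])   # target split +/- tolerance
cap  = np.array([250., 250.])                 # per-replica bandwidth caps
C, I = cost.shape
Rtot = R.sum()

# Decision variables x[c*I + i] = alpha_ci, the fraction of client c sent to replica i.
obj = (R[:, None] * cost).ravel()             # minimize sum of R_c * alpha_ci * cost(c, i)

# Each client's fractions must sum to 1.
A_eq = np.zeros((C, C * I))
for c in range(C):
    A_eq[c, c * I:(c + 1) * I] = 1.0
b_eq = np.ones(C)

# Load split within tolerance, and bandwidth caps met.
P = np.zeros((I, C * I))                      # row i picks out replica i's traffic share P_i
for c in range(C):
    for i in range(I):
        P[i, c * I + i] = R[c] / Rtot
A_ub = np.vstack([P, -P, P * Rtot])           # P_i <= w+eps, -P_i <= -(w-eps), traffic_i <= cap_i
b_ub = np.concatenate([w + eps, -(w - eps), cap])

res = linprog(obj, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=(0, 1))
print(res.x.reshape(C, I))                    # optimal mapping probabilities alpha_ci
```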
Optimization Workflow: track the replica set → measure traffic → calculate the optimal assignment.
This runs per customer, and continuously (responding to the underlying traffic).
By The Numbers: DONAR nodes, customers, replicas per customer, and client groups per customer range from roughly 10^1 to 10^4. Problem size for each customer: 10^2 × 10^4 = 10^6 (replicas × client groups).
Measure Traffic & Optimize Locally at each mapping node?
Not accurate! No single node sees the entire client population.
Aggregate at a Central Coordinator? Nodes share their traffic measurements with the coordinator (10^6 entries), the coordinator optimizes, then returns the assignments (10^6 entries).
So Far:
                        Accurate   Efficient   Reliable
  Local only               No         Yes         Yes
  Central Coordinator      Yes        No          No
Decomposing the Objective Function. The global objective
    min Σ_{c∈C} Σ_{i∈I} α_{ci} · R_c · cost(c, i)
(R_c: traffic from c; α_{ci}: probability of mapping c to i; cost(c, i): cost of mapping c to i)
can be rewritten as a sum over nodes:
    = Σ_{n∈N} s_n Σ_{c∈C} Σ_{i∈I} α_{nci} · R_{nc} · cost(c, i)
(s_n: share of traffic arriving at node n). We also decompose the constraints, which is more complicated.
Decomposed Local Problem for some node n*:
    min Σ_i load_i + s_{n*} Σ_{c∈C} Σ_{i∈I} α_{n*ci} · R_{n*c} · cost(c, i)
where load_i = f(prevailing load on each server + the load n* will impose on each server). The first term uses global load information; the second is local distance minimization.
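A sketch of how such a per-node objective might be evaluated (the load_penalty stand-in and all names are assumptions for illustration; the paper decomposes the actual constraints rather than using a penalty term):

```python
import numpy as np

def local_objective(alpha_local, R_local, cost, s_local, other_nodes_load, load_penalty):
    """Illustrative per-node objective for node n* (not DONAR's actual code).

    alpha_local[c, i]   -- n*'s mapping probabilities for its client groups
    R_local[c]          -- traffic n* sees from client group c
    cost[c, i]          -- network cost of mapping client c to replica i
    s_local             -- n*'s share of total traffic
    other_nodes_load[i] -- load already imposed on replica i by all other nodes
                           (the summary data exchanged between nodes)
    load_penalty        -- penalty on per-replica load, standing in for the
                           decomposed load constraints
    """
    # Load replica i would carry: what the other nodes send plus what n* would send.
    my_load = s_local * (alpha_local * R_local[:, None]).sum(axis=0)
    total_load = other_nodes_load + my_load

    # Global load term + n*'s own network-distance term.
    distance = s_local * (alpha_local * R_local[:, None] * cost).sum()
    return load_penalty(total_load).sum() + distance
```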
DONAR Algorithm: each mapping node solves its local problem, then shares summary data with the other nodes (~10^2 values); the nodes keep alternating local solves with these summary exchanges.
DONAR Algorithm: • Provably converges to the global optimum • Requires no coordination • Reduces message passing by 10^4
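A structural sketch of this decentralized loop (the node objects, their solve_local_problem / current_load_vector methods, and the round count are hypothetical; only the message-passing pattern comes from the slides):

```python
import numpy as np

def donar_style_iteration(nodes, num_replicas, rounds=20):
    """Each node alternately re-solves its local problem and shares only its
    per-replica load contribution (~10^2 numbers), never per-client traffic (~10^6)."""
    # load_by_node[name][i] = load that node currently imposes on replica i
    load_by_node = {n.name: np.zeros(num_replicas) for n in nodes}

    for _ in range(rounds):                   # in practice: until assignments converge
        for n in nodes:
            # Summary data from all other nodes: their aggregate per-replica load.
            others = sum((v for k, v in load_by_node.items() if k != n.name),
                         np.zeros(num_replicas))
            # Node n re-optimizes its own mapping given that summary ...
            n.solve_local_problem(prevailing_load=others)
            # ... and broadcasts only its new per-replica load vector.
            load_by_node[n.name] = n.current_load_vector()
```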
Better!
                        Accurate   Efficient   Reliable
  Local only               No         Yes         Yes
  Central Coordinator      Yes        No          No
  DONAR                    Yes        Yes         Yes
Outline • Server selection background • Constraint-based policy interface • Scalable optimization algorithm • Production deployment
Production Deployment • Publicly deployed 24/7 since November 2009 • IP2Geo data from Quova Inc. • Production use: all MeasurementLab services (incl. FCC Broadband Testing) and CoralCDN • Serves around 1M DNS requests per day
Systems Challenges (See Paper!) • Network availability: Anycast with BGP • Reliable data storage: Chain-Replication with Apportioned Queries • Secure, reliable updates: Self-Certifying Update Protocol
CoralCDN Experimental Setup: client requests reach DONAR nodes, which map them to CoralCDN replicas; policy: split_weight = 0.1, tolerance = 0.02.
Results: DONAR Curbs Volatility (figure comparing the “Closest Node” policy with the DONAR “Equal Split” policy).
Results: DONAR Minimizes Distance (figure: requests per replica, ranked from closest (1) to farthest (10), for Minimal/“Closest Node”, DONAR, and Round-Robin).
Conclusions • Dynamic server selection is difficult – global constraints – distributed decision-making • Services reap the benefits of outsourcing to DONAR – flexible policies – general: supports DNS & HTTP proxying – efficient distributed constraint optimization • Interested in using it? Contact me or visit http://www.donardns.org.
Questions?
Related Work (Academic and Industry)
• Academic
  – Improving network measurement: H. V. Madhyastha, T. Isdal, M. Piatek, C. Dixon, T. Anderson, A. Krishnamurthy, and A. Venkataramani, “iPlane: An Information Plane for Distributed Services,” in OSDI, Nov. 2006.
  – Application-layer anycast: Michael J. Freedman, Karthik Lakshminarayanan, and David Mazières, “OASIS: Anycast for Any Service,” in Proc. 3rd USENIX/ACM Symposium on Networked Systems Design and Implementation (NSDI '06), San Jose, CA, May 2006.
• Proprietary
  – Amazon Elastic Load Balancing
  – UltraDNS
  – Akamai Global Traffic Management
Doesn’t [Akamai/UltraDNS/etc.] Already Do This? • Existing approaches use alternative, centralized formulations. • They often restrict the set of nodes per service. • This loses the benefit of a large number of nodes (proxies/DNS servers/etc.).