Towards General-Purpose Resource Management in Shared Cloud - PowerPoint PPT Presentation

Towards General-Purpose Resource Management in Shared Cloud Services Jonathan Mace , Brown University Peter Bodik, MSR Redmond Rodrigo Fonseca, Brown University Madanlal Musuvathi, MSR Redmond

Shared-tenant cloud services Processes service requests from multiple clients ✓ Great for cost and efficiency ✘ Performance is a challenge Aggressive tenants and system maintenance tasks Resource starvation and bottlenecks Degraded performance, Violated SLOs, system outages 2

Shared-tenant cloud services Ideally manage resources to provide end-to-end guarantees and isolation Challenge OS/hypervisor mechanisms insufficient ✘ Shared threads & processes ✘ Application-level resource bottlenecks (locks, queues) ✘ Resources across multiple processes and machines Today lack of guarantees, isolation some ad-hoc solutions 3

This paper • 5 design principles for resource policies in shared- tenant systems • Retro – prototype for principled resource management • Preliminary demonstration of Retro in HDFS 4

Hadoop Distributed File System (HDFS) HDFS DataNode HDFS NameNode HDFS DataNode HDFS DataNode Replicated block storage Filesystem metadata 5

Hadoop Distributed File System (HDFS) HDFS DataNode HDFS NameNode HDFS DataNode HDFS DataNode Replicated block storage Filesystem metadata 6

HDFS DataNode HDFS NameNode HDFS DataNode HDFS DataNode 9

Principle 1: Consider all resources and request types • Fine-grained resources within processes • Resources shared between processes (disk, network) • Many different API calls • Bottlenecks can crop up in many places hardware resources: disk, network, cpu , … software resources : locks, queues, … data structures: transaction logs, shared batches, … 12

Principle 2: Distinguish between tenants • Tenants might send different types of requests • Tenants might be utilizing different machines • If a policy is efficient , it should be able to target the cause of contention e.g., if a tenant is causing contention, throttle otherwise leave the tenant alone 17

Admission Control HDFS DataNode HDFS NameNode HDFS DataNode HDFS DataNode 19

Admission Control HDFS DataNode HDFS NameNode HDFS DataNode HDFS DataNode while (!Thread. isInterrupted ()){ sendPacket(); } 20

Admission Control HDFS DataNode HDFS NameNode HDFS DataNode HDFS DataNode Principle 5: while (!Thread. isInterrupted ()){ rate_limit(); Schedule early, sendPacket(); } schedule often 21

Resource Management Design Principles 1. Consider all request types and all resources 2. Distinguish between tenants 3. Treat foreground and background tasks uniformly 4. Estimate resource usage at runtime 5. Schedule early, schedule often Retro – prototype for principled resource management in shared-tenant systems 22

Retro: end-to-end tracing Tenants 23

Retro: end-to-end tracing Tenants 24

Retro: application-level resource interception Tenants 25

Retro: aggregation and centralized reporting Tenants 26

Retro: application-level enforcement Tenants 27

Retro: distributed scheduling Tenants 28

Retro: distributed scheduling Tenants 29

Early Results 1.1 1.2 HDFS Normalized Throughput HDFS w/ Retro HDFS NNBench Normalized Latency benchmark 0.01% to 2% 1 average overhead 1 on end-to-end latency, throughput 0.9 0.8 Open Read Create Rename Delete Open Read Create Rename Delete 30

Retrospective Thus far: • Per-tenant identification • Resource measurements • Schedule enforcement Next steps: • Abstractions for writing simplified high-level policies • Low-level enforcement mechanisms • Policies to monitor system, find bottlenecks, provide guarantees 33

Towards General-Purpose Resource Management in Shared Cloud - PowerPoint PPT Presentation

Towards General-Purpose Resource Management in Shared Cloud Services Jonathan Mace , Brown University Peter Bodik, MSR Redmond Rodrigo Fonseca, Brown University Madanlal Musuvathi, MSR Redmond Shared-tenant cloud services Processes service

Wednesday, November 30, 2016 3:41 PM General Page 1 General Page 2 General Page 3 General Page

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

Towards an Italian RSG ? Towards an Italian RSG ? Achille Zappa achille.zappa@gmail.com

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

OUR JOURNEY By Giving PURPOSE to LEARNING & PURPOSE to LIFE

Purpose: Purpose: Purpose: Purpose: Assessment of status and trends in biodiversity

Purpose, Function, and Design Purpose, Function, and Design Purpose, Function, and Design

Whats My Purpose? Caterpillar Confidential Green 1 How to Think of Purpose What gives you

Stylization with a Purpose Stylization with a Purpose Stylization with a Purpose Stylization

Catalysts for H 2 Iceland s first step towards the s first step towards the Iceland

Towards a General Computation-Oriented Simplicial Complexes . . . Description of Physical

General-Purpose Input/Output Textbook: Chapter 14 General-Purpose I/O programming 1 I/O devices

A GENERAL-PURPOSE A GENERAL-PURPOSE CRN-TO-DSD COMPILER FRAMEWORK CRN-TO-DSD COMPILER FRAMEWORK

5.1. GENERAL-PURPOSE INSTRUCTIONS The general-purpose instructions preform basic data movement,

PC BUILDING PRESENTED BY WHAT IS A PC General purpose Personal Computer for individual usage

Why Attitude to Good Towards Explanation . . . Towards Explanation . . . People Is Not Always

Where to store all the IoT Data? Piotr Robert Konopelko Business & Technical Support

Sefos A self-aware factored operating system A Traditional OS App 1 App 2 App 3 System call

Distributed File Storage in Multi-Tenant Clouds using CephFS Openstack Vancouver 2018 May 23

When Your Business Depends On It The Evolution of a Global File System for a Global Enterprise

HBase on top of HDFS Seminar Software Systems Engineering "Mobile, Security, Cloud

Twitter Data Processing with MongoDB By Ama & Sameera Introduction Create twitter

HCI & Storage 1 2 Isilon The Recognized Leader Reflects on both product

Data Lake to AI on GPUs CPUs can no longer handle the growing data demands of data science

Towards General-Purpose Resource Management in Shared Cloud - PowerPoint PPT Presentation

Towards General-Purpose Resource Management in Shared Cloud Services Jonathan Mace , Brown University Peter Bodik, MSR Redmond Rodrigo Fonseca, Brown University Madanlal Musuvathi, MSR Redmond Shared-tenant cloud services Processes service

Wednesday, November 30, 2016 3:41 PM General Page 1 General Page 2 General Page 3 General Page

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

Towards an Italian RSG ? Towards an Italian RSG ? Achille Zappa achille.zappa@gmail.com

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

OUR JOURNEY By Giving PURPOSE to LEARNING &amp; PURPOSE to LIFE

Purpose: Purpose: Purpose: Purpose: Assessment of status and trends in biodiversity

Purpose, Function, and Design Purpose, Function, and Design Purpose, Function, and Design

Whats My Purpose? Caterpillar Confidential Green 1 How to Think of Purpose What gives you

Stylization with a Purpose Stylization with a Purpose Stylization with a Purpose Stylization

Catalysts for H 2 Iceland s first step towards the s first step towards the Iceland

Towards a General Computation-Oriented Simplicial Complexes . . . Description of Physical

General-Purpose Input/Output Textbook: Chapter 14 General-Purpose I/O programming 1 I/O devices

A GENERAL-PURPOSE A GENERAL-PURPOSE CRN-TO-DSD COMPILER FRAMEWORK CRN-TO-DSD COMPILER FRAMEWORK

5.1. GENERAL-PURPOSE INSTRUCTIONS The general-purpose instructions preform basic data movement,

PC BUILDING PRESENTED BY WHAT IS A PC General purpose Personal Computer for individual usage

Why Attitude to Good Towards Explanation . . . Towards Explanation . . . People Is Not Always

Where to store all the IoT Data? Piotr Robert Konopelko Business &amp; Technical Support

Sefos A self-aware factored operating system A Traditional OS App 1 App 2 App 3 System call

Distributed File Storage in Multi-Tenant Clouds using CephFS Openstack Vancouver 2018 May 23

When Your Business Depends On It The Evolution of a Global File System for a Global Enterprise

HBase on top of HDFS Seminar Software Systems Engineering &quot;Mobile, Security, Cloud

Twitter Data Processing with MongoDB By Ama &amp; Sameera Introduction Create twitter

HCI &amp; Storage 1 2 Isilon The Recognized Leader Reflects on both product

Data Lake to AI on GPUs CPUs can no longer handle the growing data demands of data science

OUR JOURNEY By Giving PURPOSE to LEARNING & PURPOSE to LIFE

Where to store all the IoT Data? Piotr Robert Konopelko Business & Technical Support

HBase on top of HDFS Seminar Software Systems Engineering "Mobile, Security, Cloud

Twitter Data Processing with MongoDB By Ama & Sameera Introduction Create twitter

HCI & Storage 1 2 Isilon The Recognized Leader Reflects on both product