Experiences in Using OS-level Virtualization for Block I/O
Dan Huang, University of Central Florida; Jun Wang, University of Central Florida; Gary Liu, Oak Ridge National Lab
Contents
- Motivation
- Background for Virtualization
- Our Solution: I/O Throttling Middleware
- Evaluations
- Related Work
- Conclusion
- Acknowledgement
Motivation
In HPC today, job schedulers such as PBS/TORQUE assign physical nodes exclusively to users for running jobs.
- Easy configuration through batch scripts
- Low resource utilization
- Hard to meet the QoS requirements of interactive and ad hoc analytics
Multiple jobs access shared distributed or parallel file systems to load or save data.
- Interference on the PFS
- Negative impact on jobs' QoS
Resource Consolidation in Cloud Computing
In data centers, cloud computing is widely deployed for elastic resource provisioning.
- High isolation with low mutual interference
Cloud computing employs various virtualization technologies to consolidate physical resources:
- Hypervisor-based virtualization: VMware, Xen, KVM
- OS-level virtualization: Linux Containers (LXC), OpenVZ, Docker
Virtualization in HPC
HPC uses high-end, dedicated nodes to run scientific computing jobs.
- Could an HPC analysis cluster be virtualized with low overhead?
- What type of virtualization should be adopted?
According to previous studies [1, 2, 3], the overhead of hypervisor-based virtualization is high:
- Overhead on disk throughput ≈ 36%
- Overhead on memory throughput ≈ 53%
[1] Nikolaus Huber, Marcel von Quast, Michael Hauck, and Samuel Kounev. Evaluating and modeling virtualization performance overhead for cloud environments. In CLOSER, pages 563-573, 2011.
[2] Stephen Soltesz, Herbert Pötzl, Marc E. Fiuczynski, Andy Bavier, and Larry Peterson. Container-based operating system virtualization: a scalable, high-performance alternative to hypervisors. In ACM SIGOPS Operating Systems Review, volume 41, pages 275-287. ACM, 2007.
[3] Miguel G. Xavier, Marcelo Veiga Neves, Fabio D. Rossi, Tiago C. Ferreto, Timoteo Lange, and Cesar A. F. De Rose. Performance evaluation of container-based virtualization for high performance computing environments. In Parallel, Distributed and Network-Based Processing (PDP), 2013 21st Euromicro International Conference on, pages 233-240. IEEE, 2013.
Background for Virtualization
Hypervisor and OS-level Virtualization
Virtualization technology trades isolation against overhead.
- Hypervisor-based virtualization places a hypervisor (VM monitor) layer under each guest OS; it introduces high performance overhead that is not acceptable in HPC.
- OS-level (container-based) virtualization is a lightweight layer in the Linux kernel.
Hypervisor and OS-level Virtualization (cont.)
The Internal Components of OS-level Virtualization
OS-level virtualization shares the host operating system kernel. Its two key building blocks are shown below:
1) Control Groups (CGroups): control resource usage per process group.
2) Linux Namespaces: create sets of isolated namespaces, such as the PID and network namespaces.
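The slides do not show the kernel interface behind these components, so here is a minimal Python sketch of the CGroups side only. It assumes a cgroup-v1 hierarchy with the blkio controller mounted at /sys/fs/cgroup/blkio (on CentOS 6, as used later in the evaluation, the mount point is often /cgroup/blkio instead); the function names are ours, and the script must run as root.

```python
import os

# Assumption: cgroup v1 with the blkio controller mounted here.
BLKIO_ROOT = "/sys/fs/cgroup/blkio"

def create_group(name):
    """Creating a directory under a controller hierarchy creates a new
    cgroup; the kernel populates its control files automatically."""
    path = os.path.join(BLKIO_ROOT, name)
    if not os.path.isdir(path):
        os.mkdir(path)
    return path

def add_process(path, pid):
    """Writing a PID to 'tasks' moves that process into the cgroup,
    so its block I/O is accounted against this group."""
    with open(os.path.join(path, "tasks"), "w") as f:
        f.write(str(pid))

if __name__ == "__main__":
    group = create_group("vnode0")   # one cgroup per process group
    add_process(group, os.getpid())  # confine the current process
```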
Allocating Block I/O via OS-level Virtualization
The CGroups blkio module offers two methods for allocating block I/O, sketched below:
1) Throttling: set an upper limit on a process group's block I/O.
2) Weight: assign proportional shares of block I/O to groups of processes.
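Concretely, in cgroup v1 these two methods map onto the control files blkio.throttle.*_bps_device (absolute per-device caps) and blkio.weight (proportional shares under the CFQ scheduler). A hedged sketch follows, reusing the vnode0 group from the previous example; the device number 8:0 and the 20 MB/s cap are illustrative values only.

```python
import os

GROUP = "/sys/fs/cgroup/blkio/vnode0"  # cgroup created as in the sketch above

# 1) Throttling: cap this group's reads on one device at 20 MB/s.
#    Format: "<major>:<minor> <bytes_per_second>"; 8:0 is typically /dev/sda.
with open(os.path.join(GROUP, "blkio.throttle.read_bps_device"), "w") as f:
    f.write("8:0 20971520")

# 2) Weight: give this group a proportional share of block I/O under CFQ.
#    On this kernel generation weights range from 100 to 1000 and only
#    matter when groups compete: weight 500 gets twice the share of 250.
with open(os.path.join(GROUP, "blkio.weight"), "w") as f:
    f.write("500")
```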
Our Solution: I/O Throttling Middleware
Create Virtual Node (VNode)
The Gap Between Virtual Nodes and the PFS
Configuration gap: the shared I/O resources of a PFS are hard to control with current resource-allocation mechanisms, because the I/O configurations on users' VNodes cannot take effect on a remote PFS.
The Design of I/O Throttling Middleware
The Structure of VNode Sync
VNode Sync:
1) Accepts I/O configurations
2) Applies the I/O configurations to VNodes
3) Intercepts users' I/O request handlers
4) Inserts the handlers into the corresponding VNodes
A sketch of steps 2 and 4 follows this list.
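The slides give the design but not the code, so the following is only a minimal sketch of how steps 2 and 4 could look if VNodes are realized as cgroup-v1 blkio groups on the storage nodes. The names apply_io_config and insert_handler are hypothetical, and interception (step 3) is reduced here to already knowing the handler's thread ID.

```python
import os

CGROUP_ROOT = "/sys/fs/cgroup/blkio"  # assumption: cgroup v1 blkio hierarchy

def apply_io_config(vnode, device, read_bps):
    """Step 2: apply a received I/O configuration to the VNode's cgroup."""
    path = os.path.join(CGROUP_ROOT, vnode, "blkio.throttle.read_bps_device")
    with open(path, "w") as f:
        f.write("%s %d" % (device, read_bps))

def insert_handler(vnode, handler_tid):
    """Step 4: place an intercepted I/O handler thread into the VNode's
    cgroup. In cgroup v1, writing a TID to 'tasks' moves that one thread,
    so the PFS I/O it issues is throttled under the user's VNode."""
    with open(os.path.join(CGROUP_ROOT, vnode, "tasks"), "w") as f:
        f.write(str(handler_tid))

# Example: grant a user 40 MB/s of read bandwidth on device 8:16 and
# account the handler thread (TID 4321) serving that user's requests.
apply_io_config("vnode_user1", "8:16", 40 * 1024 * 1024)
insert_handler("vnode_user1", 4321)
```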
Evaluations
Single Node Testbed
The configuration of the single-node testbed:
- Make & Model: Dell XPS 8700
- CPU: Intel i7, 64-bit, 8 MB L2, 2.8 GHz, 4 cores
- RAM: 8 × 2 GB
- Internal Hard Disk: 1 × Western Digital Black SATA, 7200 RPM, 1 TB
- Local File System: ext3
- Operating System: CentOS 6, 64-bit, kernel 2.6.32-504.8.1.el6
Distributed Testbed
We reserved 17 nodes in the Marmot cluster. The configuration:
- Make & Model: Dell PowerEdge 1950
- CPU: 2 × Opteron 242, 64-bit, 1 MB L2, 1.6 GHz
- RAM: 8 × 2.0 GB RDIMM, PC3200, CL3
- Internal Hard Disk: 1 × Western Digital Black SATA, 7200 RPM, 2 TB
- Network Connection: 1 × Gigabit Ethernet
- Operating System: CentOS 6, 64-bit, kernel 2.6.32-504.8.1.el6
- Switch Make & Model: 152-port Extreme Networks BlackDiamond 6808
- HDFS: 1 head node and 16 storage nodes
- Lustre: 1 head node, 8 storage nodes, and 8 client nodes
Read Overhead on Single Node
[Figure: normalized read bandwidth of 1, 2, 4, and 8 VNodes versus the physical node, for 16 KB and 16 MB object sizes.]
The worst-case read overhead is less than 10%.
Throttling Read on Single Node
[Figure: read bandwidth (MB/s) for 16 KB and 16 MB object sizes under throttle rates of 10, 20, 30, and 40 MB/s on the bottom VNode, versus the unthrottled physical node.]
The throttling functionality guarantees that a process's I/O does not exceed its upper limit, but the achieved bandwidth is strongly influenced by other concurrent processes.
Weight Read on Single Node
[Figure: normalized read bandwidth of 1 to 4 VNodes assigned weight shares between 20% and 100%, for 16 KB and 16 MB object sizes, versus the physical node.]
The results show that the overhead of the weight function is less than 8%. The weight module does not suffer from interference and provides effective isolation.
I/O Throttling on PFS
[Figure: aggregate read bandwidth (MB/s) of HDFS with data locality, HDFS without data locality, Lustre N-to-N, and Lustre N-to-1, for the W/O_VN baseline and throttle rates of 10, 20, 40, 80, and 160 MB/s on DFS block I/O.]
The I/O throttling middleware can effectively control the aggregate bandwidth of PFSs while introducing negligible overhead.
I/O Throttling on Real Application
[Figure: data-load time and computing time of ParaView (ms) for the W/O_WDM and W/O_THTL baselines and for throttle rates of 10, 20, 40, 60, 80, and 100 MB/s applied to competing daemons' I/O.]
The finish time of ParaView increases as the I/O throttle rate granted to the background daemons increases.
Related Work, Conclusion, and Acknowledgement