

1. Performance Evaluation of Containers for HPC
Cristian Ruiz, Emmanuel Jeanvoine and Lucas Nussbaum, INRIA Nancy, France. VHPC'15

2. Outline: 1 Introduction, 2 State of the art, 3 Experimental evaluation, 4 Conclusions, 5 Bibliography

3. Outline (section: Introduction): 1 Introduction, 2 State of the art, 3 Experimental evaluation, 4 Conclusions, 5 Bibliography

4. Containers
The term "containers" generally refers to operating-system-level virtualization, where the kernel of an operating system allows for multiple isolated user-space instances.

5. Implementations
▶ Chroot
▶ Linux-VServer
▶ FreeBSD Jails
▶ Solaris Containers
▶ OpenVZ
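As a minimal illustration of the oldest mechanism in this list, chroot is exposed directly by the Python standard library; the root filesystem path below is an assumption, root privileges are required, and unlike the other solutions chroot only isolates the filesystem view:

import os

os.chroot("/srv/rootfs")          # assumed path to a prepared root filesystem
os.chdir("/")                     # move into the new root
os.execv("/bin/sh", ["/bin/sh"])  # run a shell confined to that filesystem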

6. Namespaces and cgroups
▶ Both features have been part of the Linux kernel since version 2.6.24
▶ Several container solutions build on them: LXC, libvirt, libcontainer, systemd-nspawn, Docker
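For illustration, a minimal sketch (not from the talk) of the namespace primitive these solutions build on, calling unshare(2) through ctypes; the flag values come from <sched.h> and root privileges are assumed:

import ctypes, os

CLONE_NEWUTS = 0x04000000   # new hostname/domainname namespace
CLONE_NEWNS  = 0x00020000   # new mount namespace

libc = ctypes.CDLL("libc.so.6", use_errno=True)
if libc.unshare(CLONE_NEWUTS | CLONE_NEWNS) != 0:
    raise OSError(ctypes.get_errno(), os.strerror(ctypes.get_errno()))

# From here on, hostname changes and new mounts are invisible to the rest
# of the system: the isolation LXC, libcontainer and systemd-nspawn rely on.
os.system("hostname container-demo && hostname")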

7. Outline (section: State of the art): 1 Introduction, 2 State of the art, 3 Experimental evaluation, 4 Conclusions, 5 Bibliography

8. Benefits of using containers in HPC
▶ Containers make it easy to provision a full software stack. They bring:
  ▶ portability
  ▶ user customization
  ▶ reproducibility of experiments
▶ Containers have a lower oversubscription overhead than full VMs, enabling:
  ▶ better resource utilization
  ▶ their use as a building block for large-scale platform emulators

9. Container performance evaluation
▶ Matthews et al. [3] compared the performance of VMware, Xen, Solaris Containers and OpenVZ using custom benchmarks
▶ Felter et al. [2] evaluated the I/O performance of Docker using MySQL, Linpack, Stream, RandomAccess, nuttcp, netperf, fio, and Redis
▶ Walter et al. [4] compared VMware Server, Xen and OpenVZ using NetPerf, IOZone, and the NAS Parallel Benchmarks
▶ Xavier et al. [5] compared Linux-VServer, OpenVZ, LXC and Xen using the HPC Challenge benchmarks and the NAS Parallel Benchmarks

10. In this work, we answer:
▶ What is the overhead of oversubscription using different versions of the Linux kernel?
▶ What is the performance of inter-container communication?
▶ What is the impact of running an HPC workload with several MPI processes inside containers?

11. Outline (section: Experimental evaluation): 1 Introduction, 2 State of the art, 3 Experimental evaluation, 4 Conclusions, 5 Bibliography

12. Experimental setup
Hardware
▶ A cluster of the Grid'5000 testbed [1], where each node is equipped with two Intel Xeon E5-2630 v3 processors (8 cores each), 128 GB of RAM and a 10 GbE adapter
▶ Our experimental setup included up to 64 machines
Software
▶ Debian Jessie; Linux kernel versions 3.2, 3.16 and 4.0; OpenMPI and the NAS Parallel Benchmarks (NPB). We instrumented the LU, EP, CG, MG, FT and IS benchmarks using TAU
▶ We automated the experimentation process using Distem (https://distem.gforge.inria.fr) and Kameleon (https://github.com/camilo1729/distem-recipes)

13. Network setup
▶ Veth pair + Linux bridge
▶ Veth pair + Open vSwitch
▶ MACVLAN or SR-IOV
▶ Phys (the physical interface)
(Diagram: containers LXC1 and LXC2, with veth pairs lxcn0/veth0 and lxcn1/veth1 attached to the Linux bridge br0 on the host's eth0, which connects to the LAN/WLAN/WAN/VLAN.)
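A minimal sketch of the first option (veth pair + Linux bridge) driven from Python with iproute2; the namespace, interface names and address are illustrative rather than the exact setup used in the paper, and root privileges plus iproute2 are assumed:

import subprocess

def sh(cmd: str) -> None:
    subprocess.run(cmd.split(), check=True)

NS = "lxc1"   # stands in for the container's network namespace

sh(f"ip netns add {NS}")
sh("ip link add veth0 type veth peer name lxcn0")   # host end / container end
sh(f"ip link set lxcn0 netns {NS}")                 # move one end into the container
sh("ip link add br0 type bridge")
sh("ip link set veth0 master br0")                  # attach the host end to the bridge
sh("ip link set veth0 up")
sh("ip link set br0 up")
sh(f"ip netns exec {NS} ip addr add 10.0.0.2/24 dev lxcn0")
sh(f"ip netns exec {NS} ip link set lxcn0 up")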

14. Linux kernel version
▶ 32 containers running on 8, 16 or 32 physical machines
Results (2 containers per node):
▶ 3.2: 1577.78%
▶ 3.16: 22.67%
▶ 4.0: 2.40%
▶ The overhead is present in MPI communication
▶ Since Linux kernel version 3.11, TSO is enabled in veth
(Figure: CG.B execution time [secs] on kernel versions 3.2, 3.16 and 4, for native runs and 1, 2 or 4 containers per node.)
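A hedged sketch (the device name is illustrative) of checking and forcing TCP Segmentation Offload on a veth device with ethtool, the feature whose absence before Linux 3.11 explains the very large overhead measured on kernel 3.2:

import subprocess

DEV = "veth0"   # illustrative veth device name

# Show the current offload settings, including tcp-segmentation-offload.
subprocess.run(["ethtool", "-k", DEV], check=True)

# Enable TSO explicitly (a no-op on kernels >= 3.11, where veth has it on by default).
subprocess.run(["ethtool", "-K", DEV, "tso", "on"], check=True)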

15. Oversubscription (Linux kernel 4.0)
▶ There is one veth per MPI process
▶ 64 containers running over 8, 16, 32 or 64 physical machines
Results:
▶ Top 3 worst performance results: MG, FT, LU
▶ Maximum overhead between 15% and 67%
▶ Container placement plays an important role
(Figure: FT.B execution time [secs] for 8, 16, 32 and 64 MPI processes, native versus 1, 2, 4 or 8 containers per node.)
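For reference, the overhead percentages quoted in these slides follow the usual definition of relative slowdown versus the native run; the numbers below are placeholders, not measurements from the paper:

def overhead_pct(t_container: float, t_native: float) -> float:
    """Relative overhead, in percent, of a containerized run versus native."""
    return (t_container - t_native) / t_native * 100.0

print(f"{overhead_pct(12.5, 7.5):.1f}%")   # hypothetical FT.B-like times -> 66.7%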

16. Inter-container communication
▶ container and SM: 1 physical node
▶ native: 2, 4 or 8 physical nodes
All configurations run the equivalent number of MPI processes.
(Figures: execution time [secs] for 4, 8 and 16 MPI processes, comparing native, container and SM: (a) MG Class B, (b) IS Class C.)
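A hedged sketch of how such a comparison can be driven with Open MPI's BTL selection (the authors' exact command line is not given in the slides); "sm" is the shared-memory BTL of Open MPI 1.x, and the benchmark binary path is an assumption:

import subprocess

def run_npb(btl: str, nprocs: int, binary: str = "./bin/mg.B.x") -> None:
    """Run an NPB binary, forcing a given Open MPI byte-transfer layer."""
    subprocess.run(
        ["mpirun", "-np", str(nprocs),
         "--mca", "btl", f"self,{btl}",   # "sm" = shared memory, "tcp" = over veth
         binary],
        check=True,
    )

run_npb("sm", 16)    # intra-node shared memory (the "SM" series)
run_npb("tcp", 16)   # TCP over the containers' veth devices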

17. Inter-container communication

                 LU.B           MG.C           EP.B           CG.B
                 %    time      %    time      %    time      %    time
  Native   cpu   78   11221     70   4823      79   4342      47   3286
           comm  15   2107      15   1024       3    142      39   2721
           init   7   1050      15   1045      19   1044      15   1045
  Container cpu  83   14621     84   6452      80   4682      71   4832
           comm  11   2015       3    206       2    141      14    935
           init   6   1056      14   1057      18   1051      15   1053
  SM       cpu   81   14989     80   6456      78   4595      70   4715
           comm  13   2350       7    602       4    258      14    938
           init   6   1040      13   1038      18   1038      16   1040

Table: Profile results. Time in msec.
▶ Inter-container communication is the fastest
▶ Important degradation of CPU performance for memory-bound applications
▶ Overheads relative to native: LU: 53%, MG: 53%, EP: 25%, CG: 12%, FT: 0%, IS: 0%

18. Multinode inter-container communication
▶ 16 MPI processes were run per physical machine or container
▶ We used a maximum of 32 physical machines
(Figures: execution time [secs], native versus container: (a) FT Class B with 16 to 512 MPI processes, (b) CG Class B with 16 to 128 MPI processes.)
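A small sketch (hypothetical helper and host names) of the kind of Open MPI hostfile used to pin 16 MPI processes per physical machine or per container:

def write_hostfile(hosts, slots: int = 16, path: str = "hostfile") -> None:
    """Write an Open MPI hostfile with a fixed number of slots per host."""
    with open(path, "w") as f:
        for h in hosts:
            f.write(f"{h} slots={slots}\n")

write_hostfile([f"node-{i}" for i in range(1, 33)])   # up to 32 machines
# then, e.g.: mpirun -np 512 --hostfile hostfile ./bin/ft.B.x   (illustrative)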

19. Multinode inter-container communication
▶ Benchmarks with low MPI communication: we observed a maximum overhead of 5.97% (with 512 MPI processes)
▶ Benchmarks with intensive MPI communication: we observed higher overheads, starting at 30% for the LU benchmark

20. Multinode inter-container communication
▶ A particular behavior is observed for the CG benchmark: it reaches 180% overhead when 128 MPI processes are used. The number of MPI messages sent by this benchmark increases with the number of nodes, leading to network congestion and TCP timeouts
▶ We found a way to alleviate the overhead by tuning two parameters of the Linux network stack (see the sketch below):
  ▶ TCP minimum retransmission timeout (RTO)
  ▶ TCP Selective Acknowledgments (SACK)
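A hedged sketch of those two knobs (the authors' exact values are not given in the slides): SACK is a sysctl, while the minimum RTO is a per-route option of iproute2; the route, device and timeout below are illustrative and root privileges are assumed:

import subprocess

# Disable TCP Selective Acknowledgments system-wide.
with open("/proc/sys/net/ipv4/tcp_sack", "w") as f:
    f.write("0\n")

# Lower the minimum retransmission timeout on the cluster route.
subprocess.run(
    ["ip", "route", "change", "10.0.0.0/24", "dev", "br0", "rto_min", "10ms"],
    check=True,
)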

21. Outline (section: Conclusions): 1 Introduction, 2 State of the art, 3 Experimental evaluation, 4 Conclusions, 5 Bibliography

22. In the context of HPC…
▶ We study the impact of using containers
▶ We evaluate two interesting uses of containers:
  ▶ portability of complex software stacks
  ▶ oversubscription

23. What did we find?
▶ There is important performance degradation caused by veth for Linux kernels < 3.11
▶ Container placement plays an important role under oversubscription
▶ Memory-bound applications and applications that use all-to-all MPI communication are the most affected by oversubscription
▶ Inter-container communication through veth has performance equivalent to shared-memory communication with OpenMPI
▶ Performance issues can appear only at a certain scale (e.g. 180% overhead with 128 MPI processes for the CG benchmark)

24. Future work
▶ Measure the impact of using containers on disk I/O and on other container features such as memory limitation
▶ The overhead observed could be diminished by integrating more advanced network interconnection such as Linux's macvlan, SR-IOV or Open vSwitch (http://openvswitch.org/)

25. The end. Thank you.

26. Outline (section: Bibliography): 1 Introduction, 2 State of the art, 3 Experimental evaluation, 4 Conclusions, 5 Bibliography
