Provable Multicore Schedulers with Ipanema: Application to Work-Conservation
Baptiste Lepers, Redha Gouicem, Damien Carver, Jean-Pierre Lozi, Nicolas Palix, Virginia Aponte, Willy Zwaenepoel, Julien Sopena, Julia Lawall, Gilles Muller
Work conservation: “No core should be left idle when a core is overloaded.”
[Figure: four cores; a non-work-conserving situation: core 0 is overloaded while the other cores are idle.] 2/32
Problem: Linux (CFS) suffers from work-conservation issues.
[Figure: per-core activity over time (0 to 56 seconds); some cores are mostly idle while others are mostly overloaded.] [Lozi et al. 2016] 3/32
Problem: FreeBSD (ULE) suffers from work-conservation issues.
[Figure: per-core activity over time (seconds); some cores are overloaded while others are idle.] [Bouron et al. 2018] 4/32
Problem: Work-conservation bugs are hard to detect. No crash, no deadlock, no obvious symptom. Yet: 137× slowdown on HPC applications, 23% slowdown on a database. [Lozi et al. 2016] 5/32
This talk Formally prove work-conservation 6/32
Work Conservation, Formally: (∃c. O(c)) ⇒ (∀c′. ¬I(c′))
If a core is overloaded, no core is idle. 7/32
Work Conservation, Formally: (∃c. O(c)) ⇒ (∀c′. ¬I(c′))
If a core is overloaded, no core is idle. This definition does not work for realistic schedulers! 8/32
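The sequential definition above can be sketched as a predicate over per-core load counts. This is a minimal illustration, not the paper's code; the thresholds (overloaded = more than one runnable thread, idle = none) are simplifying assumptions.

```python
# Sketch of the sequential work-conservation predicate.
# Assumption: a core is "overloaded" with > 1 runnable thread,
# "idle" with 0 (simplified thresholds, for illustration only).

def overloaded(load: int) -> bool:
    return load > 1

def idle(load: int) -> bool:
    return load == 0

def work_conserving(loads: list) -> bool:
    # (exists c. O(c)) => (forall c'. not I(c'))
    if any(overloaded(l) for l in loads):
        return not any(idle(l) for l in loads)
    return True
```

For example, `[4, 0, 1, 1]` violates the property (core 0 is overloaded while core 1 is idle), whereas `[0, 0, 1]` satisfies it vacuously since no core is overloaded.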
Challenge #1 Concurrent events & optimistic concurrency 9/32
Challenge #1: Concurrent events & optimistic concurrency.
Observe (the state of every core) → Lock (one core only, for less overhead) → Act (e.g., steal threads from the locked core), based on possibly outdated observations! 10/32
Challenge #1: Concurrent events & optimistic concurrency. [Figure: four cores; core 0 runs load balancing.] 11/32
[Figure: core 0 observes the load of every core, without taking any locks.] 12/32
Ideal scenario: no change since the observations. [Figure: core 0 locks the busiest core.] 13/32
Possible scenario: core 0 locks the “busiest” core, but the busiest core might have no thread left! (Concurrent blocks/terminations.) 14/32
[Figure: core 0 fails to steal from the “busiest” core.] 15/32
Challenge #1: Concurrent events & optimistic concurrency. Observe → Lock → Act, based on possibly outdated observations. The definition of work conservation must take concurrency into account! 16/32
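The observe/lock/act pattern above can be sketched as follows. This is an illustrative simplification, not the actual kernel code: the `Core` class and `load_balance` function are hypothetical, and the re-check under the lock shows why the lock-free observation may be stale.

```python
# Sketch of optimistic load balancing: observe without locks,
# lock only one core, then act on possibly stale observations.
# Names (Core, load_balance) are illustrative, not the paper's API.
import threading

class Core:
    def __init__(self, nthreads):
        self.lock = threading.Lock()
        self.runqueue = nthreads  # number of runnable threads

def load_balance(cores, me):
    # 1. Observe: read every core's load without taking any lock.
    loads = [c.runqueue for c in cores]
    busiest = max(range(len(cores)), key=lambda i: loads[i])
    # 2. Lock: only the busiest core (cheaper than locking all cores).
    with cores[busiest].lock:
        # 3. Act: the observation may be outdated -- threads may have
        # blocked or terminated since step 1, so re-check under the lock.
        if cores[busiest].runqueue > 1:
            cores[busiest].runqueue -= 1
            cores[me].runqueue += 1
            return True
    return False  # stale observation: nothing left to steal
```

The `return False` branch is exactly the failure case from the slides: the core that looked busiest may have no thread left by the time it is locked.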
Concurrent Work Conservation, Formally. Definition of “overloaded” with failure cases: ∃c. (O(c) ∧ ¬fork(c) ∧ ¬unblock(c) ∧ …)
If a core is overloaded (but not because a thread was concurrently created), … 17/32
Concurrent Work Conservation, Formally: ∃c. (O(c) ∧ ¬fork(c) ∧ ¬unblock(c) ∧ …) ⇒ ∀c′. ¬(I(c′) ∧ …) 18/32
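The concurrent variant can be sketched by attaching event flags to each core, so that an overload explained by a concurrent fork or wake-up does not count as a violation. The field names (`fork`, `unblock`) mirror the formula; the elided "…" conjuncts of the real definition are not filled in here.

```python
# Sketch of concurrent work conservation: a core's overload is excused
# if a concurrent event (fork, unblock, ...) explains it. The record
# layout is illustrative; the formula's elided conjuncts are omitted.
from dataclasses import dataclass

@dataclass
class CoreState:
    load: int
    fork: bool = False     # a thread was concurrently created here
    unblock: bool = False  # a thread concurrently woke up here

def concurrent_work_conserving(cores: list) -> bool:
    # exists c. (O(c) and not fork(c) and not unblock(c) and ...)
    #   => forall c'. not (I(c') and ...)
    genuinely_overloaded = any(
        c.load > 1 and not c.fork and not c.unblock for c in cores)
    if genuinely_overloaded:
        return not any(c.load == 0 for c in cores)
    return True
```

With this definition, a core that is momentarily overloaded because a thread was just forked onto it does not make an idle core elsewhere a violation, which matches how the proof tolerates in-flight events.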
Challenge #2: Existing scheduler code is hard to prove. Schedulers handle millions of events per second; historically, low-level C code. Code should be easy to prove AND efficient! ⇒ Domain-Specific Language (DSL) 21/32
DSL advantages Trade expressiveness for expertise/knowledge: Robustness: (static) verification of properties Explicit concurrency: explicit shared variables Performance: efficient compilation 22/32
DSL-based proofs. The DSL policy compiles to both WhyML code (for the proof) and C code (for the kernel module). The DSL is close to C: easy to learn, and easy to compile to WhyML and C. 23/32
DSL-based proofs Proof on all possible interleavings 24/32
DSL-based proofs: proof on all possible load-balancing interleavings. Split the code into blocks (1 block = 1 read or write to a shared variable). 25/32
Simulate the execution of concurrent blocks on N cores (with concurrent fork/terminate events). Concurrent WC must hold at the end of the load balancing. 26/32
DSL ⇒ few shared variables ⇒ tractable. Concurrent WC must always hold! 27/32
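The block-interleaving idea can be sketched as an exhaustive checker: cut each core's code into blocks (one shared-memory access each), enumerate every interleaving that preserves each core's program order, and check an invariant at the end of each run. This is a toy model of the proof strategy, not the WhyML development; the paper's proofs are mechanized, not enumerative.

```python
# Toy model of "proof on all interleavings": each core contributes a
# sequence of blocks (one shared read or write each); we enumerate
# every order-preserving interleaving and check a final invariant.

def interleavings(seqs):
    """Yield every interleaving preserving each sequence's order."""
    if all(len(s) == 0 for s in seqs):
        yield []
        return
    for i, s in enumerate(seqs):
        if s:
            rest = [t[1:] if j == i else t for j, t in enumerate(seqs)]
            for tail in interleavings(rest):
                yield [s[0]] + tail

def check_all(seqs, init_state, invariant):
    # Replay every interleaving from a fresh copy of the initial
    # shared state; the invariant must hold at the end of each run.
    for order in interleavings(seqs):
        state = dict(init_state)
        for block in order:
            block(state)  # each block = one shared-variable access
        if not invariant(state):
            return False
    return True
```

As a usage example, a non-atomic increment split into a read block and a write block on two cores fails the invariant `x == 2` (the classic lost update), which is exactly the kind of bug such exhaustive checking surfaces; because the DSL admits few shared variables, the number of blocks, and thus of interleavings, stays tractable.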
Evaluation. CFS-CWC (365 LOC): hierarchical CFS-like scheduler. CFS-CWC-FLAT (222 LOC): single-level CFS-like scheduler. ULE-CWC (244 LOC): BSD-like scheduler. 28/32
Less idle time FT.C (NAS benchmark) 29/32
Comparable or better performance NAS benchmarks (lower is better) 30/32
Comparable or better performance Sysbench on MySQL (higher is better) 31/32
Conclusion. Work conservation: not straightforward! … new formalism: concurrent work conservation. Complex concurrency scheme … proofs made tractable using a DSL. Performance: similar to or better than CFS. 32/32