DWS: Demand-aware Work-Stealing in Multi-programmed Multi-core - PowerPoint PPT Presentation

DWS: Demand-aware Work-Stealing in Multi-programmed Multi-core Architectures � Quan Chen, Long Zheng, Minyi Guo Shanghai Jiao Tong University, China � 1 PMAM 2014

Outline � • Background • Problem & Motivation • Demand-aware Work-Stealing (DWS) • Evaluation • Conclusions � 2

Background � � Hardware: Multi-core/Many-core Architectures � Scenario: Multiple parallel programs � … P 1 … P i … P n 3

Background-parallel programs � � Traditional parallel programs • Hard to adjust the number of threads at runtime � Task-based parallel programs • Dynamic task scheduling � 4

Work-sharing � Task Task Central task pool Task Task Task Unlock Lock Unlock Lock Worker 1 Worker 2 Worker 3 Worker 4 Lock the central task pool when getting a task 5

Work-stealing � Unlock Lock Task Task Task Task Task Task Task Task Task Task Task Thread 1 Thread 2 Thread 4 Thread 3 6

Problem & Motivation � � Aggressive feature of work-stealing • On a k -core computer, k threads/workers are launched � Existing solutions • Time-sharing - ABP yielding mechanism • Space-sharing - Equal-partitioning � 7

Time-sharing � � ABP yielding mechanism • If a thread fails to steal a task, it goes to sleep � Sleep Active Thread 3 Thread 2 Thread 1 C Cache 8

Space-sharing � � Equal-partitioning mechanism � If m programs co-run on a k -core computer, each program is allocated k/m cores. � … … … P 1 P i P m 9

Demand-aware Work-Stealing (DWS) � � Start from Equal-partitioning � Dynamically balance cores at runtime • If p i cannot fully-utilized a core, it release the core • If p i has too many tasks, it tries to obtain more cores � Obtain Release Runtime Arch. of DWS 10

Stealing algorithm - (Release) � � A worker decides whether to release its core by itself � If a worker fails too many times (T_SLEEP) to steal a new task, it goes to sleep 11

Coordinator - (Obtain) � � The coordinator decides whether to obtain more cores • If a program has too many queued tasks, it should try to get some free cores � How Which? Many? C1: The more queued tasks in a program, the more cores should the program obtain C2: A program can take its allocated cores back C3: A program cannot obtain the busy cores 12

Coordinator - How Many? � � C1: The more queued tasks in a program, the more cores should the program obtain � Num of active workers � N a � Num of queued tasks � N b � How many: Num of free cores � N f � Num of released cores � N r � Num of cores expected � N w � 13

Coordinator - Which? � � N w <= N f • Randomly select N w free cores � N f < N w <= N f +N r (C2) • Select N f free cores + its ( N w -N f ) released core � N w > N f +N r (C3) � • N f free cores+its N r released cores Num of active workers � N a � Num of queued tasks � N b � Num of free cores � N f � Num of released cores � N r � Num of cores expected � N w � 14

Evaluation platform � � A Dual-socket Quad-core computer with Hyper- Threading Technology � Each socket is a Quad-Core Intel Xeon E5620 � Hardware & Configuration � Size/Version � L1/L2 cache size (each core) � 256 KB/1MB � L3 cache size (each socket) � 12 MB � Main memory size � 32 GB � Operation system � Linux 2.6.32-38 � 15

Benchmarks � Calculate execution time: 16

Performance of DWS � DWS can significantly improve the performance of the benchmarks 17

Effectiveness of the coordinator � Without the coordinator, the performance of the benchmarks is degraded 18

Impact of T_SLEEP � We should choose T_SLEEP = k or 2k on a k-core computer 19

Contributions & conclusions � • A modified work-stealing algorithm that enables a program to release the under-utilized cores. • A coordinator to manage the workers. It enables a program to grab and use the under-utilized cores released by other programs. • We have implemented DWS, which achieves a performance gain of up to 32.3% in the best cases compared to traditional work-stealing schedulers. � 20

Thanks! Questions? �

DWS: Demand-aware Work-Stealing in Multi-programmed Multi-core - PowerPoint PPT Presentation

DWS: Demand-aware Work-Stealing in Multi-programmed Multi-core Architectures Quan Chen, Long Zheng, Minyi Guo Shanghai Jiao Tong University, China 1 PMAM 2014 Outline Background Problem & Motivation Demand-aware

WORK STEALING SCHEDULER 2 6/16/2010 Work Stealing Scheduler

Questions? john@dwagents.com 1 Why Sell DWs and Hs? DWs continue exploding in popularity

Body Fluids And Electrolytes A Programmed Presentation Page 1/114 1041152 Body Fluids And

Susan Sidamon-Eristoff Sources: Nicky McLeod (ERS/UCPP) Kululwa Mkosana (DWS) Pearl Gola

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Asymmetry-Aware Work-Stealing Runtimes Christopher Torng, Moyang Wang, and Christopher Batten

Toolkit to Support Intelligibility in Context Aware Applications Context-Aware Applications P

Aug 09 2020 Aug 09 2020 body-fluids-and-electrolytes-a-programmed-presentation

Philosophical Foundations Weak AI claim: computers can be programmed to act as if they

Demand Aware Network ( DAN ) Design Some Results and Open Questions Chen Avin Joint work with

Phaunos Timber Fund Limited Meeting with DWS 13 September 2018 Background Phaunos

CABRI Workshop (WASH) PRESENTATION TITLE Presented by: DWS - SUSTAINABLE DEVELOPMENT GOAL 6

Understanding Task Scheduling Algorithms Kenjiro Taura 1 / 51 Contents 1 Introduction 2 Work

Peak Energy Management (PEMa) System November 2019 Demand-Supply 2018 Daily Demand/Supply

Chapter 4 - Demand Maybach Exelero Section 1 Understanding Demand Demand The desire to

Law of Demand.notebook November 03, 2014 Supply and Demand Supply and Demand Economic model

Human Competitiveness of Genetic Programming for Spectrum Based Fault Localisation Shin Yoo 1 ,

Phase transitions and critical behavior in 2D Dirac materials Laura Classen Heidelberg, March

On the Goodwillie Derivatives of the Identity in Structured Ring Spectra Duncan Clark Ohio State

Byzantine Vector Consensus in Complete Graphs Nitin Vaidya University of

GRAVITY DUALS OF 2D SUSY GAUGE THEORIES BASED ON: 0909.XXXX with E. Conde and A.V. Ramallo

A general S -unit equation solver and tables of elliptic curves over number fields Benjamin

REVIEW TALK (2+1)d dualities with N = 2 supersymmetry Antonio Amariti INFN - Sezione di Milano

Channel Estimation Schemes for OFDM Relay-Assisted System Darlene Maciel, C. Ribeiro, A. Silva e

Sambuz

Useful Links

Newsletter

Mail Us