Outline: Intro · What's the problem? · BaTS · Performance Evaluation

Bag-of-Tasks Scheduling under Budget Constraints
Ana-Maria Oprescu, Thilo Kielmann
Project co-funded by the EC 7th Framework Programme
Ana-Maria Oprescu, Thilo Kielmann — BaTS — 1 / 17
Bags of Tasks
◮ Example: parameter sweep applications
◮ High-throughput computing
◮ Traditional execution model:
  ◮ find resources (networks of workstations, clusters, grids, ...)
  ◮ sit in a queue
  ◮ run
◮ Generally no accounting
Clouds
◮ Elastic computing: get exactly the machines you need, exactly when you need them...
◮ ...for the price they ask
Assumptions: What's in a bag?
◮ Tasks are independent of each other
◮ Task runtimes are unknown
◮ The runtime distribution is unknown
◮ Tasks can be aborted/restarted
◮ All tasks are available for execution when the application starts
Assumptions: What's in a cloud?
◮ Several types of machines
  ◮ differing in certain properties, e.g. CPU speed, memory
◮ An upper limit on the number of machines you can get from a cloud (e.g. self-imposed)
◮ A machine is charged per Accountable Time Unit (ATU), e.g. 1 hour
◮ We use the term cluster for all the machines of the same type you can get from a certain cloud
What's the problem?
◮ Goal: run the entire bag on (cloud) clusters, within our budget
◮ Bonus goal: minimize the makespan of the whole bag, as far as the budget allows
◮ Assumptions:
  ◮ some form of runtime distribution exists
  ◮ a "pay-per-hour" economic model for resource utilization
  ◮ we have all the tasks up front
BaTS: Budget-constrained task scheduler
1) Start with a set of initial workers from each cluster
2) Run the initial sample on each cluster
3) (Re)configure based on estimates
4) Run tasks
5) At regular monitoring intervals, go back to 3)
Job Profiler: Task runtime estimates
We use estimates to characterize the bag on each machine type.
◮ Statistics for sampling with replacement
◮ For each cluster, keep a moving average:
  ◮ initialize the average with a small, initial sample
  ◮ keep the collected runtimes of sample-set tasks in an ordered list
  ◮ update the moving average during BoT execution
◮ Estimate the runtime of running tasks using the average over the "tail" of the sample set
[Plot: sample-set size n (0 to 30) as a function of BoT size N (0 to 1000)]
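The estimator sketched above can be illustrated in a few lines of Python. This is a minimal reading of the slide, not the BaTS implementation; the class name, method names, and the exact tail heuristic are invented for illustration:

```python
import bisect


class RuntimeEstimator:
    """Moving-average task-runtime estimator for one cluster (machine type)."""

    def __init__(self, initial_sample):
        # Initialize the average from a small initial sample
        # (e.g. n = 30 tasks for a bag of N = 1000).
        self.runtimes = sorted(initial_sample)  # ordered list of collected runtimes
        self.avg = sum(self.runtimes) / len(self.runtimes)

    def record(self, runtime):
        # Update the moving average as tasks complete during BoT execution,
        # keeping the runtime list ordered.
        bisect.insort(self.runtimes, runtime)
        self.avg = sum(self.runtimes) / len(self.runtimes)

    def estimate_running(self, elapsed):
        # Estimate the runtime of a task that has already run for `elapsed`
        # minutes, as the average over the "tail" of the sample set
        # (the collected runtimes longer than the elapsed time).
        tail = [t for t in self.runtimes if t > elapsed]
        if not tail:
            return elapsed  # longer than anything observed so far
        return sum(tail) / len(tail)
```

Keeping the runtimes in an ordered list makes the tail lookup straightforward as the sample grows during the run.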
Reconfigure: How many machines of which types?
◮ From the average speed of each cluster (in tasks per minute), we can compute estimates for the makespan (T_e) and the cost (B_e) of a configuration consisting of a_i nodes from each of C_max clusters:

  T_e = N / (Σ_{i=1}^{C_max} a_i / T_i);   B_e = Σ_{i=1}^{C_max} a_i · c_i · ⌈T_e / ATU⌉

  where T_i is the average task runtime and c_i the cost per ATU on cluster i.
◮ We minimize T_e while keeping B_e ≤ B using a modified Bounded Knapsack Problem (BKP) method
◮ The BKP can be solved in pseudo-polynomial time, as a 0-1 knapsack problem, via dynamic programming
◮ BaTS chooses the configuration with minimal T_e for B_e ≤ B
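The objective above can be made concrete with a small Python sketch. For clarity it enumerates the machine counts a_i by brute force instead of the modified BKP solver BaTS actually uses; the T_e and B_e formulas follow the slide, and the function and parameter names are invented:

```python
from itertools import product
from math import ceil


def best_configuration(N, budget, clusters, atu=60.0):
    """Pick machine counts a_i minimizing the makespan estimate T_e
    subject to the cost estimate B_e <= budget.

    clusters: list of (T_i, c_i, max_i) per machine type, where T_i is
    the average task runtime in minutes, c_i the cost per ATU, and
    max_i the upper limit on machines of that type.

    Illustrative brute-force search; BaTS solves this as a bounded
    knapsack problem in pseudo-polynomial time.
    """
    best = None
    ranges = [range(m + 1) for (_, _, m) in clusters]
    for alloc in product(*ranges):
        # Aggregate speed in tasks per minute: sum of a_i / T_i.
        speed = sum(a / T for a, (T, _, _) in zip(alloc, clusters))
        if speed == 0:
            continue
        T_e = N / speed
        # Cost: each machine pays c_i for every started ATU.
        B_e = sum(a * c for a, (_, c, _) in zip(alloc, clusters)) * ceil(T_e / atu)
        if B_e <= budget and (best is None or T_e < best[0]):
            best = (T_e, B_e, alloc)
    return best
```

With two hypothetical machine types, e.g. `[(15.0, 1.0, 2), (30.0, 0.4, 2)]`, a generous budget lets the search pick all machines of both types, while a tight budget forces a smaller, cheaper configuration.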
Cluster monitoring and BoT execution progress
BaTS regularly re-evaluates the current cluster configuration:
◮ At each monitoring interval, the problem gets smaller (fewer tasks left, less budget left)
◮ Each moving average converges during the run
◮ Execution on real machines adds some complexity:
  ◮ machines are individually requested from the cloud provider, with a startup time until ready
  ◮ each machine has a different amount of time left in its current ATU
  ◮ runtime granularity ⇒ paid machine time possibly left unused
◮ Throughout bag execution, BaTS keeps track of:
  ◮ time on machines we have already paid for
  ◮ actual speed (tasks/minute) achieved per cluster
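The per-machine ATU bookkeeping mentioned above can be sketched as follows. This is a hypothetical helper illustrating the kind of accounting involved, not BaTS's actual code: given each machine's start time, it reports how many ATUs have been charged so far and how much already-paid time remains in the current ATU:

```python
from math import ceil


def atu_accounting(now, start_times, atu=60.0):
    """For each machine started at start_times[i] (minutes), return
    (paid_atus, paid_time_left): the number of ATUs charged so far and
    the paid-for time not yet used in the current ATU.
    Illustrative sketch only.
    """
    report = []
    for start in start_times:
        elapsed = now - start
        # Charging begins with the first ATU, even for a freshly started machine.
        paid_atus = max(1, ceil(elapsed / atu))
        left = paid_atus * atu - elapsed
        report.append((paid_atus, left))
    return report
```

Because each machine entered its ATU at a different moment, the `left` values differ per machine, which is exactly why the reconfiguration step has to consider them individually.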
Evaluation Setup: workloads and clouds
◮ Synthetic workloads
  ◮ N = 1000 tasks ⇒ n = 30 (sample-set size)
  ◮ Normal distribution of runtimes: avg = 15 min, st. dev. = 2.23
  ◮ Iosup et al. show bags typically have some normal distribution [The performance of bags-of-tasks in large-scale distributed systems]
  ◮ Tasks sleep for a defined "run" time
◮ Cloud emulation on DAS-3
  ◮ 2 clouds, 32 machines each
  ◮ fast/slow machines emulated by modifying the sleep time
  ◮ allocated through the local site scheduler (without competing users)
  ◮ Accountable Time Unit = 1 hour
◮ Compare BaTS to a self-scheduler (RR)
Evaluation Setup — Profitability: how much faster vs. how much costlier
◮ We propose 5 different scenarios, varying the speed and cost of cluster 2 (c_2) relative to the normalized speed and cost of cluster 1 (c_1):

  profitability of c_2 w.r.t. c_1   speed   cost
  0.25                              1       4
  0.75                              3       4
  1                                 1       1
  1.33                              4       3
  4                                 4       1

◮ We evaluate each scenario by running:
  ◮ the self-scheduler (RR), always using 32+32 machines
  ◮ BaTS on an initial configuration of 30+30 machines, provided with:
    ◮ budget B_BaTS^RR = the cost incurred by running RR (C_RR)
    ◮ budget B_BaTS^BMin, computed off-line as the cost incurred by running the bag on a machine of the most profitable type
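The off-line budget B_BaTS^BMin can be reconstructed as a short Python sketch. This is an illustrative reading of the slide's definition, not the authors' code: for each machine type, run all N tasks of average runtime T_i on a single machine, charged c_i per started ATU, and take the cheapest type:

```python
from math import ceil


def budget_bmin(N, clusters, atu=60.0):
    """Cheapest cost of running the whole bag on one machine of the most
    profitable type. clusters: list of (T_i, c_i) = (avg task runtime in
    minutes, cost per ATU) per machine type. Illustrative sketch only.
    """
    return min(c * ceil(N * T / atu) for (T, c) in clusters)
```

For example, with hypothetical types `(15.0, 1.0)` and `(30.0, 0.4)` and N = 1000 tasks, the slower but much cheaper type wins, which is what makes it the "most profitable" one.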
Results - Makespan (M), Cost (C) and Budget (B)
[Three slides of result charts comparing makespan, cost, and budget across the five scenarios; the figures are not recoverable in this text version.]
Conclusions
◮ Choosing the cloud resources suitable for your application is tough
◮ BaTS can help stay within budget while still performing reasonably well
◮ Limitation: a proper budget has to be guessed up front
◮ Current work: fixing this limitation by (even smaller) pre-sampling
  ◮ early results are promising
◮ Future work:
  ◮ DAGs instead of BoTs (dependencies)
  ◮ BaTS for MapReduce?
Contrail
Related work: assumptions we don't make
◮ prior knowledge of task arrival rate, execution time, or deadline
◮ a single complexity class for all tasks, plus a calibration step to estimate execution time per machine type
◮ prior knowledge of the relative complexity classes of tasks
◮ a fixed, one-time cost per machine type
Snapshot

[amo@fs0 ~]$ preserve -llist
Thu Oct 21 02:40:06 2010
id      user      start        stop         state  nhosts  hosts
1152334 vpopescu  10/15 14:45  12/24 00:25  r      1       node010
1152611 ppouwels  10/20 20:00  10/21 08:00  r      1       node030
1152607 ppouwels  10/20 20:00  10/21 08:00  r      1       node059
1152608 ppouwels  10/20 20:00  10/21 08:00  r      1       node060
1152633 ppouwels  10/21 00:22  10/21 12:22  r      1       node062
1152606 ppouwels  10/20 20:00  10/21 08:00  r      1       node068
1152634 ppouwels  10/21 01:01  10/21 13:01  r      1       node078
1152604 mcd       10/20 17:01  10/21 23:02  r      1       node076

[amo@fs0 ~]$ finger ppouwels
Login: ppouwels            Name: Petra Pouwels
Directory: /home5/ppouwels Shell: /bin/bash
Office: VUMC, PJW.Pouwels@vumc.nl
Never logged in. No mail. No Plan.

[amo@fs0 ~]$ finger vpopescu
Login: vpopescu            Name: Veronica Popescu
Directory: /home5/vpopescu Shell: /bin/bash
Office: VUMC, v.popescu@vumc.nl
Never logged in. No mail. No Plan.

[amo@fs0 ~]$