Sampling Effect on Performance Prediction of Configurable Systems : - PowerPoint PPT Presentation

Sampling Effect on Performance Prediction of Configurable Systems : A Case Study Juliana Alves Pereira, Mathieu Acher, Hugo Martin, Jean-Marc Jezequel 1

Configurable systems Pros ● Adaptive ● Lots of options Cons ● Lots of options (and interactions) ● Increasingly complex Machine learning to the rescue 2

Machine Learning : Sampling, Measure, Learning, Validating Sampling Measuring Validation Learning 3

Distance-Based Sampling of Software Configuration Spaces ● C. Kaltenecker, A. Grebhahn, N. Siegmund, J. Guo and S. Apel, "Distance-Based Sampling of Software Configuration Spaces," 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE) , Montreal, QC, Canada, 2019, pp. 1084-1094. ● Proposing a new sampling solution : Distance-Based Sampling ● Empirical study on 10 subject systems and 6 sampling strategies 4

Sampling strategies ● Coverage-based ● Solver-based ● Randomized solver-based ● Random ● Distance-based ● Diversified distance-based 5

Subject systems Experiment setup ● 7z ● Machine learning based on multiple ● BerkeleyDB-C linear regression and feature-forward ● Dune MGS selection ● HIPAcc ● Mean Relative Error (MRE) ● Java GC ● LLVM ● LRZIP ● Polly ● VPXENC ● x264 6

Results ● Coverage-based is dominant at low sample size ● Diversified distance-based is dominant on higher sample size ● Diversified distance-based is close to random sampling accuracy, even better in some cases 7

Is it true?

Replicating the experiment ● Subject system : x264, video encoder ● Changing the input video : 17 videos ● Changing the measured non-functional property 9

Experimental setup What does vary? ● Sampling strategy (6 strategies) ● Sample size (3 sample size) ● Encoded video (17 videos) 🔵 ● System configuration (1152 configurations) ● Measured property (Encoding time, encoding size) 🔵 What doesn’t vary? ● Learning algorithm (Multiple Linear Regression) ● Learning algorithm hyperparameters Configurable Software (x264) 🔵 ● ● Version 🔶 Hardware 🔶 ● 10

Results ● High variation between videos, between non-functional properties ● Encoding time : ○ Similar results ○ Random sampling dominant over Diversified Distance-based sampling ● Encoding size : ○ Random sampling and randomized solver-based sampling overall dominant ○ Most strategies present good and similar accuracy for higher sample size 11

11 Results table for encoding time

11 Results table for encoding size

Results 11

Replicability ● Fully replicable experiment ● Dataset for video encoding time and size available ● Docker image with all data and scripts for performance prediction and results aggregation : https://github.com/jualvespereira/ICPE2020 12

What’s next? ● How do version and hardware affect the sampling effectiveness? ● How does machine learning technique affect the sampling effectiveness? ● How to leverage the fact that some sampling strategies overperform by focusing on important options? 13

Conclusion ● Random sampling is a strong baseline, hard to challenge ● Diversified distance-based sampling is a strong alternative ● Researchers should be aware that effectiveness of sampling strategies can be biased by inputs and performance property used 14

Sampling Effect on Performance Prediction of Configurable Systems : - PowerPoint PPT Presentation

Sampling Effect on Performance Prediction of Configurable Systems : A Case Study Juliana Alves Pereira, Mathieu Acher, Hugo Martin, Jean-Marc Jezequel 1 Configurable systems Pros Adaptive Lots of options Cons Lots of

Sampling Effect on Performance Prediction of Configurable Systems : A Case Study Juliana Alves

Fibre Optic Multiplexer Configurable The What is the Badger Fully configurable Audio/Data

Overview of Overview of configurable architectures configurable architectures Prof. Kurt

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

What is the strengths and weakness of these sampling methods? Sampling Strengths /

Performance Prediction of Configurable Software Systems by Fourier Learning Yi Zhang, Jianmei

Dual-Mode Configurable RISC-V Processor IP Nuclei System Technology Dual-Mode

Designing a Web of Highly-Configurable Designing a Web of Highly-Configurable Intrusion Detection

Configurable software- -based based Configurable software edge router architecture edge router

An Architecture for An Architecture for Configurable Dependability of Configurable Dependability

Reinforcement Learning in Configurable Continuous Environments Alberto Maria Metelli, Emanuele

Maca a configurable tool to Maca a configurable tool to integrate Polish morphological

A Configurable Hardware Scheduler A Configurable Hardware Scheduler (CHS) for Real- -Time

Sampling Overview R toy sampling Non-probability sampling Probability Methods (AKA random)

PTSD as a Shame Disorder (A Work in Progress) Judith L. Herman, M.D. ISTSS Webinar, October 2014

CS7015 (Deep Learning) : Lecture 6 Eigen Values, Eigen Vectors, Eigen Value Decomposition,

Epistasis 02-715 Advanced Topics in Computa8onal Genomics

Understanding the Fundamentals: The Language of Genetics Genetics Webinar Series for Blue Plans

(Basic Concepts) CSC304 - Nisarg Shah 1 Game Theory How do rational, self-interested agents

DRFQ : Multi-Resource Fair Queueing for Packet Processing Ali Ghodsi 1,3 , Vyas Sekar 2 , Matei

Lecture 5: The animal kingdom of heuristics: Admissible, Consistent, zero, Relaxed,

Chapter 3 State Feedback - Pole Placement Motivation Whereas classical control theory is based

Sampling Effect on Performance Prediction of Configurable Systems : - PowerPoint PPT Presentation

Sampling Effect on Performance Prediction of Configurable Systems : A Case Study Juliana Alves Pereira, Mathieu Acher, Hugo Martin, Jean-Marc Jezequel 1 Configurable systems Pros Adaptive Lots of options Cons Lots of

Sampling Effect on Performance Prediction of Configurable Systems : A Case Study Juliana Alves

Fibre Optic Multiplexer Configurable The What is the Badger Fully configurable Audio/Data

Overview of Overview of *configurable* architectures *configurable* architectures Prof. Kurt

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

What is the strengths and weakness of these sampling methods? Sampling Strengths /

Performance Prediction of Configurable Software Systems by Fourier Learning Yi Zhang, Jianmei

Dual-Mode Configurable RISC-V Processor IP Nuclei System Technology Dual-Mode

Designing a Web of Highly-Configurable Designing a Web of Highly-Configurable Intrusion Detection

Configurable software- -based based Configurable software edge router architecture edge router

An Architecture for An Architecture for Configurable Dependability of Configurable Dependability

Reinforcement Learning in Configurable Continuous Environments Alberto Maria Metelli, Emanuele

Maca a configurable tool to Maca a configurable tool to integrate Polish morphological

A Configurable Hardware Scheduler A Configurable Hardware Scheduler (CHS) for Real- -Time

Sampling Overview R toy sampling Non-probability sampling Probability Methods (AKA random)

PTSD as a Shame Disorder (A Work in Progress) Judith L. Herman, M.D. ISTSS Webinar, October 2014

CS7015 (Deep Learning) : Lecture 6 Eigen Values, Eigen Vectors, Eigen Value Decomposition,

Epistasis 02-715 Advanced Topics in Computa8onal Genomics

Understanding the Fundamentals: The Language of Genetics Genetics Webinar Series for Blue Plans

(Basic Concepts) CSC304 - Nisarg Shah 1 Game Theory How do rational, self-interested agents

DRFQ : Multi-Resource Fair Queueing for Packet Processing Ali Ghodsi 1,3 , Vyas Sekar 2 , Matei

Lecture 5: The animal kingdom of heuristics: Admissible, Consistent, zero, Relaxed,

Chapter 3 State Feedback - Pole Placement Motivation Whereas classical control theory is based

Overview of Overview of configurable architectures configurable architectures Prof. Kurt