Automating the Configuration of Algorithms for Solving Hard - PowerPoint PPT Presentation

Simple manual approach for configuration Start with some parameter configuration Modify a single parameter 12

Simple manual approach for configuration Start with some parameter configuration Modify a single parameter if results on benchmark set improve then keep new configuration 12

Simple manual approach for configuration Start with some parameter configuration repeat Modify a single parameter if results on benchmark set improve then keep new configuration until no more improvement possible (or “good enough”) 12

Simple manual approach for configuration Start with some parameter configuration repeat Modify a single parameter if results on benchmark set improve then keep new configuration until no more improvement possible (or “good enough”) � Manually-executed local search 12

The ParamILS Framework Iterated Local Serach in parameter configuration space: Choose initial parameter configuration θ Perform subsidiary local search on θ 13

The ParamILS Framework Iterated Local Serach in parameter configuration space: Choose initial parameter configuration θ Perform subsidiary local search on θ While tuning time left: | θ ′ := θ | | | Perform perturbation on θ | | Perform subsidiary local search on θ 13

The ParamILS Framework Iterated Local Serach in parameter configuration space: Choose initial parameter configuration θ Perform subsidiary local search on θ While tuning time left: | θ ′ := θ | | | Perform perturbation on θ | | Perform subsidiary local search on θ | | | | Based on acceptance criterion , | keep θ or revert to θ := θ ′ | 13

The ParamILS Framework Iterated Local Serach in parameter configuration space: Choose initial parameter configuration θ Perform subsidiary local search on θ While tuning time left: | θ ′ := θ | | | Perform perturbation on θ | | Perform subsidiary local search on θ | | | | Based on acceptance criterion , | keep θ or revert to θ := θ ′ | | | ⌊ With probability p restart randomly pick new θ � Performs biased random walk over local optima 13

Instantiations of ParamILS Framework How to evaluate each configuration? ◮ BasicILS( N ): perform fixed number of N runs to evaluate a configuration θ – Blocking: use same N (instance, seed) pairs for each θ 14

Instantiations of ParamILS Framework How to evaluate each configuration? ◮ BasicILS( N ): perform fixed number of N runs to evaluate a configuration θ – Blocking: use same N (instance, seed) pairs for each θ ◮ FocusedILS: adaptive choice of N ( θ ) – small N ( θ ) for poor configurations θ – large N ( θ ) only for good θ 14

Instantiations of ParamILS Framework How to evaluate each configuration? ◮ BasicILS( N ): perform fixed number of N runs to evaluate a configuration θ – Blocking: use same N (instance, seed) pairs for each θ ◮ FocusedILS: adaptive choice of N ( θ ) – small N ( θ ) for poor configurations θ – large N ( θ ) only for good θ – typically outperforms BasicILS 14

Empirical Comparison to Previous Configuration Procedure CALIBRA system [Adenso-Diaz & Laguna, ’06] ◮ Based on fractional factorial designs ◮ Limited to continuous parameters ◮ Limited to 5 parameters 15

Empirical Comparison to Previous Configuration Procedure CALIBRA system [Adenso-Diaz & Laguna, ’06] ◮ Based on fractional factorial designs ◮ Limited to continuous parameters ◮ Limited to 5 parameters Empirical comparison ◮ FocusedILS typically did better, never worse ◮ More importantly, much more general 15

Adaptive Choice of Cutoff Time ◮ Evaluation of poor configurations takes especially long 16

Adaptive Choice of Cutoff Time ◮ Evaluation of poor configurations takes especially long ◮ Can terminate evaluations early ◮ Incumbent solution provides bound ◮ Can stop evaluation once bound is reached 16

Adaptive Choice of Cutoff Time ◮ Evaluation of poor configurations takes especially long ◮ Can terminate evaluations early ◮ Incumbent solution provides bound ◮ Can stop evaluation once bound is reached ◮ Results – Provably never hurts – Sometimes substantial speedups (factor 10) 16

Outline 1. Problem Definition & Intuition 2. Model-Free Search for Algorithm Configuration ParamILS: Iterated Local Search in Configuration Space “Real-World” Applications of ParamILS 3. Model-Based Search for Algorithm Configuration 4. Conclusions 17

Configuration of ILOG CPLEX ◮ Recall: 63 parameters, 1 . 78 × 10 38 possible configurations ◮ Ran FocusedILS for 2 days on 10 machines 18

Configuration of ILOG CPLEX ◮ Recall: 63 parameters, 1 . 78 × 10 38 possible configurations ◮ Ran FocusedILS for 2 days on 10 machines ◮ Compared against default “A great deal of algorithmic development effort has been devoted to establishing default ILOG CPLEX parameter settings that achieve good performance on a wide variety of MIP models.” [CPLEX 10.0 user manual, page 247] 18

Configuration of ILOG CPLEX ◮ Recall: 63 parameters, 1 . 78 × 10 38 possible configurations ◮ Ran FocusedILS for 2 days on 10 machines ◮ Compared against default “A great deal of algorithmic development effort has been devoted to establishing default ILOG CPLEX parameter settings that achieve good performance on a wide variety of MIP models.” [CPLEX 10.0 user manual, page 247] 4 10 3 10 2 10 Auto−tuned 1 10 0 10 −1 10 −2 10 −2 10 −1 10 0 1 2 3 4 10 10 10 10 10 Default Combinatorial auctions: 7-fold speedup 18

Configuration of ILOG CPLEX ◮ Recall: 63 parameters, 1 . 78 × 10 38 possible configurations ◮ Ran FocusedILS for 2 days on 10 machines ◮ Compared against default “A great deal of algorithmic development effort has been devoted to establishing default ILOG CPLEX parameter settings that achieve good performance on a wide variety of MIP models.” [CPLEX 10.0 user manual, page 247] 4 4 10 10 3 3 10 10 2 2 10 10 Auto−tuned Auto−tuned 1 1 10 10 0 0 10 10 −1 −1 10 10 −2 −2 10 10 −2 10 −1 10 −2 10 −1 10 0 1 2 3 4 0 1 2 3 4 10 10 10 10 10 10 10 10 10 10 Default Default Combinatorial auctions: 7-fold speedup Mixed integer knapsack: 23-fold speedup 18

Configuration of SAT Solver for Verification SAT (propositional satisfiability problem) – Prototypical NP -hard problem – Interesting theoretically and in practical applications 19

Configuration of SAT Solver for Verification SAT (propositional satisfiability problem) – Prototypical NP -hard problem – Interesting theoretically and in practical applications Formal verification – Bounded model checking – Software verification – Recent progress based on SAT solvers 19

Configuration of SAT Solver for Verification SAT (propositional satisfiability problem) – Prototypical NP -hard problem – Interesting theoretically and in practical applications Formal verification – Bounded model checking – Software verification – Recent progress based on SAT solvers Spear, tree search solver for industrial SAT instances – 26 parameters, 8 . 34 × 10 17 configurations 19

Configuration of SAT Solver for Verification ◮ Ran FocusedILS for 2 days on 10 machines 20

Configuration of SAT Solver for Verification ◮ Ran FocusedILS for 2 days on 10 machines ◮ Compared to manually-engineered default – 1 week of performance tuning – competitive with the state of the art 20

Configuration of SAT Solver for Verification ◮ Ran FocusedILS for 2 days on 10 machines ◮ Compared to manually-engineered default – 1 week of performance tuning – competitive with the state of the art SPEAR, optimized for IBM−BMC (s) 4 10 3 10 2 10 1 10 0 10 −1 10 −2 10 −2 10 −1 10 0 10 1 10 2 10 3 10 4 10 SPEAR, original default (s) IBM Bounded Model Checking: 4 . 5-fold speedup 20

Configuration of SAT Solver for Verification ◮ Ran FocusedILS for 2 days on 10 machines ◮ Compared to manually-engineered default – 1 week of performance tuning – competitive with the state of the art SPEAR, optimized for IBM−BMC (s) 4 10 4 SPEAR, optimized for SWV (s) 10 3 10 3 10 2 2 10 10 1 1 10 10 0 0 10 10 −1 −1 10 10 −2 −2 10 10 −2 10 −1 10 0 10 1 10 2 10 3 10 −2 10 −1 10 0 10 1 10 2 10 3 10 4 4 10 10 SPEAR, original default (s) SPEAR, original default (s) IBM Bounded Model Checking: Software verification: 500-fold speedup 4 . 5-fold speedup � won 2007 SMT competition 20

Other Fielded Applications of ParamILS ◮ SAPS, local search for SAT � 8-fold and 130-fold speedup 21

Other Fielded Applications of ParamILS ◮ SAPS, local search for SAT � 8-fold and 130-fold speedup ◮ SAT4J, tree search for SAT � 11-fold speedup 21

Other Fielded Applications of ParamILS ◮ SAPS, local search for SAT � 8-fold and 130-fold speedup ◮ SAT4J, tree search for SAT � 11-fold speedup ◮ GLS + for Most Probable Explanation (MPE) problem � > 360-fold speedup 21

Other Fielded Applications of ParamILS ◮ SAPS, local search for SAT � 8-fold and 130-fold speedup ◮ SAT4J, tree search for SAT � 11-fold speedup ◮ GLS + for Most Probable Explanation (MPE) problem � > 360-fold speedup ◮ Applications by others – Protein folding [Thatchuk, Shmygelska & Hoos ’07] – Time-tabling [Fawcett, Hoos & Chiarandini ’09] – Local Search for SAT [Khudabukhsh, Xu, Hoos, & Leyton-Brown ’09] 21

Other Fielded Applications of ParamILS ◮ SAPS, local search for SAT � 8-fold and 130-fold speedup ◮ SAT4J, tree search for SAT � 11-fold speedup ◮ GLS + for Most Probable Explanation (MPE) problem � > 360-fold speedup ◮ Applications by others – Protein folding [Thatchuk, Shmygelska & Hoos ’07] – Time-tabling [Fawcett, Hoos & Chiarandini ’09] – Local Search for SAT [Khudabukhsh, Xu, Hoos, & Leyton-Brown ’09] � demonstrates versatility & maturity 21

Outline 1. Problem Definition & Intuition 2. Model-Free Search for Algorithm Configuration 3. Model-Based Search for Algorithm Configuration State of the Art Improvements for Stochastic Blackbox Optimization Beyond Stochastic Blackbox Optimization 4. Conclusions 22

Model-Based Optimization: Motivation Fundamentally different approach for algorithm configuration ◮ So far: discussed local search approach ◮ Now: alternative choice, based on predictive models 23

Model-Based Optimization: Motivation Fundamentally different approach for algorithm configuration ◮ So far: discussed local search approach ◮ Now: alternative choice, based on predictive models – Model-based optimization was less well developed � emphasis on methodological improvements 23

Model-Based Optimization: Motivation Fundamentally different approach for algorithm configuration ◮ So far: discussed local search approach ◮ Now: alternative choice, based on predictive models – Model-based optimization was less well developed � emphasis on methodological improvements ◮ In then end: state-of-the-art configuration tool 23

Model-Based Deterministic Blackbox Optimization (BBO) EGO algorithm [Jones, Schonlau & Welch ’98] 30 . . 25 True function . . 20 response y 15 10 5 0 −5 0 0.2 0.4 0.6 0.8 1 parameter x 25

Model-Based Deterministic Blackbox Optimization (BBO) EGO algorithm [Jones, Schonlau & Welch ’98] 1. Get response values at initial design points 30 . 25 True function Function evaluations . 20 response y 15 10 5 0 −5 0 0.2 0.4 0.6 0.8 1 parameter x 25

Model-Based Deterministic Blackbox Optimization (BBO) EGO algorithm [Jones, Schonlau & Welch ’98] 1. Get response values at initial design points 30 . . 25 . Function evaluations . 20 response y 15 10 5 0 −5 0 0.2 0.4 0.6 0.8 1 parameter x 25

Model-Based Deterministic Blackbox Optimization (BBO) EGO algorithm [Jones, Schonlau & Welch ’98] 1. Get response values at initial design points 2. Fit a model to the data 30 DACE mean prediction DACE mean +/− 2*stddev 25 . Function evaluations . 20 response y 15 10 5 0 −5 0 0.2 0.4 0.6 0.8 1 parameter x 25

Model-Based Deterministic Blackbox Optimization (BBO) EGO algorithm [Jones, Schonlau & Welch ’98] 1. Get response values at initial design points 2. Fit a model to the data 3. Use model to pick most promising next design point 30 DACE mean prediction DACE mean +/− 2*stddev 25 . Function evaluations EI (scaled) 20 response y 15 10 5 0 −5 0 0.2 0.4 0.6 0.8 1 parameter x 25

Model-Based Deterministic Blackbox Optimization (BBO) EGO algorithm [Jones, Schonlau & Welch ’98] 1. Get response values at initial design points 2. Fit a model to the data 3. Use model to pick most promising next design point 4. Repeat 2. and 3. until time is up 30 DACE mean prediction DACE mean +/− 2*stddev 25 True function Function evaluations EI (scaled) 20 response y 15 10 5 0 −5 0 0.2 0.4 0.6 0.8 1 parameter x 25

Model-Based Deterministic Blackbox Optimization (BBO) EGO algorithm [Jones, Schonlau & Welch ’98] 1. Get response values at initial design points 2. Fit a model to the data 3. Use model to pick most promising next design point 4. Repeat 2. and 3. until time is up 30 30 DACE mean prediction DACE mean prediction DACE mean +/− 2*stddev DACE mean +/− 2*stddev 25 True function 25 True function Function evaluations Function evaluations EI (scaled) EI (scaled) 20 20 response y response y 15 15 10 10 5 5 0 0 −5 −5 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 parameter x parameter x First step Second step 25

Stochastic Blackbox Optimization (BBO): State of the Art Extensions of EGO algorithm for stochastic case – Sequential Parameter Optimization (SPO) [Bartz-Beielstein, Preuss, Lasarczyk, ’05-’09] – Sequential Kriging Optimization (SKO) [Huang, Allen, Notz & Zeng, ’06] 26

Stochastic Blackbox Optimization (BBO): State of the Art Extensions of EGO algorithm for stochastic case – Sequential Parameter Optimization (SPO) [Bartz-Beielstein, Preuss, Lasarczyk, ’05-’09] – Sequential Kriging Optimization (SKO) [Huang, Allen, Notz & Zeng, ’06] Application domain for stochastic BBO ◮ Randomized algorithms with continuous parameters ◮ Optimization for single instances 26

Stochastic Blackbox Optimization (BBO): State of the Art Extensions of EGO algorithm for stochastic case – Sequential Parameter Optimization (SPO) [Bartz-Beielstein, Preuss, Lasarczyk, ’05-’09] – Sequential Kriging Optimization (SKO) [Huang, Allen, Notz & Zeng, ’06] Application domain for stochastic BBO ◮ Randomized algorithms with continuous parameters ◮ Optimization for single instances Empirical Evaluation ◮ SPO more robust 26

Improvements for stochastic BBO I: Studied SPO components ◮ Improved component: “intensification mechanism” – Increase N ( θ ) similarly as in FocusedILS – Improved robustness 28

Improvements for stochastic BBO I: Studied SPO components ◮ Improved component: “intensification mechanism” – Increase N ( θ ) similarly as in FocusedILS – Improved robustness II: Better Models ◮ Compared various probabilistic models – Model SPO uses – Approximate Gaussian process (GP) – Random forest (RF) 28

Improvements for stochastic BBO I: Studied SPO components ◮ Improved component: “intensification mechanism” – Increase N ( θ ) similarly as in FocusedILS – Improved robustness II: Better Models ◮ Compared various probabilistic models – Model SPO uses – Approximate Gaussian process (GP) – Random forest (RF) ◮ New models much better – Resulting configuration procedure: ActiveConfigurator – Improved state of the art for model-based stochastic BBO 28

Improvements for stochastic BBO I: Studied SPO components ◮ Improved component: “intensification mechanism” – Increase N ( θ ) similarly as in FocusedILS – Improved robustness II: Better Models ◮ Compared various probabilistic models – Model SPO uses – Approximate Gaussian process (GP) – Random forest (RF) ◮ New models much better – Resulting configuration procedure: ActiveConfigurator – Improved state of the art for model-based stochastic BBO – Randomized algorithm with continuous parameters – Optimization for single instances 28

Extension I: Categorical Parameters Models that can handle categorical inputs ◮ Random forests: out of the box ◮ Extended (approximate) Gaussian processes – new kernel based on weighted Hamming distance 30

Extension I: Categorical Parameters Models that can handle categorical inputs ◮ Random forests: out of the box ◮ Extended (approximate) Gaussian processes – new kernel based on weighted Hamming distance Application domain ◮ Algorithms with categorical parameters ◮ Single instances 30

Extension I: Categorical Parameters Models that can handle categorical inputs ◮ Random forests: out of the box ◮ Extended (approximate) Gaussian processes – new kernel based on weighted Hamming distance Application domain ◮ Algorithms with categorical parameters ◮ Single instances Empirical evaluation ◮ ActiveConfigurator outperformed FocusedILS 30

Extension II: Multiple Instances Models incorporating multiple instances ◮ Can still learn probabilistic models of algorithm performance ◮ Model inputs: ◮ algorithm parameters ◮ instance features 31

Extension II: Multiple Instances Models incorporating multiple instances ◮ Can still learn probabilistic models of algorithm performance ◮ Model inputs: ◮ algorithm parameters ◮ instance features General algorithm configuration ◮ Algorithms with categorical parameters ◮ Multiple instances 31

Extension II: Multiple Instances Models incorporating multiple instances ◮ Can still learn probabilistic models of algorithm performance ◮ Model inputs: ◮ algorithm parameters ◮ instance features General algorithm configuration ◮ Algorithms with categorical parameters ◮ Multiple instances Empirical evaluation ◮ ActiveConfigurator never worse than FocusedILS ◮ Overall: model-based approaches very promising 31

Outline 1. Problem Definition & Intuition 2. Model-Free Search for Algorithm Configuration 3. Model-Based Search for Algorithm Configuration 4. Conclusions 32

Conclusions Algorithm configuration ◮ Is a high-dimensional optimization problem – Can be solved by automated approaches – Sometimes much better than by human experts 33

Conclusions Algorithm configuration ◮ Is a high-dimensional optimization problem – Can be solved by automated approaches – Sometimes much better than by human experts ◮ Can cut development time & improve results 33

Conclusions Algorithm configuration ◮ Is a high-dimensional optimization problem – Can be solved by automated approaches – Sometimes much better than by human experts ◮ Can cut development time & improve results Scaling to very complex problems allows us to ◮ Build very flexible algorithm frameworks ◮ Apply automated tool to instantiate framework � Generate custom algorithms for different problem types 33

Conclusions Algorithm configuration ◮ Is a high-dimensional optimization problem – Can be solved by automated approaches – Sometimes much better than by human experts ◮ Can cut development time & improve results Scaling to very complex problems allows us to ◮ Build very flexible algorithm frameworks ◮ Apply automated tool to instantiate framework � Generate custom algorithms for different problem types Blackbox approaches ◮ Very general ◮ Can be used to optimize your parameters 33

Main Contribution of this thesis Comprehensive study of the algorithm configuration problem 34

Main Contribution of this thesis Comprehensive study of the algorithm configuration problem ◮ Empirical analysis of configuration scenarios ◮ Two fundamentally different solution approaches ◮ Demonstrated practical relevance of algorithm configuration 34

Main Contribution of this thesis Comprehensive study of the algorithm configuration problem ◮ Empirical analysis of configuration scenarios ◮ Two fundamentally different solution approaches – Model-free Iterated Local Search approach ◮ Demonstrated practical relevance of algorithm configuration 34

Automating the Configuration of Algorithms for Solving Hard - PowerPoint PPT Presentation

Automating the Configuration of Algorithms for Solving Hard Computational Problems Ph.D. Thesis Defence Frank Hutter Supervisory committee: Prof. Holger Hoos (supervisor) Prof. Kevin Leyton-Brown (co-supervisor) Prof. Kevin Murphy

Automating the Automating the configuration of flow configuration of flow monitoring probes

Configuration management Configuration management Configuration management Configuration

Augeas a configuration API Raphal Pinson Configuration Management Sitewide configuration

Integrating Problem Solving 2020 Integrating Problem Solving 2020 Integrating Problem Solving

CNC PINpad USA, December 2014 Configuration Configuration Description POS Dollar General

EPiServer och Configuration Management EPiServer och Configuration Management Configuration

Automating batch fecundity measurements Automating batch fecundity measurements using digital

REDHAT KICKSTART REDHAT KICKSTART Automating Linux Installation Automating Linux Installation

Automating MySQL Deployments on Kubernetes Calin Don & Flavius Mecea Presslabs Automating

Automating Authority Work Automating authority work, or, Be your own authority control vendor

Automating Production of Cross Media Automating Production of Cross Media Content for

RANDOMIZING AND RANDOMIZING AND AUTOMATING ASSESSMENT AUTOMATING ASSESSMENT WITH R WITH R exams

SUSE Cloud 8 CLM Overview Input Model and Configuration Processor Configuration Processing Flow

Distributed Configuration Management: Mercurial CSCI 5828 Spring 2012 Mark Grebe Configuration

AHU-01 Configuration - 60% FD CLASHES WITH UPDATED MONORAIL BEAM MONORAIL BEAM (8m) TITLE: AHU

Software Development: Tools and Processes Lecture - 13: Software Configuration Management

Folding polyominoes with holes into a cube , 3

Applicative, Traversable, and Foldable Advanced functional programming - Lecture 4 Trevor L.

Unfolding Mathematics with Unit Origami WMC Annual Meeting May 2017 Joseph Georgeson retired

Objectives Mapping and Folding Defjne the foldr and map functions. Dr. Mattox Beckman Use

Origami and mathematics: why you are not just folding paper Stefania Lisai MACS PhD Seminar 5th

Simple folding of array-based VLSI structures Liudmila Cheremisinova The United Institute of

On Simulated Annealing in EDA A tribute to Prof. C. L. Liu at ISPD 2012 Martin D.F. Wong

Welcome to... Meal Plan #046 @stemettes | #stemillions SNACK CACKLE POP @stemettes |

Automating the Configuration of Algorithms for Solving Hard - PowerPoint PPT Presentation

Automating the Configuration of Algorithms for Solving Hard Computational Problems Ph.D. Thesis Defence Frank Hutter Supervisory committee: Prof. Holger Hoos (supervisor) Prof. Kevin Leyton-Brown (co-supervisor) Prof. Kevin Murphy

Automating the Automating the configuration of flow configuration of flow monitoring probes

Configuration management Configuration management Configuration management Configuration

Augeas a configuration API Raphal Pinson Configuration Management Sitewide configuration

Integrating Problem Solving 2020 Integrating Problem Solving 2020 Integrating Problem Solving

CNC PINpad USA, December 2014 Configuration Configuration Description POS Dollar General

EPiServer och Configuration Management EPiServer och Configuration Management Configuration

Automating batch fecundity measurements Automating batch fecundity measurements using digital

REDHAT KICKSTART REDHAT KICKSTART Automating Linux Installation Automating Linux Installation

Automating MySQL Deployments on Kubernetes Calin Don &amp; Flavius Mecea Presslabs Automating

Automating Authority Work Automating authority work, or, Be your own authority control vendor

Automating Production of Cross Media Automating Production of Cross Media Content for

RANDOMIZING AND RANDOMIZING AND AUTOMATING ASSESSMENT AUTOMATING ASSESSMENT WITH R WITH R exams

SUSE Cloud 8 CLM Overview Input Model and Configuration Processor Configuration Processing Flow

Distributed Configuration Management: Mercurial CSCI 5828 Spring 2012 Mark Grebe Configuration

AHU-01 Configuration - 60% FD CLASHES WITH UPDATED MONORAIL BEAM MONORAIL BEAM (8m) TITLE: AHU

Software Development: Tools and Processes Lecture - 13: Software Configuration Management

Folding polyominoes with holes into a cube , 3

Applicative, Traversable, and Foldable Advanced functional programming - Lecture 4 Trevor L.

Unfolding Mathematics with Unit Origami WMC Annual Meeting May 2017 Joseph Georgeson retired

Objectives Mapping and Folding Defjne the foldr and map functions. Dr. Mattox Beckman Use

Origami and mathematics: why you are not just folding paper Stefania Lisai MACS PhD Seminar 5th

Simple folding of array-based VLSI structures Liudmila Cheremisinova The United Institute of

On Simulated Annealing in EDA A tribute to Prof. C. L. Liu at ISPD 2012 Martin D.F. Wong

Welcome to... Meal Plan #046 @stemettes | #stemillions SNACK CACKLE POP @stemettes |

Automating MySQL Deployments on Kubernetes Calin Don & Flavius Mecea Presslabs Automating