Adding value to optimisation by interrogating fitness models
Alexander Brownlee
www.cs.stir.ac.uk/~sbr | sbr@cs.stir.ac.uk
Outline
• "Adding value"
• Markov network fitness model
• Single-generation examples (recap)
• Multi-generation examples
• Discussion
• (Real-world application and some more discussion in SAEOpt tomorrow)
Value-added Optimisation
• A philosophy whereby we provide more than simply optimal solutions
• Information gained during optimisation can highlight sensitivities and linkage
• This can be useful to the decision maker:
  – Confidence in the optimality of results
  – Aids decision making
  – Insights into the problem
    • Helps solve similar problems
    • Highlights problems / misconceptions in the problem definition
Value-added Optimisation
• This information can come from
  – the trajectory followed by the algorithm
  – models built during the run
• If we are constructing a model as part of the optimisation process, anything we can learn from it comes "for free"
• See also
  – M. Hauschild, M. Pelikan, K. Sastry, and C. Lima. Analyzing probabilistic models in hierarchical BOA. IEEE Trans. Evolutionary Computation 13(6):1199–1217, December 2009
  – R. Santana, C. Bielza, J. A. Lozano, and P. Larrañaga. Mining probabilistic models learned by EDAs in the optimization of multi-objective problems. In Proc. GECCO 2009, pp. 445–452
Markov network fitness model (MFM)
• Originally developed as part of the DEUM EDA
• An undirected probabilistic graphical model
  – Represents the joint probability distribution (factorises as a Gibbs distribution)
  – Nodes: variables
  – Edges: dependencies between variables
• The Gibbs distribution of the Markov network is equated to the mass distribution of fitness in the population:

  p(x) = f(x) / Σ_y f(y) ≡ e^(−U(x)/T) / Σ_y e^(−U(y)/T),  so  ln(f(x)) = −U(x)/T

• Energy has a negative log relationship to probability, so minimise U to maximise f
Markov network example
• For a bit-string encoded problem f(x0…x3), the model can be represented by:

  −ln(f(x)) = α0·x0 + α1·x1 + α2·x2 + α3·x3 + α01·x0x1 + α02·x0x2 + α03·x0x3 + α13·x1x3 + α23·x2x3 + α013·x0x1x3 + α023·x0x2x3 + c

• Build a set of equations using values from the population and solve to estimate the α
• Variables take the values −1 and +1 instead of 0 and 1
• Can then sample the model to generate new solutions
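The parameter-estimation step above — one equation per evaluated solution, solved for the α — can be sketched as a least-squares fit. This is an illustrative sketch using numpy, not the exact DEUM implementation; the function name and term encoding are assumptions.

```python
import numpy as np

def fit_mfm(pop, fitness, terms):
    """Fit -ln(f(x)) = sum_t alpha_t * prod(x[i] for i in t) + c.

    pop     : (N, n) array with entries in {-1, +1}
    fitness : (N,) array of strictly positive fitness values
    terms   : list of index tuples, e.g. [(0,), (1,), (0, 1)]
    Returns the alpha vector; the last entry is the constant c.
    """
    # Design matrix: one column per model term, plus a constant column.
    A = np.column_stack([np.prod(pop[:, list(t)], axis=1) for t in terms]
                        + [np.ones(len(pop))])
    b = -np.log(fitness)  # energy has a negative log relationship to fitness
    alpha, *_ = np.linalg.lstsq(A, b, rcond=None)
    return alpha
```

For the 4-variable example above, `terms` would contain the tuples (0,), (1,), (2,), (3,), (0,1), (0,2), (0,3), (1,3), (2,3), (0,1,3), (0,2,3).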
Mining the model (1)
• ln(f(x)) = −U(x)/T: as we minimise energy, we maximise fitness. So, to minimise energy, consider each univariate term α_i·x_i:
• If x_i takes the value 1 (+1) in high-fitness solutions, then α_i will be negative
• If x_i takes the value 0 (−1) in high-fitness solutions, then α_i will be positive
• If no particular value is taken by x_i in optimal solutions, then α_i will be near zero
Mining the model (2)
• ln(f(x)) = −U(x)/T: as we minimise energy, we maximise fitness. So, to minimise energy, consider each bivariate term α_ij·x_i·x_j:
• If the values taken by x_i and x_j are equal (product +1) in the optimal solutions, then α_ij will be negative
• If the values taken by x_i and x_j are opposite (product −1) in the optimal solutions, then α_ij will be positive
• Higher-order interactions follow the same pattern
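The sign rules on these two slides can be captured in a pair of small helpers. The function names are hypothetical, added only to make the interpretation concrete:

```python
def preferred_value(alpha_i, tol=1e-6):
    """Interpret a univariate MFM coefficient.

    Negative alpha_i -> x_i = +1 (bit 1) in high-fitness solutions;
    positive alpha_i -> x_i = -1 (bit 0); near zero -> no preference.
    """
    if alpha_i < -tol:
        return +1
    if alpha_i > tol:
        return -1
    return 0

def preferred_relation(alpha_ij, tol=1e-6):
    """Negative alpha_ij -> x_i == x_j preferred; positive -> x_i != x_j."""
    if alpha_ij < -tol:
        return "equal"
    if alpha_ij > tol:
        return "opposite"
    return "none"
```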
Single stage experiments
• Often the model closely fits the fitness function in the first generation (see DEUMd)
• Experiments:
  1. generate 30 populations of solutions at random and evaluate them
  2. estimate MFM parameters for each population
  3. calculate the mean of each α across the 30 models
• This section is mostly a recap of earlier results
Onemax
• Fitness is the number of x_i set to 1
[Plot: mean univariate α values; x-axis: univariate alpha number (0–100), y-axis: coefficient value (0 to −0.01)]
BinVal
• Fitness is the weighted sum of x_i set to 1 (the bit string is treated as a binary number)
[Plot: mean univariate α values; x-axis: univariate alpha number (0–100), y-axis: coefficient value (0.05 to −0.4)]
Trap 5
• The bit string is broken into blocks of size u
• The blocks are scored separately: fitness is the sum of these scores
• Deceptive for algorithms that ignore the blocks
[Plot: Trap5(u) score against number of ones in a block of size u]
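The block scoring can be sketched with the usual trap definition (assumed here; the exact scoring used in the slides' plot may differ): an all-ones block gets the full score, while any other block scores higher the fewer ones it has, which deceives hill-climbers.

```python
def trap(block, u=5):
    """Deceptive trap score for one block of size u: all-ones scores u;
    otherwise the score decreases as the number of ones increases."""
    ones = sum(block)
    return u if ones == u else u - 1 - ones

def trap5_fitness(bits, u=5):
    """Fitness is the sum of trap scores over consecutive blocks of size u."""
    return sum(trap(bits[i:i + u], u) for i in range(0, len(bits), u))
```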
Trap 5
[Plot: mean univariate α values; x-axis: univariate alpha number (0–100), y-axis: coefficient value (−0.006 to 0.012)]
Trap 5
[Plot: mean bivariate α values; x-axis: bivariate alpha number (0–100), y-axis: coefficient value (−0.012 to 0.008)]
Trap 5
[Plot: mean quintavariate (order-5) α values; x-axis: quintavariate alpha number (0–20), y-axis: coefficient value (0 to −0.007)]
Experiments
• This works well for some problems, but for others there is not enough information in a randomly generated population
• Some convergence is needed (cf. WCCI 2008 paper on selection¹)
• Here we run a GA to cause convergence, so that the convergence is independent of the model
¹ Brownlee, A. E. I., McCall, J. A. W., Zhang, Q. & Brown, D. (2008). Approaches to Selection and their Effect on Fitness Modelling in an Estimation of Distribution Algorithm. In Proc. of the World Congress on Computational Intelligence 2008, Hong Kong, China, pp. 2621–2628. IEEE Press
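The GA used to produce converged populations is not specified beyond this role, so the following is only a minimal sketch under assumed settings (binary tournament selection, uniform crossover, 1/n bit-flip mutation); its final population could then be passed to an MFM fit.

```python
import random

def simple_ga(fitness, n, pop_size=50, gens=30, seed=0):
    """Minimal generational GA over {-1, +1} strings; returns the final
    (partially converged) population, independent of any fitness model."""
    rng = random.Random(seed)
    pop = [[rng.choice([-1, 1]) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(gens):
        nxt = []
        for _ in range(pop_size):
            # Binary tournament selection for each parent.
            p1 = max(rng.sample(pop, 2), key=fitness)
            p2 = max(rng.sample(pop, 2), key=fitness)
            # Uniform crossover, then per-bit mutation with rate 1/n.
            child = [a if rng.random() < 0.5 else b for a, b in zip(p1, p2)]
            child = [-g if rng.random() < 1.0 / n else g for g in child]
            nxt.append(child)
        pop = nxt
    return pop
```

For example, `simple_ga(lambda x: sum(b == 1 for b in x), n=20)` yields a population converged towards the all-ones string on Onemax.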
Leading Ones
• Fitness is the count of contiguous 1s at the start of the bit string, beginning with x_0
• [Plots: univariate terms at generation 1 and generation 30]
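The Leading Ones fitness described above is straightforward to state in code:

```python
def leading_ones(bits):
    """Count the contiguous 1s at the start of the bit string."""
    count = 0
    for b in bits:
        if b != 1:
            break
        count += 1
    return count
```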
Leading Ones
• [Plots: bivariate terms representing neighbours in the bit-string chain]
Hierarchical IF-and-only-IF
• Recursively combine blocks to get fitness: fitness is gained for equal left/right halves of blocks
• Univariate terms are noise; bivariate terms tend towards negative values
• [Plots: left is generation 1, right is generation 100]
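The recursive combination of blocks can be sketched using the standard HIFF formulation for strings of length 2^k (assumed here to match the variant in the slides): every block at every level of the recursive halving contributes its length if all its bits are equal.

```python
def hiff(bits):
    """Hierarchical IF-and-only-IF: each block, at every level of the
    recursive halving, whose bits are all equal contributes its length."""
    n = len(bits)
    if n == 1:
        return 1  # single bits always contribute
    left, right = bits[:n // 2], bits[n // 2:]
    score = n if len(set(bits)) == 1 else 0
    return score + hiff(left) + hiff(right)
```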
Discussion
• Signs of global optima can appear very early in the evolutionary process
• Often these become stronger as evolution proceeds (as we would expect)
• Provides guidance to the most sensitive variables and linkages
Adding value
• Mining the model…
  – Provides some reasoning for why a particular solution is optimal
  – Highlights errors in the problem definition, such as poorly defined objectives
  – Allows the decision maker to choose among optimal solutions with respect to abstract objectives, e.g. aesthetic considerations absent from the model
  – Helps identify "hitch-hiker" values
Conclusions
• When using a model-based evolutionary algorithm (MBEA), we have explicit models of the fitness function
• These can be mined to gain greater insights into the problem, (almost) for free, so it does not hurt to at least consider "adding value" to optimisation
• How can we generalise? How might this extend to other model types?