Mathematical Models of Artificial Genetic Representations with - PowerPoint PPT Presentation

Mathematical Models of Artificial Genetic Representations with Neutrality c 2 and Nino Baˇ Carlos M. Fonseca 1 , Vida Vukaˇ c 3 sinovi´ si´ 1 CISUC, Department of Informatics Engineering, University of Coimbra, Portugal cmfonsec@dei.uc.pt 2 Computer Systems, Joˇ zef Stefan Institute, Ljubljana, Slovenia vida.vukasinovic@ijs.si 3 Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia nino.basic@famnit.upr.si Dagstuhl Seminar 17191, 7-12 May 2017

Acknowledgements • COST Action CA15140 on Improving Applicability of Nature-Inspired Op- timisation by Joining Theory and Practice (ImAppNIO) for a Short-Term Scientific Mission grant COST is supported by the EU Framework Programme Horizon 2020 • National funds through Portuguese Foundation for Science and Technology (FCT) • European Regional Development Fund (FEDER) through COMPETE 2020 Operational Program for Competitiveness and Internationalisation (POCI)

Outline • Background ◦ Neutrality in natural evolution ◦ Neutrality in artificial evolution ◦ Uniformly neutral representations • Mathematical formulation • Concluding remarks

1. Background 1.1. Neutrality in natural evolution • Most mutations at the genotypic level are not expressed in the phenotype (Kimura, 1968) • Genotypes connected by neutral mutations form (large) neutral networks • Random genetic drift instead of natural selection • Accumulation of neutral mutations may lead to beneficial mutations later • Neutrality is believed to account for improved search space exploration • Massive redundancy and neutrality

• RNA fitness landscapes (Schuster et al. 1994) ◦ Genotype is a sequence of nucleotides (bases) ◦ Phenotype is a shape (secondary structure, represented as a graph) ◦ Shape space considerably smaller than sequence space (redundancy) ◦ Few common shapes, many rare ones (non-uniform redundancy) ◦ Many single-base (and even two-base) mutations are neutral • Such genotype-phenotype mappings are defined by the physical laws gov- erning the folding process (and may have themselves evolved)

1.2. Neutrality in artificial evolution • Genotype-phenotype mappings are referred to as representations • Good representations and operators are crucial to evolutionary algorithm performance • The influence of genotypic redundancy and neutrality on search performance is (still) not well understood ◦ Larger search spaces make the problem harder (?) ◦ Larger neighbourhoods induced by neutral networks (may) make the problem easier (???) • There have been attempts to identify representation properties that influence the performance of evolutionary algorithms (Rothlauf, 2006) • Several contradicting results in the literature (Galv´ an-L´ opez et al, 2011)

• Many artificial redundant representations have been proposed ◦ Emphasis on very high redundancy ◦ Not amenable to analysis, typically evaluated experimentally • Will focus on a family of representations based on error control codes (Fonseca and Correia, 2005) ◦ Emphasis on low redundancy (is high redundancy really justified?) ◦ Various degrees of uniform redundancy, neutrality, connectivity, locality, and synonymity can be obtained ◦ Have allowed the influence of the above properties on optimisation performance to be studied experimentally (Correia, 2013) • Not trying to model natural representations!

1.3. Uniformly neutral representations Split redundant genotypic space into 2 ℓ − k interspersed classes

Uniformly neutral representations Map genotypes in each class so as to form neutral networks

In the binary case, block error-control codes define suitable genotypic classes • one main class ( C 0 , the code itself) • 2 ℓ − k − 1 cosets ( C j ) • minimum distance is at least 2 Decoding • Determine genotype class (polynomial division) • Map genotype to main class (add a constant) • Decode to obtain phenotype (truncation) ⇒ Neutral networks can be explicitly designed by specifying the representation of the zero phenotype, z j , in each class C j

• Exhaustive enumeration of all representations based on a given main class has only been achieved for ℓ − k = 3 bits of redundancy • Higher redundancy leads to extremely large numbers of different representations • MDS depiction of some neutral network shapes ( ℓ = 7, k = 4) 5 5 8 6 1 6 4 4 2 3 5 4 3 2 7 8 2 8 3 6 1 6 8 1 1 7 2 4 7 3 5 7

2. Mathematical formulation 2.1. Binary representations • The genotypic space is a vector space G = Z ℓ 2 , where addition of two vec- tors, ⊕ , is defined as the componentwise XOR operation, and scalar multiplication is the multiplication of a vector by a constant from Z 2 . • The phenotypic space is P = Z k 2 , with the same operations. • A binary representation is a surjective mapping r : G → P . If ℓ > k , the representation is redundant.

2.2. Mutations and neutral networks • A mutation is a bilinear mapping m : G × G → G , where m ( g , e ) = g ⊕ e for each g , e ∈ G . The single point mutation of the i -th component of g is denoted by m i ( g ) = m ( g , e i ) , where e i is a vector of length ℓ with a 1 on the i -th component and zeros elsewhere. • A mutation m i is neutral if r ( g ) = r ( m i ( g )) . • M ⊆ G is a neutral network if for each g 1 , g 2 ∈ M there exists a sequence of genotypes h 1 = g 1 , h 2 ,..., h µ = g 2 , where h j ∈ M for all j = 1 ,..., µ , and neutral mutations m i j for j = 1 ,..., µ − 1 such that h j + 1 = m i j ( h j ) .

2.3. Representation properties • Uniform redundancy: r is uniformly redundant if | r − 1 ( p 1 ) | = | r − 1 ( p 2 ) | for all p 1 , p 2 ∈ P • Connectivity: c r = 1 � r ( N ( r − 1 ( p ))) � � | P | ∑ � p ∈ P where N ( r − 1 ( p )) = { g ∈ G \ r − 1 ( p ) |∃ h ∈ r − 1 ( p ) : d G ( g , h ) = 1 } • Synonymity: s r = 1 1 | P | ∑ ∑ d G ( g , h ) � | r − 1 ( p ) | � p ∈ P { g , h }⊆ r − 1 ( p ) 2 • Locality: 2 ∑ l r = d P ( r ( g 1 ) , r ( g 2 )) ℓ | G | { g 1 , g 2 }⊆ G : d G ( g 1 , g 2 )= 1

2.4. Uniformly neutral representations • Let ν be an inclusion of P into G. G g g ∼ h ν h P g ⊕ h r • Let ∼ be a relation on G defined as g ∼ h ⇔ g ⊕ h ∈ ν ( P ) . Note that ∼ is an equivalence relation. The corresponding equivalence classes are called cosets. • An equivalent definition of coset of ν ( P ) in G is g ⊕ ν ( P ) = { g ⊕ ν ( p ) : p ∈ P } , where g is a coset representative.

• Let Z = { z 0 , z 1 ,..., z τ − 1 } , where τ = | G / ν ( G ) | and z 0 ∈ ν ( P ) , denote the set of representatives of all cosets of ν ( P ) in G , i.e. Z is a transversal. • Definition 1 Let ν be an inclusion of phenotype space P in genotype space G and Z be a transversal of all cosets of ν ( P ) in G. A representation r is compatible with ν and Z if the following conditions hold: 1. Z forms a neutral network in G 2. r ( z 0 ) = 0 P 3. for each g ∈ ν ( P ) , r ( z 0 ⊕ g ) = r ( z i ⊕ g ) for every i = 0 ,..., τ − 1 .

• Definition 2 A representation r is said to be fully compatible with inclusion ν and transversal Z if r is compatible with ν and Z, and r | ν ( P ) = ν − 1 ( m ( · , z 0 )) . • Theorem 1 Let ν be a linear inclusion of P into G. Let r be a representation which is compatible with inclusion ν and transversal Z. Then, c r ( p 1 ) = c r ( p 2 ) s r ( p 1 ) = s r ( p 2 ) and for every p 1 , p 2 ∈ P. Moreover, if r is fully compatible with ν and Z, then l r ( p 1 ) = l r ( p 2 ) .

• Theorem 2 Let ν be a linear inclusion of P into G. Let r , r ′ be representations which are compatible with inclusion ν and transversals Z, Z ⊕ c, respectively. Then, c r ′ = c r s r ′ = s r . and Moreover, if r , r ′ are fully compatible with ν and Z, Z ⊕ c, respectively, then l r ′ = l r . • Theorem 3 Let ν be a linear inclusion of P into G. Let π be a permutation of the components of g ∈ G such that π ( ν ( P )) = ν ( P ) . Let r , r ′ be representations which are compatible with inclusion ν and transversals Z, π ( Z ) , respectively. Then, c r ′ = c r s r ′ = s r . and

3. Concluding remarks • Developed a mathematical framework for the study and characterisation of artificial genetic representations • Formalised a class of uniform neutral representations from the literature • Equivalence classes of representations related to: ◦ Translations ◦ Permutations (automorphisms of ν ( P ) ) • Restrict enumeration to representatives of such equivalence classes • Now targeting the enumeration of representations with 4 bits of redundancy • Will allow the effect of locality on search performance to be studied under fixed values of the other properties

Mathematical Models of Artificial Genetic Representations with - PowerPoint PPT Presentation

Mathematical Models of Artificial Genetic Representations with Neutrality c 2 and Nino Ba Carlos M. Fonseca 1 , Vida Vuka c 3 sinovi si 1 CISUC, Department of Informatics Engineering, University of Coimbra, Portugal cmfonsec@dei.uc.pt 2

1 2 Genetic Program Genetic Program Parameter 3 Genetic Program Genetic Program 4 Softcoding

Genetic.io Genetic Algorithms in all their shapes and forms ! Genetic.io Make something of your

Germ- -line Genetic Therapy line Genetic Therapy Germ Munson- -Davis Look Bravely at a Davis

Genetic Programming What is it? Genetic Programming Genetic programming (GP) is an

Mathematical Representations of Mathematical Representations of Preference and Utility

Mathematical Analysis of Genetic Algorithms Genetic Algorithms are not appropriate for

61A Lecture 16 Announcements String Representations String Representations 4 String

Introduction to Genetic Epidemiology CM van Duijn Genetic Epidemiology Unit Gene Discovery

Introduction to Genetic Epidemiology CM van Duijn Genetic Epidemiology Unit Gene Discovery

Genetic drift (two types) Genetic drift: changes in allele frequencies due to chance. Founder

All in the Family How Genetic Counselors Facilitate Familial Genetic Testing Amanda Openshaw, MS,

Genetic Algorithms Welcome to this introduction to Genetic Algorithms. The objective of this paper

GENETIC ALGORITHMS By Joy Reistad Overview What are genetic algorithms? History

CSCI 446: Artificial Intelligence Genetic Algorithms Genetic Algorithms Basic concept is

Louisiana Artificial Reef Program Update Artificial Reef Council | June 4, 2018 Louisiana

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Erasmus+ Mobility: Connecting Adult Education in Ireland to its Rich Tapestry of Values,

e-MOTICON e-MO MObility Transnational strategy for an Interoperable CO COmmunity and

sqrrl sqrrl Secure. Scale. Adapt Secure. Scale. Adapt. Adam

Chernoff approximation of diffusions and further applications Yana A. Butko Analysis and

Social-media Storytelling Linking Hao Wu Seamus Lawless Gareth Jones Francois Pitie The ADAPT

Unstructured Texts Jinhua Du ADAPT Centre, Dublin City University, Ireland The ADAPT Centre is

Pay-fo Pa for-Us Use in th the EGI Marketplace EG Sy Holsinger, EGI Foundati tion Se

P2P-NEXT EUROPEAN UNION FRAMEWORK 7 PROJECT WWW.P2P-NEXT.ORG Johnathan Ishmael