Making Solomonoff Induction Effective
You Can Learn What You Can Bound

Jörg Zimmermann and Armin B. Cremers
Institute of Computer Science, University of Bonn, Germany
The General Prediction Problem

Given a finite sequence of bits, e.g.:

0010010000111111011010101000100010000101

Question: What is the next bit?
Asynchronous Learning Framework (ALF)

A learning system observing and predicting an environment.

[Figure: the learning system and the environment, each with its own work tape, exchange observations o1, o2, o3, o4, ... and predictions p1, p2, p3, p4, ...]
Solomonoff Induction

• Bayesian learning in program space.
• Prior ~ 2^(-|p|), where |p| is the length of program p in bits.
• But the posterior distribution on program space is not computable! (Programs that stop producing output cause the trouble.)
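The Bayesian update behind this slide can be sketched on a toy scale. The sketch below restricts program space to three hand-picked hypothetical "programs" (names and lengths are illustrative assumptions), which is exactly the restriction that makes the posterior computable here; over all programs it is not.

```python
# Toy sketch of Solomonoff-style Bayesian updating, assuming a small
# hand-picked hypothesis class. The real posterior over all programs
# is not computable; this finite restriction is what makes it so.

hypotheses = {
    "zeros": (lambda n: "0" * n, 4),       # (generator, program length in bits)
    "ones":  (lambda n: "1" * n, 4),
    "alt":   (lambda n: ("01" * n)[:n], 6),
}

def posterior(observed):
    """Weight each hypothesis by the prior 2^-|p| and keep only those
    consistent with the observed prefix, then normalize."""
    weights = {}
    for name, (gen, length) in hypotheses.items():
        if gen(len(observed)) == observed:
            weights[name] = 2.0 ** (-length)
    total = sum(weights.values())
    return {name: w / total for name, w in weights.items()} if total else {}

print(posterior("0101"))  # only "alt" is consistent with the data
```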
Key Points of our Approach

• Learning driven by a combined search in program and proof space.
• Reduction of learnability to provability and set existence axioms. Axiom systems of reverse mathematics and large cardinal axioms can be used to show that proof-theoretic strength translates into learning strength.
• Introduction of a new learning framework, the Synchronous Learning Framework (SLF), which couples the time scales of the learning system and the environment.
Probabilistic Learning Systems

Λ : {0,1}* × {0,1} → [0,1] ∩ Q with Λ(x,0) + Λ(x,1) = 1 for all x ∈ {0,1}*.

Λ is an effective probabilistic learning system if Λ is a total recursive function.
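A minimal concrete instance of such a Λ is Laplace's rule of succession: it is total recursive, rational-valued, and its two outputs sum to 1, so it satisfies the definition (it is only an illustration, not the learner built later in the talk).

```python
from fractions import Fraction

def laplace(x: str, b: str) -> Fraction:
    """A total recursive Λ: predict the next bit by Laplace's rule of
    succession. Values lie in [0,1] ∩ Q and Λ(x,0) + Λ(x,1) = 1."""
    count = x.count(b)
    return Fraction(count + 1, len(x) + 2)

assert laplace("0010", "0") + laplace("0010", "1") == 1
print(laplace("0010", "0"))  # three 0s among four bits -> (3+1)/(4+2) = 2/3
```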
Learnability

Learning as learning in the limit: eventually the learning system becomes near certain about the true continuation of the observed bit sequence.

Definition: An infinite bit sequence s is learnable in the limit by the probabilistic learning system Λ if for all ε > 0 there is an n0 such that for all n ≥ n0 and all k ≥ 1:

Λ^(k)(s_{1:n}, s_{n+1:n+k}) > 1 − ε.

Λ^(k): the prediction horizon extended to k bits by feeding Λ with its own predictions.
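The Λ^(k) construction is just a chain-rule product: feed Λ the observed prefix plus its own predicted bits one at a time. A sketch, with a hypothetical smoothed-frequency learner standing in for Λ:

```python
def extend(L, x: str, y: str) -> float:
    """Lambda^(k)(x, y): the probability L assigns to the k-bit
    continuation y of x, obtained by feeding L its own predictions
    bit by bit (the product of L(x + y[:i], y[i]) over i)."""
    p = 1.0
    for i, b in enumerate(y):
        p *= L(x + y[:i], b)
    return p

def freq(x, b):
    """Illustrative stand-in for Lambda: Laplace-smoothed bit frequency."""
    return (x.count(b) + 1) / (len(x) + 2)

print(extend(freq, "0000", "00"))  # (5/6) * (6/7)
```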
Σ-driven Learning Systems

• Turning an axiom system Σ into a learning system Λ: Σ → Λ(Σ)
• A Σ-driven learning system is a learning system that uses the background theory Σ to derive totality proofs for recursive functions.
• These provably total recursive functions are used to build a guard function, which schedules the learning process and guarantees its effectiveness.
Generator Time Function

The generator time function of a program p is defined as:

G_p : N → N ∪ {∞}
G_p(n) = number of transitions executed by p to generate its first n bits.
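G_p can be made concrete with a step-counting interpreter. In the sketch below a "program" is modeled (our assumption, for illustration) as a Python generator whose every yield is one transition; yielding None is a computing step that emits no bit.

```python
def generator_time(program, n):
    """G_p(n): the number of transitions program p executes to emit its
    first n bits. Here p is modeled as a Python generator that yields a
    bit, or None for a transition with no output."""
    steps, emitted = 0, 0
    for out in program():
        steps += 1
        if out is not None:
            emitted += 1
        if emitted == n:
            return steps
    return float("inf")  # p halted before producing n bits

def slow_zeros():
    """Hypothetical generator that thinks exponentially longer (2^k idle
    transitions) before emitting each new 0 bit."""
    k = 0
    while True:
        for _ in range(2 ** k):
            yield None
        yield 0
        k += 1

print(generator_time(slow_zeros, 3))  # 2^0 + 2^1 + 2^2 idle steps + 3 emits = 10
```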
Observation Equivalence

Let s_p be the bit sequence generated by program p. The observation class [s] of a bit sequence s is defined by:

p ∈ [s] iff s = s_p.
Generator-Predictor Theorem

The infinite bit sequence s is learnable by Λ(Σ) if:

∃ p ∈ [s] and a recursive function f : Σ ⊢ φ_tot(f) and f ≥_d G_p.

φ_tot(f): f is a total recursive function.
f ≥_d g: f dominates g (i.e., ∃ n0 ∀ n ≥ n0 : f(n) ≥ g(n)).
The hard case: infinite number of switches

[Figure: #transitions vs. #observed bits — the generator time function G_p crosses the guard function infinitely often; each crossing suspends the candidate program, and each time the guard overtakes G_p again the program is reactivated.]
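The suspend/reactivate scheduling in the figure can be sketched as follows. This is a simplified illustration, not the actual Λ(Σ) scheduler: a candidate generator gets a step budget guard(n) while the learner waits for bit n; exhausting the budget suspends it (its state persists inside the Python generator), and raising n reactivates it once the guard has grown.

```python
def guarded_run(program, guard, n_bits):
    """Run generator p under guard function f: while producing bit number
    n, p may use at most f(n) transitions in total. Exceeding the budget
    suspends p; it is reactivated when the budget f(n) for a later target
    n is large enough. Assumes the guard eventually dominates G_p,
    otherwise this loop raises n forever (the hard case on the slide)."""
    gen = program()            # the suspended state lives in the generator
    bits, steps, n = [], 0, 1
    while len(bits) < n_bits:
        if steps >= guard(n):  # budget exhausted: suspend ...
            n += 1             # ... and reactivate under a larger budget
            continue
        out = next(gen)
        steps += 1
        if out is not None:
            bits.append(out)
    return bits, steps

def ticking_zeros():
    """Illustrative generator: emits a 0 on every third transition."""
    while True:
        yield None
        yield None
        yield 0

print(guarded_run(ticking_zeros, lambda n: 4 * n, 2))  # ([0, 0], 6)
```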
Σ-driven probabilistic learning system

Idea: retroactive change of the prior: ~ 2^(−(|p| + switch(p,n))) (instead of the Solomonoff prior ~ 2^(−|p|)).

⇒ Dynamic Bayesian inference, i.e., the construction of the model space and the prior probabilities is interleaved with the inference process.
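Numerically the retroactive prior is a one-liner; the sketch below (function and argument names are ours, not from the slides) shows how each suspend/reactivate switch halves a program's weight relative to the static Solomonoff prior.

```python
def dynamic_prior(length_bits, switches):
    """Retroactive prior sketch: a program of length |p| that has been
    suspended/reactivated switch(p, n) times after n observations gets
    weight 2^-(|p| + switch(p, n)) instead of the static 2^-|p|."""
    return 2.0 ** -(length_bits + switches)

# every switch retroactively halves the program's prior weight
assert dynamic_prior(5, 1) == dynamic_prior(5, 0) / 2
```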
Conclusions 1

• The generator-predictor theorem establishes a natural perspective on the effective core of Solomonoff induction.
• This shifts questions about learnability to questions about provability, and therefore into the realm of the foundations of mathematics.
Synchronous Learning Framework (SLF)

Observation: in real-world learning situations, the generator and the learner are not suspended while the other one is busy.

s is synchronous :⟺ lim sup_{n→∞} G_p(n)/n < ∞ for at least one p ∈ [s].

⇒ the time scales of the learning system and the environment are coupled.
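The defining ratio G_p(n)/n is easy to inspect on a finite prefix. A finite sample can of course only illustrate, not decide, the lim sup; the two generator time functions below are illustrative assumptions.

```python
def ratio_prefix(G, n_max):
    """Finite-prefix view of G_p(n)/n; s is synchronous iff the lim sup
    of this ratio is finite for some p in [s]."""
    return [G(n) / n for n in range(1, n_max + 1)]

linear = ratio_prefix(lambda n: 3 * n + 5, 20)    # ratio bounded: looks synchronous
exponential = ratio_prefix(lambda n: 2 ** n, 20)  # ratio unbounded: not synchronous

print(max(linear), max(exponential))
```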
Clockification

[Figure: an internal clock paces an arbitrary computable sequence 0 0 1 0 0 ... ; combining it with the clock signal yields the synchronous sequence 00 10 00 00 10 11 00 10 10 ...]
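One plausible reading of the figure's encoding (our assumption; the slide does not spell it out): every clock tick emits a two-bit frame, "1b" when the generator produced bit b on that tick and "00" when it is still computing. Under this reading the framed sequence is synchronous by construction, since n frames take about n transitions.

```python
def clockify(program, n_frames):
    """Clockification sketch: pace an arbitrary generator with an internal
    clock. Each tick emits a two-bit frame: '1b' if the generator produced
    bit b on that tick, '00' if it is still computing. (Frame encoding
    inferred from the slide's figure.)"""
    gen = program()
    frames = []
    for _ in range(n_frames):
        out = next(gen)
        frames.append("00" if out is None else "1" + str(out))
    return " ".join(frames)

def pi_like():
    """Hypothetical slow generator of the bits 0 0 1 0 0, one idle
    transition before each output."""
    for b in [0, 0, 1, 0, 0]:
        yield None
        yield b

print(clockify(pi_like, 9))  # "00 10 00 10 00 11 00 10 00"
```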
Synchronous Learning Framework

• Clockification transforms every computable bit sequence into a synchronous one.
• All synchronous bit sequences are learnable by Λ(Σ) if Σ ⊢ "n² is a total recursive function".
• Thus in the SLF all effectively generated bit sequences can be effectively learned.
Final Conclusion

If the learning system is enhanced by an internal clock: effective universal induction is possible!

Hence future research can focus on efficient universal induction.