First Experiments with Watchlist Guidance on Mizar Zarathustra - PowerPoint PPT Presentation

Introduction Theory Experimental Setup Results Examples Further Work First Experiments with Watchlist Guidance on Mizar Zarathustra Goertzel Jan Jakubův Josef Urban Stephan Schulz Czech Technical University in Prague DHBW Stuttgart AITP’18, 29th March 2018 Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 1/38

Introduction Theory Experimental Setup Results Examples Further Work Watchlist Guidance The watchlist contains clauses that guide E prover’s search, Called hints in Prover9 by Bob Veroff Can be thought of as containng: Mathematical Tricks (not already in E) Proof sketches/schema Conjectured lemmas/sub-goals From other E (ATP) proofs. Thus learning ! Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 2/38

Introduction Theory Experimental Setup Results Examples Further Work Watchlist Guidance The watchlist contains clauses that guide E prover’s search, Called hints in Prover9 by Bob Veroff Can be thought of as containng: Mathematical Tricks (not already in E) Proof sketches/schema Conjectured lemmas/sub-goals From other E (ATP) proofs. Thus learning. Current result: 7.2% more proofs in Mizar Mathematical Library! Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 3/38

Introduction Theory Experimental Setup Results Examples Further Work Watchlist Guidance The watchlist contains clauses that guide E prover’s search, Called hints in Prover9 by Bob Veroff Can be thought of as containng: Mathematical Tricks (not already in E) Proof sketches/schema Conjectured lemmas/sub-goals From other E (ATP) proofs. Thus learning. Current result: 7.2% more proofs! And some longer ones proved only by watchlist guidance. Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 4/38

Introduction Theory Experimental Setup Results Examples Further Work First Experiment For AITP18 we experimented with how to include the watchlist in E strategies (via PreferWatchlist). how to build the watchlist from prior proofs Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 5/38

Introduction Theory Experimental Setup Results Examples Further Work Further Experiments After AITP18 submission we continued experimenting with matching hints modulo skolem names (e.g., eskfoobar(A,B) ⊑ esk(A,B)) Dynamic weighting of hints by proof completion K-NN recommendation of hints from prior proofs Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 6/38

Introduction Theory Experimental Setup Results Examples Further Work Where’s the AI in this? Watchlist provides logical interface between AI/human and ATP: High and low level systems Watchlist with dynamic weighting can provide a vectorial proofstate characterization for AI methods (e.g. ENIGMA’s SVM) Learning from prior proofs Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 7/38

Introduction Theory Experimental Setup Results Examples Further Work E Basics E’s a saturation-based refutational theorem prover (using superposition calculus) To prove P , E does inference from ¬ P until contradiction or all inferences have been done. Repeats given-clause loop : while (no proof found) { select a given clauses apply inference rules to selected clauses process inferred clauses } Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 8/38

Introduction Theory Experimental Setup Results Examples Further Work Motivation while (no proof found) { select a given clauses apply inference rules to selected clauses process inferred clauses } select a given clauses To prove hard conjectures, clauses must be selected intelligently and specifically for the given conjecture. Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 9/38

Introduction Theory Experimental Setup Results Examples Further Work Basic Watchlist Mechanism Check if each clause subsumes a hint (That is, clause ⊑ hint, and hint led to the empty set before!) If yes, boost clause priority (and thus score), making selection more likely. And remove hint from watchlist (optional). Used via the PreferWatchlist priority function Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 10/38

Introduction Theory Experimental Setup Results Examples Further Work Priority? Weight? Score? E chooses given clauses by weighted round-robbin selection, with priority queues as specified in a strategy containing Clause Evaluation Functions . An example E strategy: -tKBO -H(2*Clauseweight(PreferWatchlist,20,9999,4) ,4*FIFOWeight(PreferProcessed)) Where Priority functions: PreferWatchlist, PreferProcessed Weight functions: Clauseweight, FIFOWeight Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 11/38

Introduction Theory Experimental Setup Results Examples Further Work baseline 02: an actual evolved strategy file --definitional-cnf=24 --split-aggressive --simul-par amod --forward-context-sr --destructive-er-aggressive --destructive-er --prefer-initial-clauses -tKBO -winv freqrank -c1 -Ginvfreq -F1 --delete-bad-limit=150000000 -WSelectMaxLComplexAvoidPosPred -H(1*ConjectureTermPrefixWeight(DeferSOS,1,3,0.1,5,0,0.1,1,4) ,1*ConjectureTermPrefixWeight(DeferSOS,1,3,0.5,100,0,0.2,0.2,4) ,1*Refinedweight(PreferWatchlist,4,300,4,4,0.7) ,1*RelevanceLevelWeight2(PreferProcessed,0,1,2,1,1,1,200,200,2.5,9999.9,9999.9) ,1*StaggeredWeight(DeferSOS,1),1*SymbolTypeweight(DeferSOS,18,7,-2,5,9999.9,2,1.5) ,2*Clauseweight(PreferWatchlist,20,9999,4) ,2*ConjectureSymbolWeight(DeferSOS,9999,20,50,-1,50,3,3,0.5) ,2*StaggeredWeight(DeferSOS,2)) Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 12/38

Introduction Theory Experimental Setup Results Examples Further Work Dynamic Watchlist Multiple watchlists: one for each loaded proof. Keep track of which prior proofs (files) hints come from. Count what % of each proof has been covered (subsumed) Assign more priority via PreferWatchlistRelevant for higher completion %. –> closer to contradiction –> closer to current proof Can represent proof state. Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 13/38

Introduction Theory Experimental Setup Results Examples Further Work Dynamic Watchlist with Decay Watchlist subsumption is sparse, so Inherit watchlist priority from parents with multiplicative decay. Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 14/38

Introduction Theory Experimental Setup Results Examples Further Work Watchlist Relevance Calculation For a clause C matching at least one watchlist W i : � progress ( W ) � relevance 0 ( C ) = max | W | W ∈{ W i : C ⊑ W i } Where progress ( W ) is how many clauses in watchlist W have been subsumed. Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 15/38

Introduction Theory Experimental Setup Results Examples Further Work Watchlist Relevance Calculation For a clause C matching at least one watchlist W i : � progress ( W ) � relevance 0 ( C ) = max | W | W ∈{ W i : C ⊑ W i } With decay, for δ < 1, � � relevance 1 ( C ) = relevance 0 ( C )+ δ ∗ relevance 1 ( D ) avg D ∈ parents ( C ) Aditionally, reset to 0 if relevance 1 ( C ) < α and relevance 1 ( C ) < β for threshold parameters α, β > 0. length ( C ) Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 16/38

Introduction Theory Experimental Setup Results Examples Further Work Baseline Strategies We use 57897 Mizar40 conjectures with premises pre-selected. We previously evolved 32 strategies with a coverage of 24702 proofs (in 5s each). In practice, we use a top 5 greedy cover of strategies 2, 8, 9, 26 and 28. In 5 s (in parallel) they together solve 21122 problems. Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 17/38

Introduction Theory Experimental Setup Results Examples Further Work Putting PreferWatchlist into Strategies Some options: Stephan Schulz’s -xUseWatchlistEvo (EVO) Add EVO’s CEFs to baseline strategy (uwl_evo) Namely: ConjectureRelativeSymbolWeight (Prefer,...) Replace all priority functions with PreferWatchlist (pref) Always PreferWatchlist, default to other priority function if not on watchilst (uwl). Matching Skolems by name and arity only (ska). Zarathustra Goertzel, Jan Jakubův, Josef Urban, Stephan Schulz First Experiments with Watchlist Guidance on Mizar 18/38

First Experiments with Watchlist Guidance on Mizar Zarathustra - PowerPoint PPT Presentation

Introduction Theory Experimental Setup Results Examples Further Work First Experiments with Watchlist Guidance on Mizar Zarathustra Goertzel Jan Jakubv Josef Urban Stephan Schulz Czech Technical University in Prague DHBW Stuttgart

Towards Standardized Mizar Environments Adam Naumowicz Institute of Informatics University of

MIZAR in MathWiki Adam Naumowicz adamn@mizar.org Institute of Computer Science University of

Extracting Higher-Order Goals from the Introduction Three Constructs Mizar Mathematical Library

Introduction to the MIZAR system Adam Naumowicz adamn@mizar.org Institute of Computer Science

Moving Money Inside the global watchlist for banking across borders Tom Wills (UK) Times of

John Deere Guidance Systems Guidance you can grow with | 2 Guidance you can grow with: John

1 MISSION YOUTH GUIDANCE GALA 2016 YOUTH GUIDANCE GALA 2016 Mission : Youth Guidance creates and

Experiments on deflection of charged Experiments on deflection of charged Experiments on

SAN DIEGO COUNTY K-12 SCHOOLS TELEBRIEFING August 4, 2020 8/4/2020 NEW GUIDANCE'S! YOUTH

Integrity of MML - Mizar Mathematical Library Piotr Rudnicki Andrzej Trybulec University of

Mizar demo http://grid01.ciirc.cvut.cz/~mptp/out4.ogv 2 / 36 Using Learning to Guide Theorem

Traffic Management as a Service Ing. Giulia Dovinola, Senior Researcher, SWARCO Mizar 20-11-18 1

Dependencies in Formal Mathematics: Applications and extraction for Coq and Mizar Jesse Alama 1 ,

Isabelle Import for Mizar Cezary Kaliszyk Karol P ak CICM18, Hagenberg Proof

Progress in the formalization of Matiyasevichs theorem in the Mizar system Karol Pk pakkarol

Aggregate Advisory Board August 12, 2015 Technical Guidance New Guidance on Development of

COL202: Discrete Mathematical Structures Ragesh Jaiswal, CSE, IIT Delhi Ragesh Jaiswal, CSE, IIT

Intro to Mathematical Reasoning via Discrete Mathematics CMSC-37115 Instructor: Laszlo Babai

For Wednesday No reading Nothing due Prepare for exam Wednesday night Note: we do

Cubical sets as a classifying topos Bas Spitters Carnegie Mellon University, Pittsburgh Aarhus

Set Operations B . It is denoted A B . A B = { x | x A x B } A B = { x | x A x B } 1 2 3 4 5 6 7 8

Negation Normal Form a formula is in negation normal form (NNF) if it only contains ,

Discrete Mathematics with Applications Chapter 2: The Logic of Compound Statements (Part 2)

Close Encounters with Non-Normal Worlds Franz Berto F.Berto@uva.nl f.berto@abdn.ac.uk 1 0.