Ensemble Quasi-Newton HMC
Xiao-Yong Jin and James Osborn, Argonne National Laboratory


  1. Ensemble Quasi-Newton HMC
  Xiao-Yong Jin and James Osborn, Argonne National Laboratory
  July 23, 2018, The 36th International Symposium on Lattice Field Theory, East Lansing, MI

  2. Reduce critical slowing down
  • Part of the US DOE-funded Exascale Computing Project (ECP)
  • Supports research in lattice QCD to prepare for exascale
  • Reducing critical slowing down, led by Norman Christ, is part of the USQCD effort in ECP
  • See Norman's slides for a list of people actively involved

  3. Outline
  • Generate ensemble-assisted Markov chains
  • Apply quasi-Newton HMC
  • Test on 2D U(1) pure gauge theory (work in progress)

  4. Generate multiple Markov chains
  [Diagram: several chains evolving in parallel, 0 1 2 3 → 0′ 1′ 2′ 3′]
  • Can we exchange information between chains?
  • Use info from other chains
  • Extra info from itself (not explored in this talk)
  • Any advantage?

  5. Generate the next state of each Markov chain with information from other chains
  [Diagram: chains 0 1 2 3 advance one at a time, each through ℱ (a set of configs):
  0 → 0′ uses ℱ({1,2,3}); 1 → 1′ uses ℱ({2,3,0′}); 2 → 2′ uses ℱ({3,0′,1′}); 3 → 3′ uses ℱ({0′,1′,2′})]
  • Detailed balance: evolve backward from (3′,2′,1′,0′) in the reverse order
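The sequential scheme above can be sketched in Python. This is a minimal illustration, not the authors' code: the per-chain `update` function and its use of the other chains' spread are hypothetical stand-ins for ℱ.

```python
import numpy as np

rng = np.random.default_rng(7)

def update(x, others, step=0.5):
    # Hypothetical stand-in for F({...}): propose a move whose scale
    # is informed by the spread of the other chains' current states.
    scale = step * (1.0 + np.std(others))
    return x + scale * rng.standard_normal()

def sweep(states, order):
    # Advance chains one at a time; each update sees the *current*
    # states of all the other chains, as in the diagram above.
    states = list(states)
    for i in order:
        others = [s for j, s in enumerate(states) if j != i]
        states[i] = update(states[i], others)
    return states

chains = [0.0, 1.0, 2.0, 3.0]
forward = sweep(chains, order=[0, 1, 2, 3])
# Detailed balance: the reverse move evolves the chains backward in
# the opposite order, 3 -> 2 -> 1 -> 0 ("Reverse" in the slide).
backward_order = [3, 2, 1, 0]
```

The point of the ordering is that each forward step conditions only on states that are held fixed during that step, so running the same sweep in reverse order reproduces the proposal distribution of the backward move.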

  6. Ensemble-assisted Markov chains: in parallel
  [Diagram: pairs updated together: (0,1) → (0′,1′) uses ℱ({2,3}); (2,3) → (2′,3′) uses ℱ({0′,1′}); reverse the order for the backward evolution]
  • Embedding Markov chains in Markov chains

  7. Ensemble-assisted Markov chains: multi-state
  [Diagram: each group carries multiple states: (0,0′,1,1′) → (0′′,0′′′,1′′,1′′′) uses ℱ({2,2′,3,3′}); (2,2′,3,3′) → (2′′,2′′′,3′′,3′′′) uses ℱ({0′′,0′′′,1′′,1′′′}); reverse the order for the backward evolution]
  • Embedding Markov chains in Markov chains

  8. What kind of information from other chains?
  • How do we generate the next state?
  • Modify MD evolution:
    "Quasi-Newton MCMC", Zhang & Sutton (2011)
    "Ensemble precondition", Matthews et al (2016)
    "Quasi-Newton Langevin", Simsekli et al (2016)
    "Magnetic HMC", Tripuraneni et al (2016)
    "Wormhole", Lan et al (2013)
  • Modify Metropolis-Hastings:
    "Multi-try", Liu, Liang, and Wong (2000)
  • Other techniques? Machine learning!!!

  9. Quasi-Newton method for HMC Hamiltonian
  • BFGS approximation of the Hessian: update an old approximation to a new one, satisfying the secant condition G′ s = y:
      G′ = G + y y† / (y† s) − G s s† G / (s† G s)
    where the step s = ln(U′ U†) yields y = ∇S(U′) − ∇S(U)
  • Approximate Hessian from configs of other MC streams: repeatedly apply the update according to N_stream
  • Use the approximate Hessian for the mass matrix:
      H = S(U) + (1/2) p† G⁻¹ p
  • Note: Fourier acceleration ≃ local free-field Hessian

  10. Quasi-Newton method
  • Avoids the slowdown of steepest descent in narrow valleys
  • Caveats in the current study:
    • The approximated Hessian is global
    • We do not use the current location

  11. Benefits of rank-2 update (BFGS style)
  • Factorizable matrix (Brodlie et al 1973):
      G′ = G + w w† − z z†  →  G′ = (1 − u v†) G (1 − v u†)
  • Exactly invertible:
      G′⁻¹ = (1 − v u† / (v† u − 1)) G⁻¹ (1 − u v† / (v† u − 1))
  • Used for initializing random momenta, MD evolution, and computing the kinetic energy
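The exact invertibility of the factorized form can be checked numerically. This is a sketch: `G`, `u`, and `v` are arbitrary illustrative choices, and the closed-form inverse applies the Sherman-Morrison formula to each rank-1 factor.

```python
import numpy as np

n = 4
G = np.diag([1.0, 2.0, 3.0, 4.0])        # a positive-definite toy mass matrix
u = np.array([1.0, 2.0, 0.0, -1.0])      # illustrative vectors, with
v = np.array([0.5, -1.0, 1.0, 0.0])      # v . u = -1.5 (not 1, so factors invert)

I = np.eye(n)
Gp = (I - np.outer(u, v)) @ G @ (I - np.outer(v, u))   # factorized rank-2 update

# Sherman-Morrison inverts each rank-1 factor in closed form:
c = v @ u - 1.0
Gp_inv = (I - np.outer(v, u) / c) @ np.linalg.inv(G) @ (I - np.outer(u, v) / c)

print(np.allclose(Gp @ Gp_inv, I))  # exactly invertible: True
```

Having G′⁻¹ in this product form means momentum refreshment, MD force evaluation, and the kinetic-energy term p† G⁻¹ p / 2 all cost only a few vector operations on top of applying G⁻¹.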

  12. Gauge fixing of 2D U(1) lattice
  • Removes exact zero modes from the real Hessian
  • Frozen degrees of freedom take the same values
  • We choose maximal-tree gauge fixing
  • Fix two more non-gauge degrees of freedom

  13. Regulate the approximated Hessian matrix
  • Remove low modes in the approximate global Hessian
  • Add one more term to keep the rank-2 update:
      G′ = G + y y† / (y† s) − (1 − λ s† s / (s† G s)) G s s† G / (s† G s)
  • Works in practice, but not a strict bound
  • Caveats:
    • Mildly violates G′ s = y
    • Still no upper bound
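A toy NumPy sketch of the regulated update, not the production code; the value λ = 0.1 and the vectors below are arbitrary choices for demonstration. It shows both the λ = 0 limit (plain BFGS, exact secant condition) and the mild secant violation the slide warns about.

```python
import numpy as np

def regulated_bfgs_update(G, s, y, lam):
    # Regulator from the slide: scale the subtracted term by
    # (1 - lam * s† s / s† G s) so the low modes of G are not driven
    # toward zero; the update remains rank-2.
    Gs = G @ s
    sGs = s @ Gs
    factor = 1.0 - lam * (s @ s) / sGs
    return G + np.outer(y, y) / (y @ s) - factor * np.outer(Gs, Gs) / sGs

G = np.eye(3)
s = np.array([1.0, 0.5, -0.5])
y = np.array([2.0, 1.5, -1.0])
G0 = regulated_bfgs_update(G, s, y, lam=0.0)   # reduces to plain BFGS
G1 = regulated_bfgs_update(G, s, y, lam=0.1)
print(np.allclose(G0 @ s, y))                  # secant condition: True
print(np.allclose(G1 @ s, y))                  # mildly violated: False
```

Working it out by hand: G′ s = y + λ (s† s / s† G s) G s, so the violation is of size λ and vanishes as λ → 0, matching the "mildly violates G′ s = y" caveat.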

  14. Test on 2D U(1) theory (work in progress)
  • Fixed β = 5.8, lattice size 32 × 32
  • Serial version of the ensemble Markov chain
  • Second-order Omelyan integrator (did not tune λ)
  • Look at the autocorrelation of the topological susceptibility, ⟨Q²/V⟩
  • Topological charge: Q = (1/2π) Σ_x Arg □_x, with Arg: ℂ → (−π, π]
  • Topological charge is an exact integer with periodic boundary conditions
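The integer-valuedness of Q can be verified with a short NumPy check. This is a sketch on a random (not equilibrated) configuration; the link-angle storage layout `theta[mu, x, y]` is an assumed convention, not taken from the talk.

```python
import numpy as np

rng = np.random.default_rng(3)
L = 32
# U(1) links stored as angles theta[mu, x, y], periodic in both directions
theta = rng.uniform(-np.pi, np.pi, size=(2, L, L))

# Plaquette angle at site n: theta_x(n) + theta_y(n+x) - theta_x(n+y) - theta_y(n)
plaq = (theta[0] + np.roll(theta[1], -1, axis=0)
        - np.roll(theta[0], -1, axis=1) - theta[1])

# Arg maps each plaquette angle into the principal branch (-pi, pi]
arg = np.angle(np.exp(1j * plaq))

Q = arg.sum() / (2 * np.pi)
print(abs(Q - round(Q)) < 1e-9)  # Q is an exact integer: True
```

The reason Q is an integer: every link angle enters two plaquettes with opposite signs, so the unwrapped plaquette angles sum to zero, and applying Arg only shifts each term by a multiple of 2π.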

  15. Acceptance tuning
  [Plots: acceptance tuning for HMC with 8 streams, and QNHMC with 8 and 16 streams at μ = 0.01 and μ = 0.1]

  16. Autocorrelation of topological susceptibility
  [Plot: autocorrelation of the topological susceptibility]
  • Trajectory length has no effect on HMC
  • <2× improvement for gauge-fixed HMC (update half lattice)

  17. Autocorrelation of topological susceptibility
  [Plot: autocorrelation of the topological susceptibility for several regulator settings]
  • Cost grows if we allow lower eigenmodes
  • We need more tuning

  18. Summary & Outlook
  • We devise an algorithm creating multiple Markov chains in parallel, allowing exchange of information while generating the Markov chains
  • We modify HMC to use information from neighboring Markov chains: a BFGS-approximated Hessian as the mass matrix of the MD Hamiltonian, with a custom regulator on the approximated Hessian for stability
  • We still need more tuning and testing (parameters / observables)
  • Ways to improve the algorithm:
    • Exploit the ensemble of Markov chains (multi-scale?)
    • Other methods for constructing the mass matrix
    • Use other information / observables to augment MD / Metropolis
    • Machine learning!
