Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory - PowerPoint PPT Presentation

Accelerating MCMC algorithms by breaking detailed balance Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory Landau Institute for Theoretical Physics, Moscow In collaboration with Misha Chertkov (LANL) Marija Vucelja (Weizmann) http://arXiv.org/abs/0809.0916 Thursday, September 3, 2009

MCMC Algorithms • Problem: produce samples x from a given distribution π x defined up to a constant. • Solution: use a Markovian random walk x t converging to the required stationary distribution. • Detailed balance: T yx π x = T xy π y • Only local moves are allowed: x t+1 is close to x t Thursday, September 3, 2009

Detailed balance Random walker distribution evolves according to = ∑ P t + 1 T xy P t x y y Assuming ∑ x T xy � 1 one can rewrite the equation as T yx P t + 1 − T xy P t ∑ � � = 0 x y y Balance condition: ∀ t : P t x = π x ∑ [ T yx π x − T xy π y ] = 0 y Detailed balance (reversibility, equilibrium): T yx π x = T xy π y Sufficient but not necessary! Thursday, September 3, 2009

Loop decomposition It is useful to consider ergodic flux matrix Q xy = T xy π y Detailed balance = symmetry of ergodic flow: Q xy = Q yx Asymmetric part of Q xy can be decomposed: Q xy − Q yx = ∑ � � C α xy − C α J α yx α Here J α is amplitude of the probability flow C α xy is adjacency matrix of the loop Flow amplitudes are bounded by the reversible part Thursday, September 3, 2009

Physical analogies • PDF evolution ⇔ Diffusion-advection of passive scalar • Balance condition ⇔ Flow Incompressibility • Reversibility ⇔ Diffusion • Irreversible motion ⇔ Advection • Loops ⇔ Vortices Thursday, September 3, 2009

Slow convergence Several types of distributions are characterized by slow mixing: • Glassy landscapes: Regions that dominate the partition function are separated by “energy barriers” • Entropy barriers: Regions of high probability are separated by narrow paths (high probability but small entropy) • Single region with high probability of large size (entropy) Thursday, September 3, 2009

Acceleration with loops L T ~ L 2 T ~ L Irreversibility can significantly accelerate random walks on regular lattices. Thursday, September 3, 2009

Other approaches ? Naive way: • Exponentially many loops required for real systems • Flow amplitude can not be determined based on local information Proposed approach: • Calculate irreversible transition probabilities “on a fly” • Do not enforce the balance condition, instead compensate for compressibility Thursday, September 3, 2009

Skewed detailed balance • Create two copies of the system (‘+’ and ‘-’) • Decompose transition probabilities as T xy = T (+) + T ( − ) xy xy T (+) xy π x = T ( − ) yx π y • Compensate the compressibility by introducing transition between copies: � � Λ ( ± , ∓ ) T ( ± ) 0 , ∑ − T ∓ = max xx xy xy y Thursday, September 3, 2009

Skewed detailed balance 2 • Extended matrix satisfies balance condition and corresponds to irreversible process: T (+) Λ (+ , − ) ˆ ˆ � � ˆ T = Λ ( − , +) ˆ T ( − ) ˆ • Random walk becomes non-Markovian in original space. • System copy index is analogous to momentum in physics: diffusive motion turns into ballistic/super- diffusive. • No complexity overhead for Glauber and other local dynamics. Thursday, September 3, 2009

Spin cluster model Simple example: classical Ising model defined on a full graph: � � J ... s N = Z − � N ∑ � � s i s j π s � � i , j System experiences phase transition at J=1. Anomalous fluctuations of magnetization δ S ~ N 3/4 lead to critical slowdown of Glauber dynamics: T ~ N 3/2 Irreversible dynamics: flip only positive spins in first copy, and only negative in second. Thursday, September 3, 2009

Spin cluster model: results Dynamics is strongly accelerated: convergence time (defined via correlation function of S ) decreases to T~N 3/4 (T~N 0.85 in simulations ) 20 18 16 14 log 2 T 12 10 8 6 4 2 2 4 6 8 10 12 14 log 2 N Thursday, September 3, 2009

Ising model Two-dimensional Ising model shares a lot of properties in the critical point. One can try the same algorithm. Constant factor acceleration is observed ( ~3x ), however the constant does not depend on system size. Flipping between the copies happens too frequently ( T~L 1/2 ) � 0.3 1.7 � 0.4 1.6 � 0.5 1.5 � 0.6 1.4 � 0.7 1.3 � 0.8 200 400 600 800 1000 200 400 600 800 1000 Thursday, September 3, 2009

Extensions 1 Mix irreversible fluxes in the same direction (2N copies instead of 2) � � T ( α ) yx π x − T ( α ) Λ ( α , β ) ∑ − ∑ π x = 0 xy π y xx y β Thursday, September 3, 2009

Extensions 2 Mix irreversible fluxes in different directions (i.e. horizontal and vertical). Thursday, September 3, 2009

Extensions 3 Introduce generalized “momentum” variable. Break symmetry in momentum space. Thursday, September 3, 2009

Phase transitions • Explore the full phase space high-wavelength spatial harmonics of the order parameter (i.e. magnetization) • Make the dynamics more adaptive: Reversible dynamics : ∂ t M k = − Γ k M k + ξ k ( t ) Γ k ∼ k χ , k → 0 Irreversible (Hamiltonian): ∂ tt M k = − Γ 2 k M k • Separate momentum variables for different parts of space Thursday, September 3, 2009

Other approaches ? Broken DB: • Lifting operation ( Chen, Lovasz, Pak 99 ) - theoretical limits of acceleration ( T irr > √ T rev ). Some toy models: ( Diaconis, Holmes, Neal 97 ) .Applications to distributed computing: ( Jung, Shah, Shin 07) • Hamiltonian (Hybrid) Monte Carlo ( Horvath, Kennedy 88 ) - continuum limit of our construction. • Successive over-relaxation ( Adler 81 ), sequential updating ( Ren, Orkoulas 06 ) - another way of producing irreversible fluxes. Reversible algorithms: • Cluster algorithms ( Swendsen, Wang 87 ) - teleportation instead of ballistic motion • Simulated tempering ( Marinari, Parisi 92 ) - several copies of the system, but with different distributions Thursday, September 3, 2009

Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory - PowerPoint PPT Presentation

Accelerating MCMC algorithms by breaking detailed balance Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory Landau Institute for Theoretical Physics, Moscow In collaboration with Misha Chertkov (LANL) Marija Vucelja (Weizmann)

Marija Vucelja Gregory Falkovich Konstantin S. Turitsyn Courant Institute of Mathematical

Konstantin Ignatyev konstantin@inspur.com Data Acquisition Data Storage OLAP OLTP ETL (batch)

Resource Access Decision Facility: Overview Konstantin Beznosov Baptist Health Systems of South

Remake build system for animation projects Konstantin Dmitriev ksee.zelgadis@gmail.com Hi! My

IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR.

Representations of the virtual braid groups to the rook algebras Konstantin Gotin Novosibirsk

When Regulations Backfire: The Case of the Community Reinvestment Act Konstantin Golyaev

Inductive Learning and Ockhams Razor Konstantin Genin Kevin T. Kelly Carnegie Mellon

A Type System for Format Strings Konstantin Weitz weitzkon@uw.edu Gene Kim genelkim@uw.edu

ASSESSMENT IN THE WORKPLACE Konstantin von Vietinghoff-Scheel Chair of Health & Safety

on Cell-Free Extracts Pitt iGEM 2015 Konstantin Borisov, Robert Donahue, Garrett Green, Apurva

Machine Learning in Spam Filtering A Crash Course in ML Konstantin Tretyakov kt@ut.ee

The The Hadoop Di adoop Dist stri ributed buted Fi File le System System Konstantin

Evaluation of Intersect for polymer flood full field simulations Konstantin MONASTYREV Adolfo

Deep Learning Algorithms for Recognition of Facial Ageing Features Konstantin Kiselev Research

Metadata locking in MySQL 5.5 Konstantin Osipov Engineer @ Mail.Ru kostja.osipov@gmail.com

DATA ANALYTICS USING DEEP LEARNING GT 8803 // FALL 2018 // JOY ARULRAJ L E C T U R E # 0 2 : A

Optimal ranking of online search requests for long-term revenue maximization Pierre LEcuyer

Machine Learning 11 AI Slides (6e) c Lin Zuoquan@PKU 1998-2020 11 1 11 Machine Learning

Policy tradeoffs under risk of abrupt climate change by Y. Tsur and A. Zemel Comments by

AST 1420 Galactic Structure and Dynamics Today: dynamics of stars in galactic disks Last

Adventures of a Long-Range Walker Thierry DAUXOIS CNRS & ENS Lyon 1 Stefano Ruffo

Hoare logic Lecture 5: Introduction to separation logic Jean Pichon-Pharabod University of

Outline Introduction Operating System Security Memory protection CS 239

Sambuz

Useful Links

Newsletter

Mail Us

Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory - PowerPoint PPT Presentation

Accelerating MCMC algorithms by breaking detailed balance Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory Landau Institute for Theoretical Physics, Moscow In collaboration with Misha Chertkov (LANL) Marija Vucelja (Weizmann)

Marija Vucelja Gregory Falkovich Konstantin S. Turitsyn Courant Institute of Mathematical

Konstantin Ignatyev konstantin@inspur.com Data Acquisition Data Storage OLAP OLTP ETL (batch)

Resource Access Decision Facility: Overview Konstantin Beznosov Baptist Health Systems of South

Remake build system for animation projects Konstantin Dmitriev ksee.zelgadis@gmail.com Hi! My

IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR.

Representations of the virtual braid groups to the rook algebras Konstantin Gotin Novosibirsk

When Regulations Backfire: The Case of the Community Reinvestment Act Konstantin Golyaev

Inductive Learning and Ockhams Razor Konstantin Genin Kevin T. Kelly Carnegie Mellon

A Type System for Format Strings Konstantin Weitz weitzkon@uw.edu Gene Kim genelkim@uw.edu

ASSESSMENT IN THE WORKPLACE Konstantin von Vietinghoff-Scheel Chair of Health &amp; Safety

on Cell-Free Extracts Pitt iGEM 2015 Konstantin Borisov, Robert Donahue, Garrett Green, Apurva

Machine Learning in Spam Filtering A Crash Course in ML Konstantin Tretyakov kt@ut.ee

The The Hadoop Di adoop Dist stri ributed buted Fi File le System System Konstantin

Evaluation of Intersect for polymer flood full field simulations Konstantin MONASTYREV Adolfo

Deep Learning Algorithms for Recognition of Facial Ageing Features Konstantin Kiselev Research

Metadata locking in MySQL 5.5 Konstantin Osipov Engineer @ Mail.Ru kostja.osipov@gmail.com

DATA ANALYTICS USING DEEP LEARNING GT 8803 // FALL 2018 // JOY ARULRAJ L E C T U R E # 0 2 : A

Optimal ranking of online search requests for long-term revenue maximization Pierre LEcuyer

Machine Learning 11 AI Slides (6e) c Lin Zuoquan@PKU 1998-2020 11 1 11 Machine Learning

Policy tradeoffs under risk of abrupt climate change by Y. Tsur and A. Zemel Comments by

AST 1420 Galactic Structure and Dynamics Today: dynamics of stars in galactic disks Last

Adventures of a Long-Range Walker Thierry DAUXOIS CNRS &amp; ENS Lyon 1 Stefano Ruffo

Hoare logic Lecture 5: Introduction to separation logic Jean Pichon-Pharabod University of

Outline Introduction Operating System Security Memory protection CS 239

Sambuz

Useful Links

Newsletter

Mail Us

ASSESSMENT IN THE WORKPLACE Konstantin von Vietinghoff-Scheel Chair of Health & Safety

Adventures of a Long-Range Walker Thierry DAUXOIS CNRS & ENS Lyon 1 Stefano Ruffo