Stan: Probabilistic Modeling Language, MCMC Sampler, and Optimizer
Development Team: Andrew Gelman, Bob Carpenter, Matt Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li, Allen Riddell
MCMski 2014
mc-stan.org
Goals / Aims
• Scalability – model complexity, number of parameters, data size
• Efficiency – fast iterations, low memory, high effective sample sizes
• Robustness – numerical routines, model structure (i.e., posterior geometry)
• Usability – general purpose, clear modeling language, integration (R, Python, command line), exposed log prob & gradients/Hessians & I/O
History
• Derived from BUGS
• declarative → imperative
• untyped → strong static typing
• Gibbs sampling → adaptive (R)HMC & optimization
• interpreted → compiled
• restrictive licenses (proprietary/GPL) → liberal (BSD)
Technical Implementation
• Model Specification
  – (trans) data, (trans) parameters, log prob, generated quantities (see the sketch after this list)
• Sampling via Adaptive Hamiltonian Monte Carlo
  – warmup converges & estimates mass matrix and step size
  – (Geo)NUTS adapts the number of steps
• Optimization via BFGS Quasi-Newton
• Translated to C++ with Template Metaprogramming
  – constraints to transforms + Jacobians; declarations to I/O
  – automatic differentiation for gradients & Hessians
  – custom probability and special functions
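As a concrete illustration of the block structure above, here is a minimal Bernoulli program (a generic sketch in present-day Stan syntax, not code from the slides): the constrained declaration of theta is the kind of declaration the compiler turns into an unconstrained transform plus Jacobian adjustment, the model block defines the log probability, and generated quantities produces a posterior predictive draw.

data {
  int<lower=0> N;                    // number of trials
  array[N] int<lower=0, upper=1> y;  // binary outcomes
}
parameters {
  real<lower=0, upper=1> theta;      // constraint becomes a logit transform + Jacobian
}
model {
  theta ~ beta(1, 1);                // prior adds to the log probability
  y ~ bernoulli(theta);              // likelihood adds to the log probability
}
generated quantities {
  int y_rep = bernoulli_rng(theta);  // posterior predictive draw per iteration
}

The same program can be run through the sampler or the optimizer from R, Python, or the command line.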
Strengths
• high effective sample size/second (HMC / RHMC)
• expressive language vs. BUGS; extensible like JAGS
• extensive documentation & example models
• active, helpful user community
• large, diverse development team
• integrated into R, Python, command line (shell)
• reusable template library (auto-diff, distributions & functions, models)
Limitations
• no discrete parameters (but they can be marginalized out; see the sketch after this list)
• no implicit missing data (missing values must be coded as parameters; see the sketch after this list)
• not parallelized within chains
• language limited relative to black-box samplers (cf. emcee)
• limited data types and constraints
• C++ template code is complex for user extension
• sampling can be slow and does not scale to very large problems; optimization is brittle or approximate
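The first two limitations have standard workarounds, sketched below (an illustrative two-component normal mixture in present-day Stan syntax; the model and variable names are not from the slides): the discrete component indicator is marginalized out with log_sum_exp, and missing observations are declared explicitly as parameters so they are sampled along with everything else.

data {
  int<lower=1> N_obs;             // observed outcomes
  int<lower=0> N_miss;            // number of missing outcomes
  vector[N_obs] y_obs;
}
parameters {
  real<lower=0, upper=1> theta;   // mixing proportion
  ordered[2] mu;                  // component locations
  real<lower=0> sigma;
  vector[N_miss] y_miss;          // missing data coded explicitly as parameters
}
model {
  mu ~ normal(0, 10);
  sigma ~ cauchy(0, 5);
  // discrete component indicator summed out of the log probability
  for (n in 1:N_obs)
    target += log_sum_exp(log(theta) + normal_lpdf(y_obs[n] | mu[1], sigma),
                          log1m(theta) + normal_lpdf(y_obs[n] | mu[2], sigma));
  for (n in 1:N_miss)
    target += log_sum_exp(log(theta) + normal_lpdf(y_miss[n] | mu[1], sigma),
                          log1m(theta) + normal_lpdf(y_miss[n] | mu[2], sigma));
}

Declaring mu as ordered breaks the label-switching symmetry of the mixture so the two components stay identified.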
Current and Future Development
• (stiff) differential equation solving by numerical integration
• Riemann manifold HMC (more complex geometry)
• approximate inference: [stochastic] VB, EP, max marginal
• structured matrices: Cholesky correlation, sparse
• L-BFGS optimization (more scalable)
• more robust adaptation (cross-chain?)
• parallelization within and across chains
• better probabilistic testing for correctness
• faster, cleaner C++ code & more useful interfaces
How Stan Got its Name
• “Stan” is not an acronym; Gelman mashed up
  1. the Eminem song about a stalker fan, and
  2. Stanislaw Ulam (1909–1984), co-inventor of the Monte Carlo method (and the hydrogen bomb).
[Photo: Ulam holding the Fermiac, Enrico Fermi’s physical Monte Carlo simulator for random neutron diffusion]