Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model
Atılım Güneş Baydin, Lukas Heinrich, Wahid Bhimji, Lei Shao, Saeid Naderiparizi, Andreas Munk, Jialin Liu, Bradley Gram-Hansen, Gilles Louppe, Lawrence Meadows, Philip Torr, Victor Lee, Prabhat, Kyle Cranmer, Frank Wood
Probabilistic programming
Deep learning vs. probabilistic programming
In deep learning, the model is learned from data as a differentiable transformation:
● Inputs → neural network (a differentiable program) → outputs
● The actual learned model is difficult to interpret
In probabilistic programming, the model is defined as a structured generative program:
● Inputs → model / probabilistic program / simulator → outputs
Probabilistic programming
A probabilistic model is a joint distribution of random variables:
● Latent (hidden, unobserved) variables
● Observed variables (data)
Probabilistic graphical models use graphs to express conditional dependence:
● Bayesian networks (directed)
● Markov random fields (undirected)
Probabilistic programming extends this to "ordinary programming with two added constructs":
● Sampling from distributions
● Conditioning by specifying observed values
(A minimal sketch of the two constructs follows below.)
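As a minimal, self-contained sketch of these two constructs (function names are illustrative, not taken from any particular PPL): `sample` draws a latent variable from its prior, and `observe` conditions the program by scoring recorded data under the model.

```python
import math
import random

# Illustrative sketch of the "two added constructs"; not any specific PPL's API.

def sample_normal(mu, sigma):
    """Sampling construct: draw a latent variable from its prior."""
    return random.gauss(mu, sigma)

def observe_normal(value, mu, sigma):
    """Conditioning construct: score observed data under the model,
    returning the log-likelihood of the observation."""
    return -0.5 * ((value - mu) / sigma) ** 2 - math.log(sigma * math.sqrt(2 * math.pi))

def model(observed_y=0.8):
    x = sample_normal(0.0, 1.0)                 # latent ~ prior N(0, 1)
    log_w = observe_normal(observed_y, x, 0.5)  # condition on the data
    return x, log_w
```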
Inference
Use your model to analyze (explain) given data, as the posterior distribution of the latents conditioned on the observations:
p(x | y) = p(y | x) p(x) / p(y)
● Prior p(x): describes the latents
● Likelihood p(y | x): how the data depend on the latents
● Posterior p(x | y): distribution of the latents describing the given data
See the Edward tutorials for a good intro: http://edwardlib.org/tutorials/
Inference
● Run the model / probabilistic program / simulator many times
● Record the execution traces
● Approximate the posterior by weighting simulated data against the observed data
This is importance sampling; other inference engines run differently. A minimal sketch follows below.
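A minimal, self-contained sketch of this scheme, as self-normalized importance sampling with the prior as the proposal (the toy model and all names are illustrative):

```python
import math
import random

def run_trace(observed_y):
    """One execution trace: sample the latent from its prior and score
    the observation under the likelihood."""
    x = random.gauss(0.0, 1.0)                    # latent ~ prior N(0, 1)
    log_w = -0.5 * ((observed_y - x) / 0.5) ** 2  # log-likelihood, up to a constant
    return x, log_w

def posterior_mean(observed_y, num_traces=10000):
    """Run the model many times, record the traces, and approximate the
    posterior by weighting each trace by its likelihood."""
    traces = [run_trace(observed_y) for _ in range(num_traces)]
    max_lw = max(lw for _, lw in traces)          # subtract max for numerical stability
    weights = [math.exp(lw - max_lw) for _, lw in traces]
    return sum(w * x for (x, _), w in zip(traces, weights)) / sum(weights)

print(posterior_mean(0.8))  # close to the analytic posterior mean of 0.64
```

Each call to `run_trace` is one independent execution of the model, which is why this scheme parallelizes trivially.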
Inference reverses the generative process
● Generative model / simulator (e.g., Sherpa, Geant): inputs → simulated data (detector response)
● Real-world system: inputs → observed data (detector response)
Inference runs backwards, from observed data to the inputs/latents.
Inference: live demo
Inference engines
● Markov chain Monte Carlo
  ○ Probprog-specific variants:
    ■ Lightweight Metropolis–Hastings
    ■ Random-walk Metropolis–Hastings
  ○ Sequential: samples form a chain
  ○ Autocorrelation in samples
  ○ "Burn-in" period
● Importance sampling
  ○ Propose from the prior
  ○ Or use a learned proposal parameterized by the observations
  ○ No autocorrelation or burn-in
  ○ Each sample is independent (parallelizable)
● Others: variational inference, Hamiltonian Monte Carlo, etc.
We sample in trace space: each sample (trace) is one full execution of the model/simulator! A minimal Metropolis–Hastings sketch follows below.
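For contrast with importance sampling, a minimal random-walk Metropolis–Hastings sketch on the same toy model (one latent variable stands in for a full execution trace; everything here is illustrative). The autocorrelation and burn-in mentioned above are visible directly in the code:

```python
import math
import random

def log_joint(x, observed_y=0.8):
    """Log prior N(0, 1) plus log-likelihood N(x, 0.5), up to constants."""
    return -0.5 * x ** 2 - 0.5 * ((observed_y - x) / 0.5) ** 2

def random_walk_mh(num_samples=10000, step=0.5):
    x = 0.0                      # arbitrary initial state
    samples = []
    for _ in range(num_samples):
        proposal = x + random.gauss(0.0, step)  # random-walk proposal
        # Accept with probability min(1, p(proposal) / p(x))
        if math.log(random.random()) < log_joint(proposal) - log_joint(x):
            x = proposal
        samples.append(x)        # consecutive samples are correlated
    burn_in = 1000               # discard the initial transient
    return samples[burn_in:]

chain = random_walk_mh()
print(sum(chain) / len(chain))   # again close to 0.64
```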
Probabilistic programming languages (PPLs) Anglican (Clojure) ● Church (Scheme) ● Edward, TensorFlow Probability (Python, TensorFlow) ● Pyro (Python, PyTorch) ● Figaro (Scala) ● Infer.NET (C#) ● LibBi (C++ template library) ● PyMC3 (Python) ● Stan (C++) ● WebPPL (JavaScript) ● For more, see http://probabilistic-programming.org 18
Existing simulators as probabilistic programs
Execute existing simulators as probprog
A stochastic simulator implicitly defines a probability distribution by sampling (pseudo-)random numbers → it already satisfies one requirement for probprog.
Key idea (see the sketch after this list):
● Interpret all RNG calls as sampling from a prior distribution
● Introduce conditioning functionality to the simulator
● Execute under the control of general-purpose inference engines
● Get posterior distributions over all simulator latents, conditioned on observations
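A minimal sketch of the key idea (all names illustrative): route the simulator's RNG calls through a hook, so each call is recorded as a prior sample at a named address, and an inference engine could later substitute proposed values at those same addresses.

```python
import random

class InferenceHook:
    """Intercepts the simulator's RNG calls. In prior mode it draws and
    records; an inference engine could instead supply proposed values."""
    def __init__(self):
        self.trace = []  # recorded (address, value) pairs

    def gauss(self, address, mu, sigma):
        value = random.gauss(mu, sigma)  # interpreted as a prior sample
        self.trace.append((address, value))
        return value

def simulator(rng):
    # An existing simulator, with its RNG calls routed through the hook
    energy = rng.gauss("initial_energy", 100.0, 10.0)
    smeared = rng.gauss("detector_smearing", energy, 5.0)
    return smeared

hook = InferenceHook()
print(simulator(hook), hook.trace)
```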
Execute existing simulators as probprog
A stochastic simulator implicitly defines a probability distribution by sampling (pseudo-)random numbers → it already satisfies one requirement for probprog.
Advantages:
● A vast body of existing scientific simulators (accurate generative models) with years of development: MadGraph, Sherpa, Geant4
● Enables model-based (Bayesian) machine learning in these simulators
● Explainable predictions reaching directly into the simulator (the simulator is not used as a black box)
● Results still come from the simulator and remain physically meaningful
Coupling probprog and simulators
Several things are needed:
● A PPL with simulator control incorporated into its design → pyprob
● A language-agnostic interface for connecting PPLs to simulators → PPX, the Probabilistic Programming eXecution protocol
● Front ends in languages commonly used for coding simulators → pyprob_cpp
pyprob
https://github.com/probprog/pyprob
A PyTorch-based PPL. Inference engines:
● Markov chain Monte Carlo
  ○ Lightweight Metropolis–Hastings (LMH)
  ○ Random-walk Metropolis–Hastings (RMH)
● Importance sampling
  ○ Regular (proposals from the prior)
  ○ Inference compilation (IC)
● Hamiltonian Monte Carlo (in progress)
Le, Baydin and Wood. Inference Compilation and Universal Probabilistic Programming. AISTATS 2017.
A usage sketch follows below.
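A usage sketch closely following the "Gaussian with unknown mean" example in the pyprob repository (exact method and argument names may differ between pyprob versions):

```python
import pyprob
from pyprob import Model
from pyprob.distributions import Normal

class GaussianUnknownMean(Model):
    def __init__(self):
        super().__init__(name='Gaussian with unknown mean')

    def forward(self):
        mu = pyprob.sample(Normal(1., 5.))      # latent, sampled from its prior
        likelihood = Normal(mu, 2.)
        pyprob.observe(likelihood, name='obs')  # conditioned on data at inference time
        return mu

model = GaussianUnknownMean()
posterior = model.posterior_distribution(
    num_traces=1000,
    inference_engine=pyprob.InferenceEngine.IMPORTANCE_SAMPLING,
    observe={'obs': 8.})
print(posterior.mean)
```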
PPX
https://github.com/probprog/ppx
The Probabilistic Programming eXecution protocol:
● Cross-platform, via flatbuffers: http://google.github.io/flatbuffers/
● Supported languages: C++, C#, Go, Java, JavaScript, PHP, Python, TypeScript, Rust, Lua
● Similar in spirit to the Open Neural Network Exchange (ONNX) for deep learning
● Enables inference engines and simulators to be implemented in different programming languages and executed in separate processes, or on separate machines across networks
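A schematic of the control flow such an execution protocol enables (the message format below is a hypothetical placeholder, not PPX's actual flatbuffers schema; a real deployment exchanges messages over a socket rather than an in-process queue): the simulator blocks on each random choice and asks the inference engine, running elsewhere, what value to use.

```python
import random
import threading
from queue import Queue

requests, replies = Queue(), Queue()

def simulator():
    """Runs in one process/language; blocks on every random choice and
    asks the inference engine what value to use."""
    requests.put({'type': 'sample', 'address': 'energy',
                  'dist': ('normal', 100.0, 10.0)})
    energy = replies.get()
    requests.put({'type': 'observe', 'address': 'obs',
                  'dist': ('normal', energy, 5.0), 'value': 97.3})
    requests.put({'type': 'done', 'result': energy})

def inference_engine():
    """Runs elsewhere; controls the simulator by answering its requests."""
    while True:
        msg = requests.get()
        if msg['type'] == 'sample':
            _, mu, sigma = msg['dist']
            replies.put(random.gauss(mu, sigma))  # e.g., propose from the prior
        elif msg['type'] == 'observe':
            pass  # here the engine would score msg['value'] to weight the trace
        elif msg['type'] == 'done':
            return msg['result']

threading.Thread(target=simulator).start()
print(inference_engine())
```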
(Diagram: PPX connects the inference engine to existing simulators, e.g., SHERPA, GEANT)
pyprob_cpp
https://github.com/probprog/pyprob_cpp
A lightweight C++ front end for PPX. A Python-side usage sketch follows below.
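On the Python side, a simulator served through pyprob_cpp is driven with pyprob's remote-model support; a sketch in the spirit of the pyprob_cpp README (the server address and observation name are placeholders, and method names may differ between versions):

```python
import pyprob
from pyprob import RemoteModel

# Connect to a C++ simulator started separately via pyprob_cpp, e.g.
#   ./my_simulator ipc://@my_simulator
# (the address below is a placeholder)
model = RemoteModel('ipc://@my_simulator')

posterior = model.posterior_distribution(
    num_traces=1000,
    inference_engine=pyprob.InferenceEngine.IMPORTANCE_SAMPLING,
    observe={'obs': 97.3})  # 'obs' must match the name used in the C++ model
print(posterior.mean)
```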
Probprog and high-energy physics: "etalumis" ("simulate" spelled backwards)
etalumis | simulate
Andreas Munk, Wahid Bhimji, Lei Shao, Lukas Heinrich, Atılım Güneş Baydin, Saeid Naderiparizi, Larry Meadows, Bradley Gram-Hansen, Gilles Louppe, Jialin Liu, Victor Lee, Kyle Cranmer, Prabhat, Phil Torr, Frank Wood
Cori supercomputer, Lawrence Berkeley Lab: 2,388 Haswell nodes (32 cores per node); 9,688 KNL nodes (68 cores per node)
pyprob_cpp and Sherpa