LR-GLM: High-Dimensional Bayesian Inference Using Low-Rank Data Approximations
Brian Trippe, Jonathan Huggins, Raj Agrawal, and Tamara Broderick

Genomic Study (motivating example)
- Goal: understand the relationship between genomic variation and disease outcome (cases vs. controls, diseased vs. healthy)
- N = 20,000 samples, D = 500,000 SNPs
- Image credit: https://www.ebi.ac.uk/training/

Generalized Linear Models (GLMs)
- Interpretability
- E.g. logistic, Poisson, and negative binomial regression

Bayesian Modeling & Inference
- Coherent uncertainty quantification

Problem: inference scales super-linearly with D

We present LR-GLM, a method with linear scaling in D and theoretical guarantees on approximation quality
How does it work?

Cartoon Example
- Logistic regression with two correlated features

[Figure: posterior uncertainty in effect sizes; the data carry lots of information along one direction and little information along the orthogonal direction]

The LR-GLM Approximation
We ignore the least informative directions of the data:

    p(y_i | x_i^T β) ≈ p(y_i | x_i^T U U^T β)

Approximation Quality
- Exact when the data are low rank
- We prove: the approximation is close when the data are approximately low rank
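The likelihood approximation above can be sketched in a few lines of NumPy. This is an illustrative sketch, not the authors' code: the function names (`make_projection`, `approx_log_lik`) are invented here, and the full SVD stands in for the scalable (e.g. randomized) factorization one would use when D is large.

```python
import numpy as np

def make_projection(X, M):
    """Top-M right singular vectors of the N x D data matrix X.

    The columns of the returned D x M matrix U span the M most
    informative directions of the data; the rest are discarded.
    """
    # A randomized or truncated SVD would be used at scale; the
    # full SVD keeps this sketch self-contained.
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    return Vt[:M].T  # D x M

def approx_log_lik(beta, X, y, U):
    """LR-GLM log-likelihood log p(y_i | x_i^T U U^T beta) for
    logistic regression with labels y in {0, 1}."""
    z = (X @ U) @ (U.T @ beta)  # logits through the low-rank map
    return np.sum(y * z - np.log1p(np.exp(z)))
```

Note that `X @ U` (N x M) can be precomputed once, so each subsequent likelihood or gradient evaluation costs O(NM + DM) rather than O(ND), which is the source of the linear-in-D scaling. When M equals the rank of X, the projection U Uᵀ acts as the identity on the row space and the approximation is exact, matching the claim above.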
Does it Work?

We evaluate by comparing exact posterior means and uncertainties (slow) against our approximation (fast).

[Figure: approximate vs. exact posterior mean estimates, and approximate vs. exact posterior uncertainty estimates]

We rigorously show…
- The rank of the approximation defines a computational-statistical trade-off
- The approximation is conservative (it overestimates uncertainty)
- For high-dimensional, correlated data, LR-GLM closely approximates the exact posterior up to 5X faster

LR-GLM: High-Dimensional Bayesian Inference Using Low-Rank Data Approximations
Brian L. Trippe, Jonathan H. Huggins, Raj Agrawal, and Tamara Broderick
Paper: proceedings.mlr.press/v97/trippe19a
Poster: Pacific Ballroom #214
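The conservativeness claim can be checked on a toy conjugate model, where both posteriors are available in closed form. This is a sketch under stated assumptions, not the paper's experiment: a Gaussian linear model y ~ N(Xβ, I) with prior β ~ N(0, I), two strongly correlated features, and a rank-1 LR-GLM projection.

```python
import numpy as np

def posterior_cov(X, noise_var=1.0, prior_var=1.0):
    """Posterior covariance for Bayesian linear regression,
    y ~ N(X beta, noise_var I), beta ~ N(0, prior_var I)."""
    D = X.shape[1]
    return np.linalg.inv(np.eye(D) / prior_var + X.T @ X / noise_var)

rng = np.random.default_rng(0)
z = rng.normal(size=(50, 1))
X = np.hstack([z, z + 0.05 * rng.normal(size=(50, 1))])  # correlated columns

# Rank-1 projection onto the top right singular vector of X.
_, _, Vt = np.linalg.svd(X, full_matrices=False)
U = Vt[:1].T

cov_exact = posterior_cov(X)            # exact likelihood
cov_lr = posterior_cov(X @ U @ U.T)     # LR-GLM likelihood
```

Because U Uᵀ Xᵀ X U Uᵀ ⪯ Xᵀ X in the positive semidefinite order, the LR-GLM posterior precision is never larger than the exact one, so its marginal variances are never smaller: the approximation errs on the side of extra uncertainty rather than overconfidence.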