Truncated Random Measures Jonathan Huggins MIT CSAIL and Dept. of - PowerPoint PPT Presentation

Inference in BNP models • Option #1: Integrate out the parameter (CRP, IBP, etc.)   issues: care about the parameters, using approximations (HMC/VB), distributed computation • Option #2: use a finite approximation...   with e.g. variational inference, HMC   All BNP priors [Blei 06; Neal 10] Problem: Priors with finite approx (new) Wide variety of priors in Previously studied priors BNP with no finite with finite approx (past work) approximation Contributions: ● 2 representation forms (7 reps total) that allow finite approximation of (normalized) completely random measures ( (N)CRMs ) ● Approximation error analysis

Inference in BNP models • Option #1: Integrate out the parameter (CRP, IBP, etc.)   issues: care about the parameters, using approximations (HMC/VB), distributed computation • Option #2: use a finite approximation...   with e.g. variational inference, HMC   All BNP priors [Blei 06; Neal 10] Problem: Priors with finite approx (new) Wide variety of priors in Previously studied priors BNP with no finite with finite approx (past work) approximation Contributions: ● 2 representation forms (7 reps total) that allow finite approximation of (normalized) completely random measures ( (N)CRMs ) ● Approximation error analysis ● Computational complexity analysis (not in this talk)

Past work: finite approximations to BNP priors Finite Approximation Computational Approximation Error Bounds Complexity ✓ ✓ ✓ DP ✓ ✓ ✓ BP ✓ BPP ✓ ✓ ✓ 𝚫 P ✓ (N)CRM

Past work: finite approximations to BNP priors Finite Approximation Computational Approximation Error Bounds Complexity ✓ ✓ ✓ DP [Sethuraman 94] [Ishwaran 01] [Roychowdhury 15] ✓ ✓ ✓ BP [Teh 07] [Doshi-Velez 09] [Paisley 12] [Paisley 12] [Thibaux 07] ✓ BPP [Broderick 14] ✓ ✓ ✓ 𝚫 P [Bondesson 82] [Roychowdhury 15] [Roychowdhury 15] ✓ (N)CRM [Broderick 14]

Past work: finite approximations to BNP priors Finite Approximation Computational Approximation Error Bounds Complexity ✓ ✓ ✓ DP [Sethuraman 94] [Ishwaran 01] [Roychowdhury 15] ✓ ✓ ✓ BP [Teh 07] [Doshi-Velez 09] [Paisley 12] [Paisley 12] Sparse results for a few [Thibaux 07] ✓ BPP priors in BNP [Broderick 14] ✓ ✓ ✓ 𝚫 P [Bondesson 82] [Roychowdhury 15] [Roychowdhury 15] ✓ (N)CRM [Broderick 14]

Past work: finite approximations to BNP priors Finite Approximation Computational Approximation Error Bounds Complexity ✓ ✓ ✓ DP [Sethuraman 94] [Ishwaran 01] [Roychowdhury 15] ✓ ✓ ✓ BP [Teh 07] [Doshi-Velez 09] [Paisley 12] [Paisley 12] Sparse results for a few [Thibaux 07] ✓ BPP priors in BNP [Broderick 14] ✓ ✓ ✓ 𝚫 P [Bondesson 82] [Roychowdhury 15] [Roychowdhury 15] ✓ (N)CRM No general theory [Broderick 14]

Truncation Roadmap

Truncation Roadmap Tractable models in BNP

Truncation Roadmap Tractable models in BNP two forms for sequential representations

Truncation Roadmap Tractable models in BNP two forms for sequential representations Truncation and error analysis

The Standard Model in BNP (By Example ) politics s t r d o o … p o s f 343 189 Doc 1 (532 words) 210 Doc 2 (210 words) 854 Doc 3 (854 words) 342 584 Doc 4 (926 words) … 0.7 0.5 0.2

The Standard Model in BNP (By Example ) frequency space politics s t r d o o … p o s f 343 189 Doc 1 (532 words) 210 Doc 2 (210 words) 854 Doc 3 (854 words) topic   342 584 space Doc 4 (926 words) … 0.7 0.5 0.2

The Standard Model in BNP (By Example ) frequency space politics s t r d o o … p o s f 343 189 Doc 1 (532 words) 0.7 210 Doc 2 (210 words) sports 854 Doc 3 (854 words) topic   342 584 space Doc 4 (926 words) … 0.7 0.5 0.2

The Standard Model in BNP (By Example ) frequency space politics s t r d o o … p o Θ s f 343 189 Doc 1 (532 words) 0.7 210 Doc 2 (210 words) sports 854 Doc 3 (854 words) topic   342 584 space Doc 4 (926 words) ϴ is a random … discrete measure on the topics 0.7 0.5 0.2

The Standard Model in BNP (By Example ) rate   space “traits” ψ 1 ψ 2 ψ 3 … Θ 343 189 Obs 1 210 Obs 2 854 Obs 3 trait   342 584 space Obs 4 ϴ is a random … discrete measure θ 1 θ 2 θ 3 on the topics topics traits “rates”

Poisson processes and (N)CRMs How do we generate infinitely many trait/rate points ( 𝜔 , 𝜄 )? rate   space trait space

Poisson processes and (N)CRMs How do we generate infinitely many trait/rate points ( 𝜔 , 𝜄 )? Poisson point process with measure 𝜉 (d 𝜄 x d 𝜔 ): rate   space trait space [Kingman 93]

Poisson processes and (N)CRMs How do we generate infinitely many trait/rate points ( 𝜔 , 𝜄 )? Poisson point process with measure 𝜉 (d 𝜄 x d 𝜔 ): rate   space Θ completely random measure (CRM) (e.g. BP, 𝚫 P) trait space [Kingman 93]

Poisson processes and (N)CRMs How do we generate infinitely many trait/rate points ( 𝜔 , 𝜄 )? Poisson point process with measure 𝜉 (d 𝜄 x d 𝜔 ): rate   space Θ completely random measure (CRM) (e.g. BP, 𝚫 P) trait space Normalize rates: normalized CRM (NCRM) (e.g. DP) [Kingman 93]

Poisson processes and (N)CRMs How do we generate infinitely many trait/rate points ( 𝜔 , 𝜄 )? Poisson point process with measure 𝜉 (d 𝜄 x d 𝜔 ): rate   space Θ completely random measure (CRM) (e.g. BP, 𝚫 P) trait space Normalize rates: normalized CRM (NCRM) (e.g. DP) Captures a large class of useful priors in BNP [Kingman 93]

Poisson processes and (N)CRMs How do we generate infinitely many trait/rate points ( 𝜔 , 𝜄 )? Poisson point process with measure 𝜉 (d 𝜄 x d 𝜔 ): rate   space Θ completely random measure (CRM) (e.g. BP, 𝚫 P) trait space Normalize rates: normalized CRM (NCRM) (e.g. DP) Captures a large class of useful priors in BNP How do we pick a finite subset of the points? [Kingman 93]

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: rate   space Θ trait space

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: 1) ordering the atoms (sequential representation) rate   space Θ trait space

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: 1) ordering the atoms (sequential representation) rate   space Θ 1 trait space

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: 1) ordering the atoms (sequential representation) rate   space Θ 1 2 trait space

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: 1) ordering the atoms (sequential representation) rate   space Θ 1 3 2 trait space

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: 1) ordering the atoms (sequential representation) rate   space 4 Θ 1 3 2 trait space

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: 1) ordering the atoms (sequential representation) rate   space 4 Θ 1 3 K 2 trait space

Sequential representation & truncation We pick a finite subset of atoms ( 𝜔 , 𝜄 ) by: 1) ordering the atoms (sequential representation) 2) removing any atoms beyond the K-th (truncation) rate   space 4 Θ 1 3 K 2 trait space

Ordering of (N)CRM atoms We describe 2 forms for sequential representations

Ordering of (N)CRM atoms We describe 2 forms for sequential representations Series representation   function of a homogenous   Poisson point process   (4 versions)

Ordering of (N)CRM atoms We describe 2 forms for sequential representations Superposition representation   Series representation   infinite sum of homogenous function of a homogenous   CRMs, each with finite # of atoms   Poisson point process   (3 versions) (4 versions)

Ordering of (N)CRM atoms We describe 2 forms for sequential representations Superposition representation   Series representation   infinite sum of homogenous function of a homogenous   CRMs, each with finite # of atoms   Poisson point process   (3 versions) (4 versions) Theorem (H., Campbell, How, Broderick).   Can generate (N)CRMs using all 7 sequential representations

Sequential representation comparison Why so many representations?

Sequential representation comparison Why so many representations? They’re all useful in different circumstances

Sequential representation comparison Why so many representations? They’re all useful in different circumstances Series Reps Superposition Reps B-Rep IL-Rep R-Rep T-Rep DB-Rep PL-Rep SB-Rep ✓ ✓ ✓ / ✗ ✓ ✓ Error ✗ ✗ Bound (exp) (exp) (exp) (exp) Decay ✓ ✓ ✓ Ease of ✗ ✗✗ ✗ ✗ Analysis ✓ ✓ ✓ ✓ ✓ ✓ ✓ Generality ✓ ✓ ✗ ✗ ✗ ✗ ✗ Known # Atoms

Sequential representation example Given Gamma process:

Sequential representation example Given Gamma process: Step 1: compute

Sequential representation example Given Gamma process: Step 1: compute Step 2: compute

Sequential representation example Given Gamma process: Step 1: compute Step 2: compute Exponential( 𝜇 ) density!

Sequential representation example Given Gamma process: Step 1: compute Step 2: compute Exponential( 𝜇 ) Step 3: plug in! density!

Choosing between the seven representations How close is our finite approximation?

Choosing between the seven representations How close is our finite approximation? Truncation error:

Choosing between the seven representations How close is our finite approximation? Truncation error: truncated full infinite ϴ K ϴ

Choosing between the seven representations How close is our finite approximation? Truncation error: truncated full infinite ϴ K ϴ generated data generated data

Choosing between the seven representations How close is our finite approximation? Truncation error: truncated full infinite ϴ K ϴ generated data generated data Compare the distribution of the data under full vs. truncated

Choosing between the seven representations How close is our finite approximation? Truncation error: Depends on number of observations N and truncation level K

Choosing between the seven representations How close is our finite approximation? Truncation error: Depends on number of observations N and truncation level K As N gets larger, error increases

Choosing between the seven representations How close is our finite approximation? Truncation error: Depends on number of observations N and truncation level K ε As N gets larger, error increases As K gets larger, error decreases

Choosing between the seven representations How close is our finite approximation? Truncation error: Depends on number of observations N and truncation level K ε As N gets larger, error increases As K gets larger, error decreases Cannot evaluate exactly , so we develop new upper bounds

Protobound Leads to all the other truncation error bounds in this work Lemma (H., Campbell, How, Broderick). The truncation error i.e. P( whoops! )

Truncated Random Measures Jonathan Huggins MIT CSAIL and Dept. of - PowerPoint PPT Presentation

Truncated Random Measures Jonathan Huggins MIT CSAIL and Dept. of EECS with: T. Campbell, J. How, T. Broderick What leads to a statistical method being used for science? What leads to a statistical method being used for science? 1.

Truncated Differentials Lars R. Knudsen June 2014 Lars R. Knudsen Truncated Differentials

On truncated discrete moment problems Tobias Kuna University of Reading, UK (Joint work with

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Minimax risk of truncated series estimators over symmetric convex polytopes Adel Javanmard

Bivariate Truncated Moment Problems with Algebraic Variety in the Nonnegative Quadrant in R 2

Truncated Sums, Matrix Iteration Giacomo Boffi

Asymmetric truncated Toeplitz operators of rank one Bartosz anucha Maria Curie-Skodowska

Truncated Sums, Matrix Iteration Giacomo Boffi

Constructing dependent random probability measures from completely random measures Changyou Chen 1

Chapter 2: Random Variables In this chapter we will cover: 1. Discrete Random variables, ( 2.1

Random Numbers, Files, and Onwards Random Numbers Computers cannot produce truly random numbers.

Transitional Measures Introduction to Regulatory Measures 1 Why Regulatory Measures ?

Random permutations with logarithmic cycle Random permutations weights Classical measures The

Discrete Random Variables October 7, 2010 Discrete Random Variables Random Variables In many

Chapter 9 Object recognition Random Forests 9.9 Random forests 2 9.9 Random forests

Back to Random Walks on Graphs Random walk on a graph: Stationary distribution: Back to Random

Practical Parallel Array Fusion with Repa (Workshop) Ben Lippmeier University of New South Wales

Watershed Below TMDL Threshold At TMDL Threshold Above TMDL Threshold Water Quality Overview

Synthesis of A2 Katie Skalak, Dorothy Merritts, John Brakebill A2 Panelists : Jim Pizzuto

Enterprise Java Beans (EJB) MIE456 Tutorial Agenda What is EJB How does EJB work?

LENDING: TRANSITION TO A LOW-CARBON ECONOMY CECILE MOITRY BNP PARIBAS GREEN FINANCE CONFERENCE

for cryptocurrency and blockchain assets. With our unique exchange facility we offer: Secure &

A Framework for Debt-Maturity Management Saki Bigio (UCLA) Galo Nuo (Bank of Spain) Juan

The Problem Domain Story: As a Investment Bank, we need meet all BCBS239 and associated

Sambuz

Useful Links

Newsletter

Mail Us

Truncated Random Measures Jonathan Huggins MIT CSAIL and Dept. of - PowerPoint PPT Presentation

Truncated Random Measures Jonathan Huggins MIT CSAIL and Dept. of EECS with: T. Campbell, J. How, T. Broderick What leads to a statistical method being used for science? What leads to a statistical method being used for science? 1.

Truncated Differentials Lars R. Knudsen June 2014 Lars R. Knudsen Truncated Differentials

On truncated discrete moment problems Tobias Kuna University of Reading, UK (Joint work with

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Minimax risk of truncated series estimators over symmetric convex polytopes Adel Javanmard

Bivariate Truncated Moment Problems with Algebraic Variety in the Nonnegative Quadrant in R 2

Truncated Sums, Matrix Iteration Giacomo Boffi

Asymmetric truncated Toeplitz operators of rank one Bartosz anucha Maria Curie-Skodowska

Truncated Sums, Matrix Iteration Giacomo Boffi

Constructing dependent random probability measures from completely random measures Changyou Chen 1

Chapter 2: Random Variables In this chapter we will cover: 1. Discrete Random variables, ( 2.1

Random Numbers, Files, and Onwards Random Numbers Computers cannot produce truly random numbers.

Transitional Measures Introduction to Regulatory Measures 1 Why Regulatory Measures ?

Random permutations with logarithmic cycle Random permutations weights Classical measures The

Discrete Random Variables October 7, 2010 Discrete Random Variables Random Variables In many

Chapter 9 Object recognition Random Forests 9.9 Random forests 2 9.9 Random forests

Back to Random Walks on Graphs Random walk on a graph: Stationary distribution: Back to Random

Practical Parallel Array Fusion with Repa (Workshop) Ben Lippmeier University of New South Wales

Watershed Below TMDL Threshold At TMDL Threshold Above TMDL Threshold Water Quality Overview

Synthesis of A2 Katie Skalak, Dorothy Merritts, John Brakebill A2 Panelists : Jim Pizzuto

Enterprise Java Beans (EJB) MIE456 Tutorial Agenda What is EJB How does EJB work?

LENDING: TRANSITION TO A LOW-CARBON ECONOMY CECILE MOITRY BNP PARIBAS GREEN FINANCE CONFERENCE

for cryptocurrency and blockchain assets. With our unique exchange facility we offer: Secure &amp;

A Framework for Debt-Maturity Management Saki Bigio (UCLA) Galo Nuo (Bank of Spain) Juan

The Problem Domain Story: As a Investment Bank, we need meet all BCBS239 and associated

Sambuz

Useful Links

Newsletter

Mail Us

for cryptocurrency and blockchain assets. With our unique exchange facility we offer: Secure &