

  1. Testing High-Dimensional Distributions: Subcube Conditioning, Random Restrictions, and Mean Testing. Clément Canonne (IBM Research), February 25, 2020. Joint work with Xi Chen, Gautam Kamath, Amit Levi, and Erik Waingarten.

  2. Outline

  3. Introduction • Property Testing • Distribution Testing • Our Problem • Subcube conditioning • Results, and how to get them • Conclusion

  4. Introduction


  5. Property Testing. Sublinear-time, approximate, randomized decision algorithms that make local queries to their input. • Big dataset: too big • Expensive access: pricey data • “Model selection”: many options • “Good enough”: a priori knowledge. Need to infer information (one bit) from the data: quickly, or with very few lookups.

  6. “Is it far from a kangaroo?”

  7. Property Testing. Introduced by [RS96, GGR96]; it has been a very active area since. • Known space (e.g., {0,1}^N) • Property P ⊆ {0,1}^N • Oracle access to unknown x ∈ {0,1}^N • Proximity parameter ε ∈ (0,1]. Must decide x ∈ P vs. dist(x, P) > ε (has the property, or is ε-far from it).
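To make the query model concrete, here is a minimal illustrative sketch (not from the talk) of perhaps the simplest property tester: deciding whether x ∈ {0,1}^N is the all-zeros string or ε-far from it, with O(1/ε) oracle queries and one-sided error.

```python
import math
import random

def test_all_zeros(query, N, eps):
    """Toy property tester: accepts if x = 0^N, rejects w.p. >= 2/3 if x
    is eps-far from it (more than eps*N ones). Illustrative only."""
    # A random probe of an eps-far input hits a 1 w.p. > eps, so
    # ceil(2/eps) probes all miss w.p. < (1 - eps)^(2/eps) < e^{-2} < 1/3.
    for _ in range(math.ceil(2 / eps)):
        if query(random.randrange(N)) == 1:
            return False  # witness found: x is not all-zeros
    return True  # never rejects the all-zeros string (one-sided error)

# Usage: x = [0] * 1000; test_all_zeros(lambda i: x[i], len(x), eps=0.1)
```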


  8. Distribution Testing. Now, our “big object” is a probability distribution over a (finite) domain. • Type of queries: independent samples* • Type of distance: total variation • Type of object: distributions. *Disclaimer: not always, as we will see.

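For reference, the distance in question is total variation; a two-line sketch (ours, for concreteness) of computing it between explicit distributions:

```python
import numpy as np

def total_variation(p, q):
    """TV(p, q) = (1/2) * ||p - q||_1 for explicit probability vectors."""
    return 0.5 * np.abs(np.asarray(p, float) - np.asarray(q, float)).sum()

print(total_variation([0.6, 0.4], [0.5, 0.5]))  # 0.1: biased vs. fair coin
```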

  9. Our Problem


  10. Uniformity testing. We focus on arguably the simplest and most fundamental property: uniformity. Given samples from p: is p = u, or TV(p, u) > ε? Oh, and we would like to do that for high-dimensional distributions.

  11. Uniformity testing: Good News. It is well known ([Pan08, VV14], and then [DGPP16, DGPP18] and more) that testing uniformity over a domain of size N takes Θ(√N/ε²) samples.
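One classical way to get a bound of this kind is the collision statistic (indeed, [DGPP16] show collision-based testers are sample-optimal). A hedged sketch, with an illustrative threshold rather than the tuned constants of those papers:

```python
import numpy as np

def collision_uniformity_test(samples, N, eps):
    """Collision-based uniformity tester (sketch; constants illustrative).

    Under p = u, Pr[two samples collide] = 1/N. If TV(p, u) > eps, then
    ||p||_2^2 = 1/N + ||p - u||_2^2 >= (1 + 4*eps^2)/N (Cauchy-Schwarz),
    so we threshold the empirical collision rate halfway in between.
    """
    m = len(samples)
    _, counts = np.unique(samples, return_counts=True)
    collisions = (counts * (counts - 1) // 2).sum()  # sum_v C(count_v, 2)
    rate = collisions / (m * (m - 1) // 2)
    return rate <= (1 + 2 * eps**2) / N  # True = "looks uniform"
```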

  12. Uniformity testing: Bad News. In the high-dimensional setting (we think of {−1,1}^n with n ≫ 1), that means Θ(2^{n/2}/ε²) samples: exponential in the dimension.

  13. Uniformity testing: Good News. In the high-dimensional setting with structure*, testing uniformity over {−1,1}^n takes Θ(√n/ε²) samples [CDKS17]. *When we assume product distributions.


  14. Uniformity testing: Bad News. We do not want to make any structural assumption: p is, a priori, arbitrary. So what to do?


  15. Subcube Conditioning. A variant of conditional sampling [CRS15, CFGM16], suggested in [CRS15] and first studied in [BC18]: one can specify assignments to any subset of the n bits, and get a sample from p conditioned on those bits being fixed. Very well suited to this high-dimensional setting.
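The access model is from [CRS15, BC18]; the brute-force implementation below (names and the explicit-enumeration strategy are ours, workable only for small n) is just to make the oracle concrete:

```python
import numpy as np

class SubcubeOracle:
    """Subcube conditional sampling oracle (illustrative sketch for small n).

    Wraps an explicit distribution p over {-1,1}^n, stored as a vector of
    2^n probabilities indexed by the bits of the outcome.
    """

    def __init__(self, probs, n, rng=None):
        self.p = np.asarray(probs, dtype=float)
        self.n = n
        self.rng = rng or np.random.default_rng()
        # Precompute the {-1,1}-valued coordinates of each of the 2^n points.
        idx = np.arange(2**n)
        self.points = 1 - 2 * ((idx[:, None] >> np.arange(n)) & 1)

    def sample(self, fixed=None):
        """Draw x ~ p conditioned on x_i = v for each (i, v) in `fixed`.

        `fixed` is a dict {coordinate: value in {-1,+1}}; empty/None means
        an unconditional sample (the standard sampling oracle). Assumes the
        specified subcube has positive probability under p.
        """
        mask = np.ones(len(self.p), dtype=bool)
        for i, v in (fixed or {}).items():
            mask &= self.points[:, i] == v
        cond = self.p * mask
        cond /= cond.sum()  # renormalize on the subcube
        return self.points[self.rng.choice(len(cond), p=cond)]

# Usage: oracle.sample({0: +1, 3: -1}) draws from p given x_0 = +1, x_3 = -1.
```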

  16. Testing Result. [BC18] showed that subcube conditional queries allow uniformity testing with Õ(n³/ε³) samples (no longer exponential!). Surprisingly, we show it is sublinear: Theorem (Main theorem). Testing uniformity with subcube conditional queries has sample complexity Õ(√n/ε²). (An Ω(√n/ε²) lower bound is immediate from the product case.)

  17. Ingredients. This relies on two main ingredients: a structural result analyzing random restrictions of a distribution, and a subroutine for a related testing task, mean testing.

  18. Structural Result (I). Definition (Projection). Let p be any distribution over {−1,1}^n, and S ⊆ [n]. The projection p_S of p on S is the marginal distribution of p on {−1,1}^{|S|}. Definition (Mean). Let p be as above. µ(p) ∈ ℝ^n is the mean vector of p: µ(p) = E_{x∼p}[x].
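Both definitions are mechanical to compute for an explicit distribution; a small sketch (reusing the `points` array of {−1,1}-valued outcomes from the oracle sketch above):

```python
import numpy as np

def mean_vector(probs, points):
    """mu(p) = E_{x~p}[x]: the probability-weighted average outcome."""
    return np.asarray(probs) @ points  # shape (n,)

def projection(probs, points, S):
    """The marginal p_S of p on coordinates S, as {sub-assignment: prob}."""
    marginal = {}
    for pr, row in zip(probs, points[:, sorted(S)]):
        key = tuple(row)
        marginal[key] = marginal.get(key, 0.0) + pr
    return marginal
```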

  19. Structural Result (II). Definition (Restriction). Let p be any distribution over {−1,1}^n, and σ ∈ [0,1]. A random restriction ρ = (S, x) is obtained by (i) sampling S ⊆ [n], including each element i.i.d. with probability σ; (ii) sampling x ∼ p. Conditioning p on agreeing with x on the coordinates outside S gives the distribution p|ρ, a distribution over {−1,1}^S.
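Sampling such a ρ, and then from p|ρ, is exactly what the subcube oracle supports; a short sketch on top of the illustrative SubcubeOracle from earlier:

```python
def random_restriction(oracle, sigma):
    """Draw rho = (S, x) as in the definition: S keeps each coordinate
    i.i.d. w.p. sigma, and x ~ p fixes every coordinate outside S;
    oracle.sample(fixed) then draws from p|_rho over the coordinates in S."""
    rng = oracle.rng
    S = {i for i in range(oracle.n) if rng.random() < sigma}
    x = oracle.sample()  # x ~ p
    fixed = {i: int(x[i]) for i in range(oracle.n) if i not in S}
    return S, fixed

# S, fixed = random_restriction(oracle, sigma=0.5)
# y = oracle.sample(fixed)   # the free coordinates of y follow p|_rho
```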

  20. Structural Result (III). Theorem (Restriction theorem, informal). Let p be any distribution over {−1,1}^n. Then, when p is “hit” by a random restriction ρ as above, E_ρ[ ‖µ(p|ρ)‖₂ ] ≥ σ · E_S[ TV(p_S, u) ].


  21. Structural Result (IV). Theorem (Pisier’s inequality [Pis86, NS02]). Let f: {−1,1}^n → ℝ be s.t. E_x[f(x)] = 0. Then E_{x∼{−1,1}^n}[ |f(x)| ] ≲ log n · E_{x,y∼{−1,1}^n}[ | Σ_{i=1}^n y_i x_i L_i f(x) | ]. Theorem (Robust version). Let f: {−1,1}^n → ℝ be s.t. E_x[f(x)] = 0, and let G = ({−1,1}^n, E) be any orientation of the hypercube. Then E_{x∼{−1,1}^n}[ |f(x)| ] ≲ log n · E_{x,y∼{−1,1}^n}[ | Σ_{i∈[n]: (x,x^{(i)})∈E} y_i x_i L_i f(x) | ].


  22. Mean Testing Result (I). Consider the following question: from i.i.d. (“standard”) samples from p on {−1,1}^n, distinguish (i) p = u and (ii) ‖µ(p)‖₂ > ε. Remarks: this is no harder than uniformity testing, and one can ask the same for Gaussians: p = N(0_n, I_n) vs. p = N(µ, Σ) with ‖µ(p)‖₂ > ε.

  23. Mean Testing Result (II). Theorem (Mean Testing theorem). For ε ∈ (0,1], ℓ₂ mean testing has (standard) sample complexity Θ(√n/ε²), for both the Boolean and Gaussian cases.


  24. Mean Testing Result (III). Main idea: use a nice unbiased estimator that works well in the product case: Z = ⟨ (1/m) Σ_{j=1}^m X^{(2j−1)}, (1/m) Σ_{j=1}^m X^{(2j)} ⟩, with E[Z] = ‖µ(p)‖₂² and Var[Z] ≈ ‖Σ(p)‖²_F.
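A direct sketch of this statistic (ours; the split into odd- and even-indexed samples matches the formula above):

```python
import numpy as np

def mean_test_statistic(X):
    """Z = < (1/m) sum_j X^{(2j-1)}, (1/m) sum_j X^{(2j)} > for samples
    X of shape (2m, n) with entries in {-1,+1}. The two halves are
    independent, so E[Z] = ||mu(p)||_2^2 exactly (unbiased)."""
    odd, even = X[0::2], X[1::2]
    return odd.mean(axis=0) @ even.mean(axis=0)

# Under the uniform distribution, Z concentrates around 0:
rng = np.random.default_rng(0)
print(mean_test_statistic(rng.choice([-1, 1], size=(200, 50))))
```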
