On the Statistical Rate of Nonlinear Recovery in Generative Models with Heavy-tailed Data
Xiaohan Wei, Zhuoran Yang, and Zhaoran Wang
University of Southern California, Princeton University, and Northwestern University
June 12, 2019
Generative Model vs. Sparsity in Signal Recovery
Classical sparsity: the structure of the signal depends on the choice of basis.
Generative model: an explicit parametrization of the low-dimensional signal manifold.
Previous works: [Bora et al. 2017], [Hand et al. 2018], [Mardani et al. 2017].
Nonlinear Recovery via Generative Models
Given: a generative model G : R^k → R^d and a measurement matrix X ∈ R^{m×d}.
Goal: recover G(θ*) up to scaling from the nonlinear observations y = f(XG(θ*)).
Challenges:
1. High-dimensional recovery: k ≪ d, m ≪ d.
2. Non-Gaussian X and unknown nonlinearity f.
3. The observations y can be heavy-tailed.
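As a concrete illustration of the observation model, the sketch below simulates y = f(XG(θ*)) with a toy zero-bias two-layer ReLU generator, standard Gaussian measurement rows, a monotone link, and additive Student-t noise to make y heavy-tailed. All specific choices here (dimensions, weights, the link f, the noise) are illustrative assumptions, not the paper's experimental setup.

```python
# Illustrative sketch of the observation model y = f(<X_i, G(theta*)>) with
# heavy-tailed observations.  Dimensions, generator weights, the link f, and
# the noise are assumed for illustration only.
import numpy as np

rng = np.random.default_rng(0)
k, d, m = 5, 200, 500                      # latent dim k << d, measurements m << d

# Toy zero-bias two-layer ReLU generator G : R^k -> R^d.
W1 = rng.standard_normal((64, k)) / np.sqrt(k)
W2 = rng.standard_normal((d, 64)) / np.sqrt(64 * d)
def G(theta):
    return W2 @ np.maximum(W1 @ theta, 0.0)

theta_star = rng.standard_normal(k)
signal = G(theta_star)                     # the target G(theta*), recoverable only up to scaling

# Standard Gaussian measurement rows and a monotone link with E f' > 0,
# plus Student-t noise so the observations y are heavy-tailed.
X = rng.standard_normal((m, d))
inner = X @ signal
y = inner + 0.5 * np.sin(inner) + 0.2 * rng.standard_t(df=6, size=m)
```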
Our Method: Stein + Adaptive Thresholding
Suppose the rows of X := [X_1, ..., X_m]^T ∈ R^{m×d} have density p : R^d → R. Define the (row-wise) score transformation:
\[ S_p(X) := \big[S_p(X_1), \dots, S_p(X_m)\big]^\top = \big[\nabla \log p(X_1), \dots, \nabla \log p(X_m)\big]^\top. \]
(First-order) Stein's identity: when E f'(⟨X_i, G(θ*)⟩) > 0,
\[ \mathbb{E}\big[ S_p(X)^\top y \big] \propto G(\theta^*). \]
(Second-order) Stein's identity: when E f''(⟨X_i, G(θ*)⟩) > 0 and δ is a constant,
\[ \mathbb{E}\big[ S_p(X)^\top \mathrm{diag}(y)\, S_p(X) \big] \propto G(\theta^*) G(\theta^*)^\top + \delta \cdot I_{d \times d}. \]
Adaptive thresholding: suppose ‖y_i‖_{L_q} < ∞ for some q > 4, and set τ_m ∝ m^{2/q}; then
\[ \widetilde{y}_i = \mathrm{sign}(y_i) \cdot \big(|y_i| \wedge \tau_m\big), \qquad i \in \{1, 2, \dots, m\}. \]
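Continuing the sketch above, the score transformation and the adaptive truncation are short to write down. For standard Gaussian rows, ∇ log p(x) = −x; the code takes S(x) = x (sign conventions differ across references and only affect the sign of the proportionality constant, and this choice makes the Stein target used later positively proportional to G(θ*)). The moment order q = 5 and the unit constant in τ_m ∝ m^{2/q} are assumptions.

```python
# Continuing the sketch: row-wise score transformation and adaptive
# thresholding.  For standard Gaussian rows, grad log p(x) = -x; we take the
# score to be S(x) = x (sign conventions vary and only flip the sign of the
# proportionality constant in the Stein identities).
def score_gaussian(X):
    return X

S = score_gaussian(X)                      # (m, d) matrix of row-wise scores

q = 5.0                                    # assumed moment order with E|y_i|^q < infinity, q > 4
tau_m = m ** (2.0 / q)                     # tau_m proportional to m^{2/q}; constant set to 1 here
y_trunc = np.sign(y) * np.minimum(np.abs(y), tau_m)
```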
Our Method: Stein + Adaptive Thresholding
Least-squares estimator:
\[ \widehat{\theta} \in \operatorname*{argmin}_{\theta \in \mathbb{R}^k} \Big\| G(\theta) - \tfrac{1}{m}\, S_p(X)^\top \widetilde{y} \Big\|_2^2. \]
Main performance theorem:
Theorem (Wei, Yang, and Wang, 2019). For any accuracy level ε ∈ (0, 1], suppose (1) E f'(⟨X_i, G(θ*)⟩) > 0, (2) the generative model G is a ReLU network with zero bias, and (3) the number of measurements m ∝ k ε^{-2} log d. Then, with high probability,
\[ \bigg\| \frac{G(\widehat{\theta})}{\| G(\widehat{\theta}) \|_2} - \frac{G(\theta^*)}{\| G(\theta^*) \|_2} \bigg\|_2 \le \varepsilon. \]
Similar results hold for more general Lipschitz generators G.
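A minimal sketch of this first-order estimator, continuing the code above: form the empirical Stein target (1/m) S_p(X)^T ỹ and fit θ by plain gradient descent on the squared loss for the toy ReLU generator. The optimizer, step size, and iteration budget are ad hoc illustrative choices; the theorem concerns the minimizer itself, not any particular algorithm.

```python
# Continuing the sketch: first-order (least-squares) estimator.  Gradient
# descent on theta is only one way to attack the nonconvex fit; the step size
# and iteration count are arbitrary illustrative choices.
v = S.T @ y_trunc / m                      # empirical Stein target, approx. c * G(theta*), c > 0

def grad_loss(theta):
    pre = W1 @ theta
    resid = W2 @ np.maximum(pre, 0.0) - v  # residual G(theta) - v
    return 2.0 * W1.T @ ((pre > 0) * (W2.T @ resid))   # chain rule through the ReLU

theta_hat = 0.1 * rng.standard_normal(k)   # random init (theta = 0 is a dead point for ReLU)
for _ in range(2000):
    theta_hat -= 0.1 * grad_loss(theta_hat)

# Recovery is up to scaling, so compare normalized directions.
g_hat, g_star = G(theta_hat), G(theta_star)
err = np.linalg.norm(g_hat / np.linalg.norm(g_hat) - g_star / np.linalg.norm(g_star))
print(f"direction error: {err:.3f}")
```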
Our Method: Stein + Adaptive Thresholding
PCA-type estimator:
\[ \widehat{\theta} \in \operatorname*{argmax}_{\|G(\theta)\|_2 = 1} \; G(\theta)^\top S_p(X)^\top \mathrm{diag}(\widetilde{y})\, S_p(X)\, G(\theta). \]
Main performance theorem:
Theorem (Wei, Yang, and Wang, 2019). For any accuracy level ε ∈ (0, 1], suppose (1) E f''(⟨X_i, G(θ*)⟩) > 0, (2) the generative model G is a ReLU network with zero bias, and (3) the number of measurements m ∝ k ε^{-2} log d. Then, with high probability,
\[ \bigg\| G(\widehat{\theta}) - \frac{G(\theta^*)}{\| G(\theta^*) \|_2} \bigg\|_2 \le \varepsilon. \]
Similar results hold for more general Lipschitz generators G.
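For completeness, here is a sketch of the second-order (PCA-type) route on a separate quadratic observation, since it needs E f'' > 0 (e.g. a phase-retrieval-style link, which is an assumption of this illustration). Because a zero-bias ReLU net is positively homogeneous (G(cθ) = cG(θ) for c > 0), maximizing the Rayleigh quotient over the range of G and normalizing afterwards matches the unit-norm constrained problem; the gradient-ascent solver and its hyperparameters are again illustrative choices only.

```python
# Second-order (PCA-type) estimator, sketched on a quadratic link where
# E f'' > 0 (the first-order identity is uninformative for even links).
y2 = (X @ signal) ** 2                     # illustrative phase-retrieval-style observations
y2_trunc = np.sign(y2) * np.minimum(np.abs(y2), tau_m)

M = (S.T * y2_trunc) @ S / m               # (1/m) S^T diag(y2~) S, a d x d matrix

def grad_rayleigh(theta):
    # Gradient of u^T M u / (u^T u) with u = G(theta), pulled back through the ReLU net.
    pre = W1 @ theta
    u = W2 @ np.maximum(pre, 0.0)
    rq = (u @ M @ u) / (u @ u)
    grad_u = 2.0 * (M @ u - rq * u) / (u @ u)
    return W1.T @ ((pre > 0) * (W2.T @ grad_u))

theta_pca = rng.standard_normal(k)         # random init away from the dead point theta = 0
for _ in range(2000):
    theta_pca += 0.1 * grad_rayleigh(theta_pca)

# Enforce the unit-norm constraint by normalizing the output; the quadratic
# objective is sign-invariant, so compare directions up to sign to be safe.
u_hat = G(theta_pca) / np.linalg.norm(G(theta_pca))
u_star = G(theta_star) / np.linalg.norm(G(theta_star))
err2 = min(np.linalg.norm(u_hat - u_star), np.linalg.norm(u_hat + u_star))
print(f"PCA-type direction error: {err2:.3f}")
```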
Thank you! Poster 198, Pacific Ballroom, 6:30-9:00 pm