Machine Learning in Economic Analysis
Serena Ng, Columbia University and NBER
September 2016
Machine Learning: What's in it for Economics
Becker Friedman Institute, University of Chicago
Two creative papers:
- Structural estimation of discrete choice models using random projections to reduce data dimension; 3000+ combinations of soft drinks/store.
- Analysis of connectedness using a regularized SVAR; connectedness has a spatial and a cyclical component.
Structural analysis (β̂) or prediction (ŷ)? Explore global banks data using ML methods.
Matrix Sketching: Ã = A S, where A is m × n and S is n × k, so Ã is m × k.
Goal: given A of high dimension, map it to a lower dimension while preserving the structural features of A.
PCA:
- chooses a small number of directions in which the original data have high variance.
- preserves average pairwise distance, but a few distances can be drastically violated.
- relation to factor models; statistical properties can be analyzed.
- but even a partial SVD can be computationally expensive.
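A minimal numpy sketch of the PCA case: a truncated SVD keeps the top-k high-variance directions of a low-rank-plus-noise matrix. The dimensions, rank, and noise scale below are illustrative assumptions, not taken from the papers under discussion.

```python
import numpy as np

rng = np.random.default_rng(0)
# Low-rank-plus-noise data: m observations of n features, signal rank r
m, n, r = 500, 50, 3
A = rng.standard_normal((m, r)) @ rng.standard_normal((r, n)) \
    + 0.1 * rng.standard_normal((m, n))

# Partial SVD: keep the k directions of highest variance
k = 3
U, s, Vt = np.linalg.svd(A, full_matrices=False)
A_pca = U[:, :k] * s[:k]          # m x k sketch (principal-component scores)

# Fraction of total variance captured by the top-k directions
retained = (s[:k] ** 2).sum() / (s ** 2).sum()
print(f"variance retained by k={k} PCs: {retained:.3f}")
```

Because the signal is rank 3 and the noise is small, the 3-dimensional sketch retains almost all of the variance; a full SVD of a truly large A is exactly the step the slide flags as expensive.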
Random projections (RP):
- preserve all (n choose 2) pairwise distances of the data points.
- may sacrifice overall variance.
- worst-case error bounds; optimal from an algorithmic perspective.
Random Projections
Linear algebra: a projection is a linear transformation P from a vector space to itself such that P = P². E.g. if A = U Σ V^T = QR, then P = U U^T = Q Q^T. P has eigenvalues 0 or 1, and P is idempotent.
The 'projection' in RP is somewhat different:
- if [P]_ij is iid Gaussian, the range of P^T P is a uniformly distributed subspace, but the eigenvalues ∉ {0, 1}.
- if [P]_ij is ±1, P is approximately unit length and approximately orthogonal.
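The contrast can be checked numerically: a classical projection P = QQ^T is exactly idempotent with {0, 1} eigenvalues, while a scaled Gaussian RP matrix is only approximately orthogonal. The sizes below are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Classical projection: P = QQ^T from A = QR is idempotent, eigenvalues in {0, 1}
A = rng.standard_normal((10, 4))
Q, _ = np.linalg.qr(A)
P = Q @ Q.T
idempotency_gap = np.abs(P @ P - P).max()
eigs = np.linalg.eigvalsh(P)

# A scaled Gaussian RP matrix S is only *approximately* orthogonal: S'S ~ I_k
n, k = 1000, 200
S = rng.standard_normal((n, k)) / np.sqrt(n)
orth_gap = np.abs(S.T @ S - np.eye(k)).max()
print(f"idempotency gap: {idempotency_gap:.2e}, RP orthogonality gap: {orth_gap:.3f}")
```

The first gap is at machine precision; the second is of order sqrt(k/n), visibly nonzero, which is exactly why the RP 'projection' has eigenvalues away from {0, 1}.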
Informal argument for RP: we have m points in R^n and want to put them in R^k.
Naive approach: choose k columns uniformly at random.
- if features are spread out (uniformity), this works well.
- if some columns contribute more and we do not find them, the approximation will be poor.
Idea of random projections: randomly rotate the original data to get a new random basis. In that basis, the vectors are roughly uniformly spread out.
RP Implementations: choice of S
- Dense: S_ij ~ N(0, 1).
- Sparse: S_ij = ±1 with probability 1/(2s) each, and 0 with probability 1 − 1/s.
- SRHT, count sketch, and many alternatives.
Sketching error and k: with probability at least 1/2, all pairwise distances will be preserved if, for ε ∈ (0, 1/2), k ∝ log(m)/(ε² − ε³).
- k is logarithmic in m but does not depend on n.
- worst-case approximation error depends on ε.
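The Johnson-Lindenstrauss guarantee above can be exercised directly: project m points from a high dimension down to a k of order log(m)/(ε² − ε³) and check every pairwise distance. The constant 8 in the formula for k and the problem sizes are illustrative assumptions.

```python
import numpy as np
from itertools import combinations

rng = np.random.default_rng(0)
m, n = 50, 5000                 # m points in high dimension n
X = rng.standard_normal((m, n))

eps = 0.25
k = int(np.ceil(8 * np.log(m) / (eps**2 - eps**3)))   # JL-style target dimension

# Dense Gaussian sketch, scaled so squared lengths are preserved in expectation
S = rng.standard_normal((n, k)) / np.sqrt(k)
Y = X @ S                       # m x k sketched data

# Check that every pairwise distance is preserved within a factor (1 ± eps)
ratios = []
for i, j in combinations(range(m), 2):
    ratios.append(np.linalg.norm(Y[i] - Y[j]) / np.linalg.norm(X[i] - X[j]))
ratios = np.asarray(ratios)
print(f"k = {k}, distance ratios in [{ratios.min():.3f}, {ratios.max():.3f}]")
```

Note that k depends on m (the number of points) and ε only, not on the ambient dimension n, matching the slide's claim.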
Remarks
1. Projected data have no interpretation. Do we care?
2. Do favorable worst-case errors imply a favorable MSE(β̂)?
Linear regression: y = Xβ₀ + e, min_β ||Sy − SXβ||, with S a random sampling/rescaling matrix. Let W = S'S. Then β̂_W = (X^T W X)^{-1} X^T W y, which depends on the random weights. Some issues:
- Like GLS, but finite-sample properties are not known; GLS improves efficiency, while here the weighting adds noise.
- Is strict exogeneity satisfied? Is E[e*_i | X*_1, ..., X*_n] = 0?
- Do we care about mse(β̂) or mse(ŷ)?
- We know little even in point-identified models.
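A small sketched-regression example makes the weighted estimator concrete: with a Gaussian S, solving min_β ||Sy − SXβ|| is the same as weighted LS with W = S'S, and β̂_W lands near but not on β̂_OLS. Problem sizes and the sketch dimension are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 10_000, 5
X = rng.standard_normal((n, p))
beta0 = np.arange(1.0, p + 1)
y = X @ beta0 + rng.standard_normal(n)

# OLS on the full data
beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)

# Sketched LS: min_beta ||Sy - SX beta|| with a k x n Gaussian sketch.
# Since W = S'S, this equals (X'WX)^{-1} X'Wy without forming the n x n W.
k = 500
S = rng.standard_normal((k, n)) / np.sqrt(k)
beta_w, *_ = np.linalg.lstsq(S @ X, S @ y, rcond=None)

gap = np.linalg.norm(beta_w - beta_ols)
print(f"||beta_w - beta_ols|| = {gap:.4f}")
```

The gap is pure sketching noise: rerunning with a fresh S moves β̂_W, which is the slide's point that the weighting adds noise rather than improving efficiency.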
An RP alternative: random sampling of A = U Σ V^T, with A m × n, m ≫ n. Choose k rows.
- Select representative rows to capture the structure of U.
- Statistical leverage scores: ℓ_i = ||U_i||², i = 1, ..., m.
- Importance sampling distribution: p_i = ℓ_i / n.
- ℓ_i = H_ii = [A(A'A)^{-1}A']_ii, the hat matrix: choose rows with large influence to account for non-uniformity.
- Error bound: ||U^T U − Ũ^T Ũ||₂ < ε.
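The two formulas for ℓ_i (row norms of U, and the hat-matrix diagonal) are easy to verify, and planting a few influential rows shows how the importance-sampling distribution concentrates on them. The planted-row construction is an illustrative assumption.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 2000, 4
A = rng.standard_normal((m, n))
A[:20] *= 10.0                      # plant a few high-leverage rows

# Leverage scores l_i = ||U_i||^2 from the thin SVD; they sum to n
U, _, _ = np.linalg.svd(A, full_matrices=False)
lev = (U ** 2).sum(axis=1)

# Equivalently, l_i = H_ii, the diagonal of the hat matrix A(A'A)^{-1}A'
H_diag = np.einsum("ij,ij->i", A @ np.linalg.inv(A.T @ A), A)

# Importance-sampling distribution and a leverage-based row sample
p = lev / n
k = 200
idx = rng.choice(m, size=k, replace=True, p=p)
print("probability mass on the 20 inflated rows:", round(p[:20].sum(), 3))
```

The 20 inflated rows (1% of the data) carry a large share of the sampling mass, which is exactly the non-uniformity that uniform column/row sampling would miss.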
Regression: (y, X), n ≫ p. Choose r rows. (Drineas et al., 2011): if r = O(f(p, ε, δ)), then with probability > 1 − δ,
||β̂_W − β̂||₂ ≤ (ε / σ_min(X)) ||y − X β̂||₂.
Sampling with replacement: w_i ~ scaled multinomial with E[w_i] = 1, and β̂_OLS = β̂_W(1). A Taylor-series expansion of β̂_W around w₀ = 1 gives
E_W[β̂_W | y] = β̂_OLS + E_W[R_W].
R_W depends on the sampling process, and var_W(β̂_W | y) decreases with the number of rows selected. Favorable algorithmic properties (worst-case error bounds) may not translate into good statistical properties (MSE) (Ma, Mahoney, Yu 2015).
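A Monte Carlo sketch of the claim that var_W(β̂_W | y) falls with the number of rows selected. This uses the simplest case, uniform sampling probabilities (so the importance weights 1/(r p_i) are constant and cancel); the leverage-weighted variants studied by Ma, Mahoney, Yu are not implemented here, and all sizes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 5000, 3
X = rng.standard_normal((n, p))
y = X @ np.ones(p) + rng.standard_normal(n)
beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)

# beta_W(r): sample r rows with replacement under uniform p_i = 1/n
def beta_w(r):
    idx = rng.choice(n, size=r, replace=True)
    b, *_ = np.linalg.lstsq(X[idx], y[idx], rcond=None)
    return b

# Monte Carlo estimate of the conditional sampling variance of beta_W
def spread(r, reps=200):
    draws = np.array([beta_w(r) for _ in range(reps)])
    return draws.var(axis=0).sum()

v_small, v_large = spread(100), spread(1000)
print(f"var_W(beta_W | y): r=100 -> {v_small:.4f}, r=1000 -> {v_large:.4f}")
```

Conditional on the data, β̂_W scatters around β̂_OLS purely through the random weights, and the scatter shrinks roughly in proportion to 1/r.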
Summary of subspace sampling methods
- Random projections: S is data-oblivious; uniformize the data, then sample. The projected data are linear combinations of the original data.
- Leverage score sampling: S depends on the data through the leverage scores. The columns of the submatrix are columns of A.
Suggestions and questions:
- If the columns contribute uniformly, one can just sample uniformly at random.
- Document the properties of the data? β is homogeneous.
- How much data to use? Aggregate?
- Understand a well-identified example first?
Using these methods for summarizing data
Much is still unknown about the statistical implications of subspace sampling methods for β̂. How useful are they in describing the data (ŷ)?
Global banking data (96 banks, 2675 days). Four observations:
- clusters
- row and column leverage scores
- common factors or network spillovers?
- connectedness: top-down or bottom-up?
Frank Diebold kindly provided the data.
1. Kmeans
Group 1: Canada/US (23)
jpm bac c wfc ms bk.us pnc.us cof stt.us fitb.us rf.us sti.us gs usb axp bbt mqg.au na.t td.t ry.t bns.t bmo.t cm.t
Group 2: Europe (37)
hsba.ln bnp.fr dbk.xe barc.ln aca.fr gle.fr rbs.ln san.mc inga.ae lloy.ln ucg.mi ubsn.vx csgn.vx ndasek.sk isp.mi bbva.mc cbk.xe stan.ln danske.ko dnb.os shba.sk seba.sk kbc.bt sweda.sk ebs.vi bmps.mi sab.mc pop.mc bir.db bp.mi aib.db ete.at poh1s.he uni.mi bcp.lb bes.lb mb.mi
Suffixes: .ln = UK, .fr = France, .ae = Netherlands, .db = Ireland, .vx = Switzerland, .lb = Portugal, .xe = Germany, .vi = Austria, .ko = Denmark, .mi = Italy.
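The kind of regional grouping shown above can be reproduced in spirit with a small k-means sketch. The bank data are not available here, so this uses hypothetical two-factor synthetic series (23 "North American", 37 "European") whose correlation rows are the clustering features; the block structure, factor model, and deterministic initialization are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical stand-in for the bank data: two blocks of series, each
# loading on its own common factor, mimicking a regional split
T = 500
f1, f2 = rng.standard_normal(T), rng.standard_normal(T)
block1 = f1[:, None] + 0.5 * rng.standard_normal((T, 23))
block2 = f2[:, None] + 0.5 * rng.standard_normal((T, 37))
R = np.corrcoef(np.hstack([block1, block2]).T)   # 60 x 60 correlation features

# Plain Lloyd's k-means with k = 2, seeded at one row from each block
# for reproducibility (a real run would use random restarts)
def kmeans(X, k, init, iters=50):
    centers = X[list(init)].copy()
    labels = np.zeros(len(X), dtype=int)
    for _ in range(iters):
        d = ((X[:, None, :] - centers[None]) ** 2).sum(-1)
        labels = d.argmin(1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(0)
    return labels

labels = kmeans(R, 2, init=(0, 59))
print("cluster sizes:", np.bincount(labels))
```

With a clean two-factor structure the algorithm recovers the 23/37 split exactly; whether the real bank clusters reflect common factors or network spillovers is the open question posed earlier.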