Use and Limitations of Machine Learning in Portfolio Management

Overview 1. Brief Introduction to Learning 2. Prediction - “Futurecasting” - “Nowcasting” - factor analysis 3. Similarity Measures - recommendation system 4. Generating Synthetic Datasets

A Brief Introduction to Learning Learning: Y|X To each problem its solution • Regression: E[Y|X=x] • What we want to know from Y • Dimensionality of the data (X and Y) • Classification: P(Y=y|X=x) • Signal to noise of the data • Synthetic data generation: • Risk function Y|X=x • Stationarity • Etc.

An Introduction to Statistical Learning Great overview of classic machine learning techniques with examples of code in R

Prediction Methods Used • OLS Regression • Lasso, Ridge, Elastic Net • Kernel Regression • Trees • Neural Nets • Random Forests • SVMs • Etc.

Prediction - Things to Consider • Linear versus non-linear • Dimensionality of the data • Density of the data • Signal to noise • Risk function • Interpretability • Over-fitting

Prediction - “Futurecasting” • No access to contemporaneous data • Very difficult to do • Markets tend to be efficient • Signal to noise ratio is poor • It is difficult to beat naïve predictors • Boosted Trees is the leader at the moment

May 2017 Big Data and AI Strategies Good overview of the current use of machine learning in alpha generation and more Big Data and AI Strategies Machine Learning and Alternative Data Approach to Investing Quantitative and Derivatives Strategy Marko Kolanovic, PhD AC marko.kolanovic@jpmorgan.com Rajesh T. Krishnamachari, PhD rajesh.tk@jpmorgan.com See page 278 for analyst certification and important disclosures, including non-US analyst disclosures. Completed 18 May 2017 04:15 PM EDT Disseminated 18 May 2017 04:15 PM EDT This document is being provided for the exclusive use of LOGAN SCOTT at JPMorgan Chase & Co. and clients of J.P. Morgan.

Prediction - “Nowcasting” • Access to contemporaneous data • Important data that is published with a lag or a low frequency • Generating replicating portfolios (Stat Arb) • Live estimates of - ERP - GDP - Macroeconomic indicators - Etc.

Prediction - Factor Analysis • p: number of predictors • n: number of observation • It used to be n>>p - OLS was useful • It is now p>n (zoo of factors) - curse of dimension ▪ dimensionality reduction, PCA, clustering, etc. ▪ best subset, Lasso, Ridge, etc. ▪ K-fold cross validation • Also useful for hedging

Similarity Measures Useful For • Manager selection • Stock selection • Style drift detection

Similarity Measures Methods Used • PCA • Hierarchical Clustering • K-means • Supervised classifiers • Etc. Used For • Alternative data • Big data • Improving analyst’s productivity

Similarity Measures - Things to Consider • Supervised - labeling the target variable and letting the learner infer useful predictors • Unsupervised - choosing predictors where “closeness” is of interest and letting the algorithm do the clustering • Non stationarity of data • Renormalization • Availability of data for back testing

Generating Synthetic Data Useful For • Scenario analysis • Stress testing • Risk budgeting • Option pricing • OOS testing Could be Useful For • Training data for data intensive learners (deep learning, reinforcement learning, etc.) • Testing systematic strategies

Generating Synthetic Data Methods Used • Fitting of parametric models - distributions (poisson, normal, cauchy, etc.) - DGP (EWMA, GARCH, variance gamma process, etc.) • Kernel density estimation • Eigen vector decomposition • Factor analysis • Auto Encoders • LSTM NN

Generating Synthetic Data - Things to Consider • Single versus multivariate inputs • Single versus multivariate outputs • Conditional versus unconditional outputs • Linear versus non-linear relationships • Bulk versus tails of the distribution • Interpretability

Use and Limitations of Machine Learning in Portfolio Management - PowerPoint PPT Presentation

Use and Limitations of Machine Learning in Portfolio Management Overview 1. Brief Introduction to Learning 2. Prediction - Futurecasting - Nowcasting - factor analysis 3. Similarity Measures - recommendation system 4. Generating

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Machine Learning - Intro Aarti Singh Machine Learning 10-701/15-781 Sept 8, 2010 You tell me

Machine learning for finance Nathan George Data Science Professor DataCamp Machine Learning

APPLIED MACHINE LEARNING Methods for Clustering K-means, Soft K-means DBSCAN 1 MACHINE

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

Interim Results Presentation September 2015 1 GDP growth weakened in Q2 to 1.2% y/y &

BILLE KINGDOM ECONOMIC DEVELOPMENT2040 PRESENTED BY THE ECONOMIC DEVELOPMENT SUB-COMMITTEE@ TH

FALLING RAIN ESTATE, PHASE ONE Bamah Nissi Multilinks Limited FALLING RAIN ESTATE: A 40 hectare

J UNE 2019 Introduction We are an indigenous Nigerian upstream have a highly experienced

Analyzing the Commercial Value of Movies Meng Zhang, Yuntao Lu, Jiaxin Li Introduction

EMA EFPIA workshop EMA EFPIA workshop Break- -out session no. 4 out session no. 4 Break

Data-Parallel Halo Finding with Variable Linking Lengths Conference Paper November 2014 DOI:

Explanatory Session for Fiscal Year Ended March 2006 June 2006 Leopalace21 Corporation This

Sambuz

Useful Links

Newsletter

Mail Us

Use and Limitations of Machine Learning in Portfolio Management - PowerPoint PPT Presentation

Use and Limitations of Machine Learning in Portfolio Management Overview 1. Brief Introduction to Learning 2. Prediction - Futurecasting - Nowcasting - factor analysis 3. Similarity Measures - recommendation system 4. Generating

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Machine Learning - Intro Aarti Singh Machine Learning 10-701/15-781 Sept 8, 2010 You tell me

Machine learning for finance Nathan George Data Science Professor DataCamp Machine Learning

APPLIED MACHINE LEARNING Methods for Clustering K-means, Soft K-means DBSCAN 1 MACHINE

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

Interim Results Presentation September 2015 1 GDP growth weakened in Q2 to 1.2% y/y &amp;

BILLE KINGDOM ECONOMIC DEVELOPMENT2040 PRESENTED BY THE ECONOMIC DEVELOPMENT SUB-COMMITTEE@ TH

FALLING RAIN ESTATE, PHASE ONE Bamah Nissi Multilinks Limited FALLING RAIN ESTATE: A 40 hectare

J UNE 2019 Introduction We are an indigenous Nigerian upstream have a highly experienced

Analyzing the Commercial Value of Movies Meng Zhang, Yuntao Lu, Jiaxin Li Introduction

EMA EFPIA workshop EMA EFPIA workshop Break- -out session no. 4 out session no. 4 Break

Data-Parallel Halo Finding with Variable Linking Lengths Conference Paper November 2014 DOI:

Explanatory Session for Fiscal Year Ended March 2006 June 2006 Leopalace21 Corporation This

Sambuz

Useful Links

Newsletter

Mail Us

Interim Results Presentation September 2015 1 GDP growth weakened in Q2 to 1.2% y/y &