Fast, General Parallel Computation for Machine Learning
Robin Elizabeth Yancey and Norm Matloff
University of California at Davis
P2PS Workshop, ICPP 2018
Outline

• Motivation.
• Software Alchemy.
• Theoretical foundations.
• Empirical investigation.
Motivation

Characteristics of machine learning (ML) algorithms:

• Big Data: in an n × p (cases × features) dataset, both n AND p are large.
• Compute-intensive algorithms: sorting, k-NN, matrix inversion, iteration.
• Not generally embarrassingly parallel (EP). (An exception: Random Forests – grow different trees within different processes.)
• Memory problems: the computation may not fit on a single machine (esp. in R or on GPUs).
Parallel ML: Desired Properties

• Simple, easily implementable. (And easily understood by non-techies.)
• As general in applicability as possible.
Software Alchemy

alchemy: The medieval forerunner of chemistry...concerned particularly with attempts to convert base metals into gold... a seemingly magical process of transformation...
Software Alchemy (cont’d.)

• “Alchemical”: converts non-EP problems to statistically equivalent EP problems.
• Developed independently by (Matloff, JSS, 2013) and several others. EP: no programming challenge. :-)
• Not just Embarrassingly Parallel but also Embarrassingly Simple. :-)
Software Alchemy (cont’d.)

• Break the data into chunks, one chunk per process.
• Apply the procedure, e.g. neural networks (NNs), to each chunk, using off-the-shelf SERIAL algorithms.
• In the regression case (continuous response variable), take the final estimate to be the average of the chunked estimates.
• In the classification case (categorical response variable), do “voting.”
• If we have some kind of parametric model (incl. NNs), we can average the parameter values across chunks.
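The recipe above can be sketched in a few lines. The snippet below is a minimal illustration (in Python with NumPy, not the authors' actual partools R code), using ordinary least squares as the off-the-shelf serial estimator and showing chunk-and-average on the regression case; the function names are ours.

```python
import numpy as np

def fit_ols(X, y):
    """Off-the-shelf SERIAL estimator: ordinary least squares."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

def software_alchemy(X, y, n_chunks, fit=fit_ols):
    """Break the data into chunks, fit each chunk serially
    (these fits are independent, hence EP), average the estimates."""
    Xs = np.array_split(X, n_chunks)
    ys = np.array_split(y, n_chunks)
    estimates = [fit(Xc, yc) for Xc, yc in zip(Xs, ys)]
    return np.mean(estimates, axis=0)

# Simulated regression data with known coefficients.
rng = np.random.default_rng(0)
n, p = 10_000, 3
X = rng.standard_normal((n, p))
beta = np.array([1.0, -2.0, 0.5])
y = X @ beta + rng.standard_normal(n)

full = fit_ols(X, y)                            # one big serial fit
chunked = software_alchemy(X, y, n_chunks=8)    # chunk-and-average
print("full:   ", full)
print("chunked:", chunked)
```

In a real deployment the per-chunk fits would run in separate processes (e.g. via `multiprocessing`); here they run in a loop only to keep the sketch short. For classification, the averaging step would be replaced by majority voting over the chunked predictions.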
Theory

• Theorem: Say the rows of the data matrix are i.i.d. and the output of the procedure is asymptotically normal. Then the Software Alchemy estimator is fully statistically efficient, i.e. it has the same asymptotic variance as the estimator computed on the full dataset.
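The theorem can be written a bit more explicitly (the notation here is ours, not from the slides): with the $n$ rows split into $r$ chunks and $\hat{\theta}_i$ the estimator computed on chunk $i$,

```latex
\[
\hat{\theta}_{\mathrm{chunked}} \;=\; \frac{1}{r}\sum_{i=1}^{r}\hat{\theta}_i ,
\qquad
\sqrt{n}\left(\hat{\theta}_{\mathrm{chunked}} - \theta\right)
\;\xrightarrow{d}\; \mathcal{N}(0,\Sigma),
\]
```

where $\Sigma$ is the same asymptotic covariance matrix attained by the estimator fit to all $n$ rows at once.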