Practical Methodology for Deploying Machine Learning Ian Goodfellow (An homage to “Advice for Applying Machine Learning” by Andrew Ng)
What drives success in ML? • Arcane knowledge of dozens of obscure algorithms? • Mountains of data? • Knowing how to apply 3-4 standard techniques? [Figure: a small deep network with visible units v1-v3 and two layers of hidden units]
Street View Transcription (Goodfellow et al., 2014)
3-Step Process • Use the application's needs to define metric-based goals • Build an end-to-end system • Data-driven refinement
Identify needs • High accuracy or low accuracy? • Surgery robot: high accuracy • Celebrity look-alike app: low accuracy
Choose Metrics • Accuracy? (% of examples correct) • Coverage? (% of examples processed) • Precision? (% of detections that are right) • Recall? (% of objects detected) • Amount of error? (For regression problems)
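To make these concrete, here is a minimal sketch (not from the talk) of how the listed metrics can be computed, assuming NumPy arrays of predictions and labels where the special value -1 marks examples the system declined to process:

```python
import numpy as np

def metrics(pred, truth, positive=1):
    """Compute the four slide metrics for a detector that may abstain.
    pred == -1 marks examples the system declined to process."""
    processed = pred != -1
    coverage = processed.mean()                              # % of examples processed
    accuracy = (pred[processed] == truth[processed]).mean()  # % of processed examples correct
    detections = processed & (pred == positive)
    precision = (truth[detections] == positive).mean()       # % of detections that are right
    recall = detections[truth == positive].mean()            # % of true objects detected
    return coverage, accuracy, precision, recall
```

Note that precision is undefined when the system makes no detections; a production metric would need to handle that edge case.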
End-to-end system • Get up and running ASAP • Build the simplest viable system first • Which baseline to start with, though? • Copy the state of the art from a related publication
Deep or not? • Lots of noise, little structure -> not deep • Little noise, complex structure -> deep • Good shallow baseline: • Use what you know • Logistic regression, SVM, and boosted trees are all good
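A sketch of such a shallow baseline (assuming scikit-learn; the synthetic data is a stand-in for your own features and labels):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in data; substitute your own feature matrix and labels.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Logistic regression: one of the shallow baselines named on the slide.
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("shallow baseline accuracy:", clf.score(X_test, y_test))
```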
What kind of deep? • No structure -> fully connected • Spatial structure -> convolutional • Sequential structure -> recurrent
Fully connected baseline • 2-3 hidden layer feedforward network • AKA “multilayer perceptron” • Rectified linear units • Dropout • SGD + momentum
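A minimal sketch of this baseline (assuming PyTorch; the 784/512/10 layer sizes are placeholders for MNIST-shaped data):

```python
import torch
import torch.nn as nn

# 2-hidden-layer feedforward net with rectified linear units and dropout,
# trained by SGD with momentum, as listed on the slide.
model = nn.Sequential(
    nn.Linear(784, 512), nn.ReLU(), nn.Dropout(0.5),
    nn.Linear(512, 512), nn.ReLU(), nn.Dropout(0.5),
    nn.Linear(512, 10),
)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
loss_fn = nn.CrossEntropyLoss()

def train_step(x, y):
    """One SGD update on a minibatch (x: float inputs, y: integer labels)."""
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```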
Convolutional baseline • Inception • Batch normalization • Fallback option: • Rectified linear convolutional net • Dropout • SGD + momentum
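A sketch of the fallback option (assuming PyTorch and 32x32 RGB inputs with 10 classes; Inception itself is a far larger architecture than this):

```python
import torch
import torch.nn as nn

# Rectified linear conv net with batch normalization, dropout, and
# SGD + momentum, matching the fallback bullets on the slide.
model = nn.Sequential(
    nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.BatchNorm2d(32), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.BatchNorm2d(64), nn.ReLU(), nn.MaxPool2d(2),
    nn.Flatten(), nn.Dropout(0.5), nn.Linear(64 * 8 * 8, 10),
)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
```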
Recurrent baseline • LSTM • SGD • Gradient clipping • High forget gate bias [Figure: LSTM cell showing input, input gate, forget gate, state self-loop, and output gate]
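A sketch of these pieces together (assuming PyTorch; input and hidden sizes are placeholders). PyTorch packs each LSTM bias vector in input/forget/cell/output gate order, so the forget-gate bias is the second quarter:

```python
import torch
import torch.nn as nn

# LSTM trained with SGD, gradient clipping, and a high initial
# forget-gate bias, as listed on the slide.
lstm = nn.LSTM(input_size=100, hidden_size=256, batch_first=True)

# A positive forget-gate bias makes the cell remember by default.
for name, param in lstm.named_parameters():
    if "bias" in name:
        n = param.size(0) // 4
        param.data[n:2 * n].fill_(1.0)

optimizer = torch.optim.SGD(lstm.parameters(), lr=0.1)

def update(loss):
    """One SGD step with gradient-norm clipping (max_norm is a placeholder)."""
    optimizer.zero_grad()
    loss.backward()
    nn.utils.clip_grad_norm_(lstm.parameters(), max_norm=5.0)
    optimizer.step()
```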
Data-driven adaptation • Choose what to do based on data • Don’t believe the hype • Measure train and test error • “Overfitting” versus “underfitting”
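A sketch of the decision rule this implies, drawing on the two slides that follow (the 5% error goal is a hypothetical placeholder, not from the talk):

```python
def diagnose(train_error, test_error, goal=0.05):
    """Compare train and test error before choosing a fix.
    High train error -> underfitting; large train/test gap -> overfitting."""
    if train_error > goal:
        return "underfitting: tune the optimizer or make the model bigger"
    if test_error - train_error > goal:
        return "overfitting: add augmentation, dropout, or more data"
    return "goal met"
```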
High train error • Inspect data for defects • Inspect software for bugs • Don’t roll your own unless you know what you’re doing • Tune learning rate (and other optimization settings) • Make model bigger
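For the learning rate in particular, a log-scale sweep is a common first move; a sketch, where train_and_eval is a hypothetical helper that trains briefly at the given rate and returns the resulting training error:

```python
# train_and_eval is a hypothetical helper, assumed to be defined elsewhere.
for lr in [1.0, 0.3, 0.1, 0.03, 0.01, 0.003, 0.001]:
    print(f"lr={lr:<6g} train error={train_and_eval(lr):.3f}")
```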
Checking data for defects • Can a human process it? [Example: a Street View house-number image a human can read as “26624”]
Effect of Depth [Figure: test accuracy (%) vs. number of hidden layers, from 3 to 11; accuracy climbs from roughly 92% to 96.5% with increasing depth]
High test error • Add dataset augmentation • Add dropout • Collect more data
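A sketch of dataset augmentation (assuming torchvision; the crop and flip parameters suit 32x32 natural images):

```python
import torchvision.transforms as T

# Each epoch sees a different random crop/flip of every image, which acts
# like a larger training set at the cost of zero new labels.
augment = T.Compose([
    T.RandomCrop(32, padding=4),
    T.RandomHorizontalFlip(),
    T.ToTensor(),
])
```

Pick augmentations that preserve the label: horizontal flips suit natural images but would be a poor choice for digit transcription, where flipping changes the answer.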
Increasing training set size [Figure: error (MSE) vs. number of training examples on a log scale, showing train and test error for a quadratic model and for a model of optimal capacity (polynomial degree); test error approaches Bayes error as the training set grows]
Deep Learning textbook • Ian Goodfellow, Yoshua Bengio, and Aaron Courville • goodfeli.github.io/dlbook