Introduction to Machine Learning Engineering Chicago ML February 27, 2019 Garrett Smith
New! https://chicago.ml
@guildai
Introduction What is machine learning? Theory Tools
Introduction What is machine learning? Credit: vas3k.com
Introduction What is machine learning engineering?
- Infrastructure: facilities and tools for research and engineering
- Research: data analysis; data processing and preparation; model selection; training a model
- Production: model inference; model optimization; deployment; continuous integration and continuous development
Introduction Why machine learning engineering?
- Use cases: anomaly detection (e.g. fraud); optimization (e.g. minimize cost, maximize yield); market analysis; risk analysis
- Data -> prediction -> business value
- Reproducibility
Introduction Machine learning vs traditional data analytics
Criterion | Traditional Data Analytics / BI | Machine Learning
- Data suited for | Structured | Structured and unstructured
- Typical application | Summary/reports, some prediction | Prediction, some summary/reports
- Artifacts | Reports, graphs | Trained models, applications
- Used by | Human decision makers | Application developers
Introduction What are the roles in an ML engineering team?
- Research Scientist: pure and applied research; some systems programming; budget for publishing; requires in-depth knowledge of the science
- Research Engineer: supports the research scientist; more programming; implements papers
- Software/Systems Engineer: supports ML; custom development; systems integration
Tools of the trade First instruments for galvanocautery introduced by Albrecht Middeldorpf in 1854 (source)
Tools of the trade Programming languages
Language | When to use
- Python | General ML, data processing, systems integration
- R | Stats, general data science
- C/C++ | System software, HPC
- JavaScript | Web based applications
- Java/Scala | Enterprise integration
- bash | Systems integration
Tools of the trade Computational libraries and frameworks
Library | Sweet spot | When to look elsewhere
- TensorFlow | Deep learning, production systems including mobile | New to ML, no production requirements
- PyTorch | Ease of use, popular among researchers | Production requirements beyond simple serving
- Keras | Ease of use, production backend with TensorFlow | Affinity with another library (e.g. colleagues use something else)
- MXNet | Performance, scalability, stability | Seeking larger community or features not available in MXNet
- Caffe 2 | Computer vision heritage | Seeking larger community or need features not available in Caffe
- scikit-learn | General purpose ML (see the sketch below) | Deep learning, need GPU
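To give a feel for the "general purpose ML" sweet spot in the table above, here is a minimal scikit-learn sketch. It is illustrative only; the dataset and model choice are arbitrary and not from the talk.

# Illustrative scikit-learn example: train and evaluate a simple classifier.
# Dataset and model choice are arbitrary.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)
print("test accuracy:", model.score(X_test, y_test))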
Tools of the trade Modules and toolkits - Prepackaged models
Name | Application | Language and libraries used
- spaCy | Natural language processing | Python, TensorFlow, PyTorch
- TF-Slim | Image classification | TensorFlow
- TF object detection | Object detection | TensorFlow
- TensorFlow Hub | Various | TensorFlow
- Caffe Model Zoo | Various | Caffe
- TensorFlow models | Various | TensorFlow
- Keras applications | Various | Keras
Tools of the trade Scripting tools
Tool | When to use
- Python + argparse | Create reusable scripts with well defined interfaces (see the sketch below)
- Guild AI | Capture script output as ML experiments
- Paver | Python make-like tool
- Traditional build tools (make, cmake, ninja) | General purpose build automation
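As a hedged sketch of the "Python + argparse" row above, a reusable training script might expose its parameters like this. The flag names, defaults, and the train() helper are hypothetical, not from the talk.

# Illustrative reusable script interface built with argparse.
# Flag names, defaults, and train() are hypothetical.
import argparse

def train(data_dir, learning_rate, epochs):
    print(f"training on {data_dir} (lr={learning_rate}, epochs={epochs})")

def main():
    parser = argparse.ArgumentParser(description="Train a model")
    parser.add_argument("--data-dir", default="./prepared-data", help="training data location")
    parser.add_argument("--learning-rate", type=float, default=0.01, help="optimizer learning rate")
    parser.add_argument("--epochs", type=int, default=10, help="number of training epochs")
    args = parser.parse_args()
    train(args.data_dir, args.learning_rate, args.epochs)

if __name__ == "__main__":
    main()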
Tools of the trade Workflow automation
Tool | When to use
- MLflow | Enterprise wide machine learning workflow
- Guild AI | Ad hoc workflows, integration with other automation systems
- Polyaxon | Kubernetes based job scheduling
- Airflow | General workflow automation
- Traditional scripting | Ad hoc automation
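As a hedged illustration of experiment-tracking-style workflow tooling, a minimal MLflow tracking call looks roughly like this; the parameter and metric names are made up for the example.

# Illustrative MLflow tracking sketch; parameter and metric names are made up.
import mlflow

with mlflow.start_run():
    mlflow.log_param("learning_rate", 0.01)
    mlflow.log_param("epochs", 10)
    # ... run training here ...
    mlflow.log_metric("val_accuracy", 0.92)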
Data analysis Chart showing quarterly value of wheat, 1821 (source)
Data analysis Structured vs unstructured data
- Structured data example: classification chart of Factory Ledger Accounts, 1919 (source)
- Unstructured data example: Darwin's Finches, 1837 (source)
Data analysis Visualization
Matplotlib, Plotly, Shapley, Seaborn, H2O.ai, Visdom, and many, many more!
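A small, illustrative Matplotlib sketch plotting training and validation loss per epoch; the loss values are made up.

# Illustrative Matplotlib sketch; the loss values are made up.
import matplotlib.pyplot as plt

epochs = range(1, 6)
train_loss = [0.90, 0.60, 0.45, 0.38, 0.33]
val_loss = [0.95, 0.70, 0.55, 0.50, 0.49]

plt.plot(epochs, train_loss, label="training loss")
plt.plot(epochs, val_loss, label="validation loss")
plt.xlabel("epoch")
plt.ylabel("loss")
plt.legend()
plt.show()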
Model selection (Representation) Mitchel's Solar System, 1846 (source)
Model selection Standard architectures: CNN, RNN, LSTM, GAN, NAT, AutoML, SVM, etc.
Model selection Hand engineered or learned?
- Hand engineered: rely on experience and recommendations of experts; experiment with novel changes to hyperparameters and architecture; best place to start (see the search sketch below)
- Learned: AutoML for hyperparameter and simple architectural optimization; neural architecture search to learn the entire architecture from data; advanced technique
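Between purely hand-engineered settings and full AutoML, a simple hyperparameter grid search is a common first step. A hedged scikit-learn sketch; the estimator and parameter grid are arbitrary examples, not from the talk.

# Illustrative hyperparameter search; estimator and grid are arbitrary.
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
param_grid = {"C": [0.1, 1, 10], "gamma": ["scale", 0.01]}
search = GridSearchCV(SVC(), param_grid, cv=3)
search.fit(X, y)
print("best params:", search.best_params_, "best score:", search.best_score_)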
Model selection Runtime performance criteria
- Accuracy/precision: various measurements (e.g. accuracy, precision, recall); metrics depend on the prediction task
- Speed/latency: inference time per example; inference time per batch; model and runtime environment interaction (see the timing sketch below)
- Resource constraints: required memory and power; model and runtime environment interaction; mobile and embedded devices are severely constrained
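A hedged sketch of measuring per-batch and per-example inference latency; model and its predict() call are hypothetical stand-ins for whatever inference API your framework provides.

# Illustrative latency measurement; model and predict() are hypothetical stand-ins.
import time

def measure_latency(model, batch, runs=100):
    start = time.perf_counter()
    for _ in range(runs):
        model.predict(batch)
    elapsed = time.perf_counter() - start
    per_batch = elapsed / runs
    per_example = per_batch / len(batch)
    return per_batch, per_example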
Model selection Training performance criteria
- Training progress: training and validation loss/accuracy; time/epochs to convergence; vanishing/exploding gradients
- Time to train: model training time can vary by orders of magnitude; longer runs mean fewer trials
- Cost: GPU/HPC time is expensive; opportunity cost of not training other models; direct impact on time-to-market
Model selection Sample trade off comparison (task: image classification)
Criterion | Logistic Regression | 3 Layer CNN | ResNet-50 | NASNet
- Accuracy | Low | Medium | High | Very High
- Inference memory | Very Low | Low | High | Very High
- Inference latency | Very Low | Low | High | Very High
- Training time | Very Low | Low | High | Very High
- Training cost | Very Low | Very Low | Medium | Medium
Training Wanderer above the Sea of Fog, Caspar David Friedrich, 1818 (source)
Training Primary training patterns
- Train from scratch
- Transfer learn
- Fine tune
- Retrain
Training Train from Scratch Wooden frame construction in Sabah, Malaysia (source)
Training Transfer Learn “The Barge” at PolarTrec Northeast Scientific Station, Siberia Russia (source)
Training Fine Tune WTC under construction, April 2012 (source)
Training Retrain Framing for new addition to home (source)
Training Training techniques
Criterion | Train from Scratch | Transfer Learn | Fine Tune | Retrain
- When | No pretrained models | Pretrained models for a different task | Pretrained model for the same task | Pretrained model for the same task, different number of output classes
- Data requirements | Highest | Reduced | Reduced | Reduced
- Training time | Highest | Reduced | Reduced | Reduced to unchanged
- Domains/tasks involved | 1 | 2 | 1 | 1
- When used | No pretrained model; lots of data and compute resources; highest accuracy required | Pretrained model; limited data and compute resources | Pretrained model; additional data or compute resources to improve accuracy | Pretrained model for the same task; need to remove or add classes
Training TF Slim transfer learn example
$ python train_image_classifier.py \
    --model_name resnet-50 \
    --dataset_dir ./prepared-data \
    --train_dir train \
    --checkpoint_path checkpoint/resnet_v1_50.ckpt \
    --checkpoint_exclude_scopes resnet_v1_50/logits \
    --trainable_scopes resnet_v1_50/logits

- --model_name: model architecture (network)
- --dataset_dir: new data for the new task
- --checkpoint_path: model weights from the source task (ImageNet)
- --checkpoint_exclude_scopes: layer weights not initialized from the checkpoint (the unfrozen layers)
- --trainable_scopes: layer weights to train (all others are frozen)

https://github.com/tensorflow/models/tree/master/research/slim
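The same transfer-learn pattern (restore a pretrained base, train only a new classification head) can be sketched in Keras. This is a hedged, illustrative example; the class count, data pipeline, and hyperparameters are made up and it is not the talk's own code.

# Illustrative Keras transfer-learn sketch; num_classes, data, and settings are made up.
import tensorflow as tf

num_classes = 5  # hypothetical number of target classes

# Load ImageNet weights for the convolutional base and freeze them.
base = tf.keras.applications.ResNet50(weights="imagenet", include_top=False, pooling="avg")
base.trainable = False

# Add a new classification head for the target task (the only trainable layers).
model = tf.keras.Sequential([
    base,
    tf.keras.layers.Dense(num_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])

# train_ds would be a tf.data.Dataset of (image, label) batches prepared elsewhere:
# model.fit(train_ds, epochs=5)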