  1. DSC 102 Systems for Scalable Analytics
  Arun Kumar
  Topic 7: ML Deployment
  Not included for Final Exam
  Slide Content ACKs: Alkis Polyzotis, Manasi Vartak

  2. The Lifecycle of ML-based Analytics
  [Lifecycle diagram: Data acquisition → Data preparation → Feature Engineering → Model Selection → Training & Inference → Serving → Monitoring, looping back to acquisition]

  3. Deployment Stage of Data Science
  ❖ Data science does not exist in a vacuum. It must interplay with the data-generating process and the prediction application.
  ❖ Deploy Stage: Integrate the trained prediction function(s) with the production environment, e.g., offline inference in a data system, online inference on a Web platform / IoT / etc.
  ❖ Typically, the data scientist must work with “DevOps” engineers or “MLOps” engineers to achieve this.

  4. ML in Academia vs Production
  What your classes on statistics, ML, AI, etc. cover! ☺
  https://papers.nips.cc/paper/5656-hidden-technical-debt-in-machine-learning-systems.pdf

  5. Deployment Stage of Data Science
  ❖ The deployment stage typically involves 5 main activities in sync with other stages:
  1. Packaging and Orchestration
  2. Prediction Serving
  3. Data Validation
  4. Prediction Monitoring
  5. Versioning

  6. 1. Packaging and Orchestration
  ❖ Basic Goal: Bundle up the software to deploy, along with its dependencies, into a lightweight standalone executable that can run almost seamlessly across different OSs and hardware environments.
  ❖ Most common approach today: Containerization (see the sketch below)
  ❖ Not specific to ML deployment but highly general
  ❖ The older-generation approach, “virtual machines”, bundled the OS too and was bulky and slow.
  ❖ Docker and Kubernetes are the most popular options today.
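
A minimal sketch of building and launching a containerized prediction service, assuming the Docker SDK for Python (docker-py) and a Dockerfile already present in the working directory; the image tag and port mapping are hypothetical.

```python
# Sketch: build and run a containerized prediction service via docker-py.
import docker

client = docker.from_env()

# Build an image from a Dockerfile assumed to be in the current directory.
image, build_logs = client.images.build(path=".", tag="predict-service:v1")

# Run the container detached, mapping a hypothetical inference port 8080.
container = client.containers.run(
    "predict-service:v1",
    detach=True,
    ports={"8080/tcp": 8080},
)
print(container.short_id)
```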

  7. 1. Packaging and Orchestration
  https://medium.com/edureka/kubernetes-vs-docker-45231abeeaf1

  8. 1. Packaging and Orchestration
  ❖ Often, one might need to deploy end-to-end pipelines made of effectively independent containerized software modules.
  ❖ Workflow orchestration tools help handle such complex pipelines (see the sketch below).
  ❖ Can specify time constraints, operation constraints, etc.
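
A minimal sketch of a two-step scoring pipeline expressed with Apache Airflow, one widely used workflow orchestration tool; the DAG name, schedule, and task bodies are hypothetical placeholders.

```python
# Sketch: a daily batch-scoring pipeline as an Airflow DAG.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def validate_data():
    # Placeholder: check that today's input batch conforms to the schema.
    print("validating input data")

def run_inference():
    # Placeholder: load the deployed model and score the batch.
    print("scoring batch")

with DAG(
    dag_id="daily_batch_scoring",   # hypothetical pipeline name
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",     # time constraint on the pipeline
    catchup=False,
) as dag:
    validate = PythonOperator(task_id="validate_data", python_callable=validate_data)
    score = PythonOperator(task_id="run_inference", python_callable=run_inference)
    validate >> score  # operation constraint: validation must precede inference
```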

  9. 1. Packaging and Orchestration
  ❖ Cloud providers are also starting to make it easier to package and deploy prediction software, e.g., Model Endpoints in AWS SageMaker (see the sketch below).
  ❖ Data scientists must look out for their organization’s tools and services.
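
A minimal sketch of invoking an already-deployed SageMaker model endpoint with boto3; the endpoint name and payload format are hypothetical.

```python
# Sketch: call a deployed SageMaker model endpoint for online inference.
import boto3

runtime = boto3.client("sagemaker-runtime")

response = runtime.invoke_endpoint(
    EndpointName="churn-model-v1",  # hypothetical endpoint name
    ContentType="text/csv",
    Body="34,0.5,120.0",            # one feature vector serialized as CSV
)
prediction = response["Body"].read().decode("utf-8")
print(prediction)
```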

  10. 2. Prediction Serving
  ❖ Basic Goal: Make ML inference fast and potentially co-optimize it with the serving environment/infrastructure.
  ❖ These are typically automated tools, so data scientists only need to know what systems are available and how to use them.
  ❖ 3 main kinds of systems:
  ❖ Program optimization of the prediction function to improve hardware utilization, e.g., ONNX Runtime or Apache TVM (see the sketch below)
  ❖ Batch optimization of many concurrent prediction requests to better balance latency and throughput and improve hardware utilization, e.g., AWS SageMaker
  ❖ New hardware optimized for inference, e.g., TPUs
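
A minimal sketch of serving an exported model with ONNX Runtime, including batching several requests into one call; the model file name and feature dimension are hypothetical.

```python
# Sketch: batched inference with ONNX Runtime on a pre-exported model.
import numpy as np
import onnxruntime as ort

# Load a model previously exported to the ONNX format.
session = ort.InferenceSession("model.onnx")  # hypothetical file name
input_name = session.get_inputs()[0].name

# Group 32 concurrent requests into one batched call to improve
# hardware utilization; the feature dimension (100) is a made-up example.
batch = np.random.rand(32, 100).astype(np.float32)
outputs = session.run(None, {input_name: batch})
print(outputs[0].shape)
```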

  11. 3. Data Validation
  ❖ Basic Goal: Ensure the data fed into the prediction function conforms to its expectations on, say, schema/syntax/shape, integrity constraints (e.g., value ranges or domains), etc.
  ❖ Needs to be in lock step with the data sourcing stage: acquiring, re-organizing, cleaning, and feature extraction
  ❖ Industry is starting to build platforms to make this process more rigorous and reusable, e.g., TensorFlow Extended (see the sketch below)
  ❖ Data scientists must learn their organization’s data validation practices and tools/APIs
  ❖ Also covered in Alkis’s guest lecture; further reading: https://mlsys.org/Conferences/2019/doc/2019/167.pdf
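
A minimal sketch of schema-based validation with TensorFlow Data Validation (the data-validation component of TensorFlow Extended); the column names and values are hypothetical.

```python
# Sketch: infer a schema from training data and validate serving data with it.
import pandas as pd
import tensorflow_data_validation as tfdv

train_df = pd.DataFrame({"age": [25, 40, 33], "income": [50_000.0, 80_000.0, 62_000.0]})
serving_df = pd.DataFrame({"age": [29, -5], "income": [55_000.0, 70_000.0]})

# Infer a schema (feature names, types, domains) from training-data statistics.
train_stats = tfdv.generate_statistics_from_dataframe(train_df)
schema = tfdv.infer_schema(train_stats)

# Check incoming serving data against the training-time schema and
# report any anomalies found.
serving_stats = tfdv.generate_statistics_from_dataframe(serving_df)
anomalies = tfdv.validate_statistics(serving_stats, schema)
tfdv.display_anomalies(anomalies)
```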

  12. 4. Prediction Monitoring
  ❖ Basic Goal: Ensure the prediction functions are working as intended by the data scientist; “silent failures” can happen due to concept drift, i.e., the data distribution has deviated significantly from when the prediction function was built!
  ❖ Example: A sudden world event changes Web user behavior drastically, e.g., WHO declares a pandemic! ☺
  ❖ Needs to be in lock step with the model building stage
  ❖ Industry today uses ad hoc statistical approaches (a sketch of one follows below)
  ❖ Data scientists must look out for their organization’s monitoring practices, since it affects the lifecycle loop frequency
  ❖ Also covered in Alkis’s guest lecture; further reading: https://mlsys.org/Conferences/2019/doc/2019/167.pdf
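
A minimal sketch of one such ad hoc statistical approach: a two-sample Kolmogorov-Smirnov test comparing a feature's training-time distribution against its live distribution; the data here is synthetic and the 0.01 threshold is an arbitrary example.

```python
# Sketch: flag possible drift by comparing feature distributions with a KS test.
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)
training_feature = rng.normal(loc=0.0, scale=1.0, size=10_000)
live_feature = rng.normal(loc=0.5, scale=1.0, size=10_000)  # shifted: drift

stat, p_value = ks_2samp(training_feature, live_feature)
if p_value < 0.01:
    print(f"Possible drift detected (KS stat={stat:.3f}); consider retraining.")
```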

  13. 5. Versioning
  ❖ Basic Goal: Just like regular code, prediction software must be versioned and tracked for teams to ensure consistency across time and employees, as well as for auditing’s sake, the ability to “roll back” to a safer state, etc.
  ❖ But unlike regular code, prediction software has 3 more dependencies beyond just code: datasets (train/val/test), configuration (e.g., hyper-parameters), and environment (hardware/software, since that can affect accuracy too); a sketch capturing all four follows below.
  ❖ Research and industry are only just starting to figure this out
  ❖ Data scientists must look out for versioning best practices/tools
  ❖ Covered in Manasi’s guest lecture; https://blog.verta.ai/blog/how-to-move-fast-in-ai-without-breaking-things
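
A minimal sketch of recording all four dependencies (code, datasets, configuration, environment) in one content-addressed version record; the file paths and hyper-parameters are hypothetical placeholders.

```python
# Sketch: derive a version ID covering code, data, config, and environment.
import hashlib
import json
import platform
import sys

def file_hash(path):
    # Hash a file's bytes in chunks so large datasets fit in memory.
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

version_record = {
    "code": file_hash("predict.py"),                       # hypothetical path
    "datasets": {s: file_hash(f"data/{s}.csv") for s in ("train", "val", "test")},
    "config": {"learning_rate": 0.01, "max_depth": 6},     # hyper-parameters
    "environment": {"python": sys.version, "platform": platform.platform()},
}

# A stable hash of the whole record serves as the rollback-able version ID.
version_id = hashlib.sha256(
    json.dumps(version_record, sort_keys=True).encode()
).hexdigest()[:12]
print(version_id)
```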
