cs 744 ray
play

CS 744: RAY Shivaram Venkataraman Fall 2019 ADMINISTRIVIA - - PowerPoint PPT Presentation

CS 744: RAY Shivaram Venkataraman Fall 2019 ADMINISTRIVIA - Assignment 1 Grades - Assignment 2 due on Fri - Course Project emails Bismarck Supervised learning, Unified Interface Shared memory, Model fits in memory Parameter Server Large


  1. CS 744: RAY Shivaram Venkataraman Fall 2019

  2. ADMINISTRIVIA - Assignment 1 Grades - Assignment 2 due on Fri - Course Project emails

  3. Bismarck Supervised learning, Unified Interface Shared memory, Model fits in memory Parameter Server Large datasets, large models (PB scale) Machine Learning Consistency model, Fault tolerance Tensorflow Need for flexible programming model Dataflow graph Heterogeneous accelerators

  4. Bismarck Parameter Server WORKLOADS Tensorflow

  5. REINFORCEMENT LEARNING

  6. RL SETUP

  7. RL REQUIREMENTS Simulation Training Serving

  8. RAY API Tasks Actors futures = f.remote(args) actor = Class.remote(args) futures = actor.method.remote(args) objects = ray.get(futures) ready = ray.wait(futures, k,timeout)

  9. COMPUTATION MODEL

  10. ARCHITECTURE

  11. Global control store Object table Task table Function table

  12. RAY SCHEDULER Global Scheduler Global Control Store

  13. FAULT TOLERANCE Tasks Actors GCS Scheduler

  14. DISCUSSION https://forms.gle/QQyLbwjAufJNXWnr6

  15. Consider you are implementing two task: a deep learning model training and a sorting application. When will use tasks vs actors and why ?

  16. Considering AllReduce using MPI as the baseline parallel programming task. Discuss the improvements made by MapReduce, Spark over MPI and discuss if/how Ray further contributes to the comparison.

  17. NEXT STEPS Next class: Clipper Assignment 2 due this week! Course project

Recommend


More recommend