comp 135 introduction to machine learning
play

COMP 135 Introduction to Machine Learning First day of class - PowerPoint PPT Presentation

COMP 135 Introduction to Machine Learning First day of class Spring 2019 https://www.cs.tufts.edu/comp/135/2019s/ Many slides attributable to: Emily Fox (UW), Finale Doshi-Velez (Harvard), Erik Sudderth (UCI), & Liping Liu (Tufts) 1


  1. COMP 135 Introduction to Machine Learning First day of class Spring 2019 https://www.cs.tufts.edu/comp/135/2019s/ Many slides attributable to: Emily Fox (UW), Finale Doshi-Velez (Harvard), Erik Sudderth (UCI), & Liping Liu (Tufts) 1

  2. Why Machine Learning? Mike Hughes - Tufts COMP 135 - Spring 2019 2

  3. Image Credit: Emily Fox Mike Hughes - Tufts COMP 135 - Spring 2019 3

  4. Artificial Intelligence (AI) • Study of “intelligent systems”, with many parts: logic, planning, search, probabilistic reasoning, learning from experience, interacting with other agents, etc. Mike Hughes - Tufts COMP 135 - Spring 2019 4

  5. Machine Learning (ML) • Study of algorithms that learn from experience/data to perform a task • Task output: a prediction or a decision Mike Hughes - Tufts COMP 135 - Spring 2019 5

  6. The Machine Learning Process Past data Performance Task / Goal (or “experience”) measure input data prediction / decision (now) (now) Mike Hughes - Tufts COMP 135 - Spring 2019 6

  7. Why take this course? Mike Hughes - Tufts COMP 135 - Spring 2019 7

  8. Why take this course? Our goal is to prepare students to effectively apply machine learning methods to problems that might arise in "the real world" -- in industry, medicine, education, and beyond. Gain skills and understanding for a future as: • Developer using ML “out-of-the-box” • ML methods researcher Mike Hughes - Tufts COMP 135 - Spring 2019 8

  9. Why take this course? Students will be able to: - Think systematically - Compare/contrast each method’s strengths & limitations - Deploy rapidly - Hands-on experience with open-source libraries - Evaluate carefully - Design experiments with task-appropriate performance metrics and strong baselines - Report uncertainty in performance Mike Hughes - Tufts COMP 135 - Spring 2019 9

  10. What skills will we learn? Mike Hughes - Tufts COMP 135 - Spring 2019 10

  11. What will we learn? Supervised Learning Data, Label Pairs Performance { x n , y n } N measure Task n =1 Unsupervised Learning data label x y Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 11

  12. Task: Regression y is a continuous variable Supervised e.g. sales in $$ Learning regression y Unsupervised Learning Reinforcement x Learning Mike Hughes - Tufts COMP 135 - Spring 2019 12

  13. Regression Example: Uber Supervised Learning regression Unsupervised Learning Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 13

  14. Regression Example: Uber Mike Hughes - Tufts COMP 135 - Spring 2019 14

  15. Regression Example: Uber Mike Hughes - Tufts COMP 135 - Spring 2019 15

  16. Task: Classification y is a discrete variable Supervised (red or blue) Learning classification x 2 Unsupervised Learning Reinforcement Learning x 1 Mike Hughes - Tufts COMP 135 - Spring 2019 16

  17. Classification Example: Swype Mike Hughes - Tufts COMP 135 - Spring 2019 17

  18. What will we learn? Supervised Learning Data, Label Pairs Performance { x n , y n } N measure Task n =1 Unsupervised Learning data label x y Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 18

  19. What will we learn? Supervised Learning Data Examples Performance { x n } N measure Task n =1 Unsupervised Learning summary data of x x Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 19

  20. Task: Clustering Supervised Learning x 2 Unsupervised Learning clustering x 1 Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 20

  21. Clustering Example: News Mike Hughes - Tufts COMP 135 - Spring 2019 21

  22. Task: Embedding Supervised Learning x 2 Unsupervised Learning embedding Reinforcement x 1 Learning Mike Hughes - Tufts COMP 135 - Spring 2019 22

  23. Example: Genes vs. geography Mike Hughes - Tufts COMP 135 - Spring 2019 23

  24. Task: Recommendation Supervised Learning recommendation Unsupervised Learning Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 24

  25. Recommendation Example Mike Hughes - Tufts COMP 135 - Spring 2019 25

  26. What will we learn? Histories of states, Supervised actions, rewards Learning Performance { s t , a t , r t , s t +1 , . . . } N n =1 measure Task Unsupervised Learning Recent Next history action a Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 26

  27. RL example: Pancake robot Mike Hughes - Tufts COMP 135 - Spring 2019 27

  28. What will we learn this semester? If I want more? Supervised ~ 10 weeks Take COMP 136 Learning 6 homeworks 3 projects Take COMP 150 – Deep Learning Unsupervised ~ 2 weeks Take COMP 136 1.5 homeworks Learning Reinforcement ~ 1 week Take COMP 150 – RL (Prof. Jivko Sinapov) 0.5 homeworks Learning Mike Hughes - Tufts COMP 135 - Spring 2019 28

  29. What we won’t cover • Active learning • Transfer learning • Semi-supervised learning • Learning theory • ….. lots more Mike Hughes - Tufts COMP 135 - Spring 2019 29

  30. Who is teaching? • Instructor: Prof. Mike Hughes • TA Staff • Mike Pietras • Rui Chen • Minh Nguyen • Duc Nguyen • Wayne Tang Mike Hughes - Tufts COMP 135 - Spring 2019 30

  31. Problem: When will ICU patient need intervention ? Ghassemi, Wu, Hughes , et al. AMIA CRI 2017 Interventions: mechanical ventilation • blood pressure drugs • Early prediction helps: prepare patient plan staffing try less aggressive options early

  32. Cohort from MIMIC-III dataset mimic.physionet.org 36,050 patients (Johnson et al. Sci. Data 2016) • from Beth-Israel Deaconess in Boston 2001-2012 • kept all adults with record within 6-360 hours

  33. Observed data 7 nurse-validated vital signs (hourly) heart rate, blood pressure, temp., SpO2, … 11 lab measurements (much less than hourly) hematocrit, lactate, … each channel standardized to mean=0, var=1 with carry-and-hold for missing data

  34. Task: predict need in advance +2 hrs ahead Mike Hughes - Tufts COMP 135 - Spring 2019 34

  35. Vasopressor prediction : 1 hr ahead Area-under-ROC static demographics (age, race, etc) “embedding” using time-series model dynamic patient vitals at time t Mike Hughes - Tufts COMP 135 - Spring 2019 35

  36. Key Stakeholder Questions • How should we fill in missing values? • How to deal with imbalance data? • most patients never get drug X • How to deal with imbalanced mistake costs? • remove ventilator too early ends life, too late costs $$ • How uncertain are predictions? • How will this generalize to a new hospital? We can start to answer many of these in COMP 135 Mike Hughes - Tufts COMP 135 - Spring 2019 36

  37. Logistics • Course website (take a tour) • https://www.cs.tufts.edu/comp/135/2019s/ • Discussions on Piazza • Lectures every Mon & Wed • Recitation Sessions with TAs every Mon • Deliverables: 2 exams, 8 homeworks, 3 projects • Collaboration Policy Mike Hughes - Tufts COMP 135 - Spring 2019 37

  38. Let’s Get Started! • Setup your Python environment ASAP • First recitation is Mon 1/21 • Get help with your Python env. • Learn basics of arrays, plotting, etc • HW0 due NEXT WEEK (Wed 1/23 at 11:59pm) Mike Hughes - Tufts COMP 135 - Spring 2019 38

Recommend


More recommend