cs 744 datacenter as a computer
play

CS 744: DATACENTER AS A COMPUTER Shivaram Venkataraman Fall 2020 - PowerPoint PPT Presentation

CS 744: DATACENTER AS A COMPUTER Shivaram Venkataraman Fall 2020 ANNOUNCEMENTS - Assignments - Assignment zero is due! - Form groups for Assignment 1 on Piazza - Class format - Review - Lecture - Discussion Applications Machine


  1. CS 744: DATACENTER AS A COMPUTER Shivaram Venkataraman Fall 2020

  2. ANNOUNCEMENTS - Assignments - Assignment zero is due! - Form groups for Assignment 1 on Piazza - Class format - Review - Lecture - Discussion

  3. Applications Machine Learning SQL Streaming Graph Computational Engines Scalable Storage Systems Resource Management Datacenter Architecture

  4. OUTLINE - Hardware Trends - Datacenter design - WSC workloads - Discussion

  5. Why is One Machine Not Enough?

  6. What’s in a Machine? Interconnected compute and storage Memory Bus Newer Hardware - GPUs, FPGAs PCIe v4 - RDMA, NVlink Ethernet SATA

  7. Scale Up: Make More Powerful Machines Moore’s law – Stated 52 years ago by Intel founder Gordon Moore – Number of transistors on microchip double every 2 years – Today “closer to 2.5 years” Intel CEO Brian Krzanich

  8. Dennard Scaling is the Problem Suggested that power requirements are proportional to the area for transistors – Both voltage and current being proportional to length – Stated in 1974 by Robert H. Dennard (DRAM inventor) “Adapting to Thrive in a New Economy of Memory Abundance,” Bresniker et al Broken since 2005

  9. Dennard Scaling is the Problem Performance per-core is stalled Number of cores is increasing “Adapting to Thrive in a New Economy of Memory Abundance,” Bresniker et al

  10. Memory TRENDS

  11. MEMORY TAKEAWAY Growing Data access from memory is getting more expensive ! +15% per year

  12. HDD CAPACITY

  13. HDD BANDWIDTH Disk bandwidth is not growing

  14. SSDs Performance: – Reads: 25us latency – Write: 200us latency – Erase: 1,5 ms Steady state, when SSD full – One erase every 64 or 128 reads (depending on page size) Lifetime: 100,000-1 million writes per page

  15. SSD VS HDD COST

  16. Ethernet Bandwidth Growing 33-40% per year ! 2017 2002 1998 1995

  17. AMAZON EC2 (2019)

  18. TRENDS SUMMARY CPU speed per core is flat Memory bandwidth growing slower than capacity SSD, NVMe replacing HDDs Ethernet bandwidth growing

  19. DATACENTER ARCHITECHTURE Memory Bus PCIe Ethernet SATA Server Server

  20. STORAGE HIERARCHY (DC AS A COMPUTER v2)

  21. Warehouse-Scale Computers Many concerns – Infrastructure Single organization – Networking Homogeneity (to some extent) – Storage Cost efficiency at scale – Software – Multiplexing across applications and services – Power/Energy – Rent it out! – Failure/Recovery – …

  22. SOFTWARE IMPLICATIONS Reliability Storage Hierarchy Workload Diversity Single organization

  23. WORKLOAD: Partition-Aggregate BigData Top-level Aggregator Mid-level Aggregators Workers

  24. WORKLOAD: SCHOLAR SIMILARITY Map Stage Reduce Stage

  25. VIDEO ENCODING

  26. MACHINE LEARNING

  27. DISCUSSION https://forms.gle/CrrrhCPYHerwXNEt5

  28. Discussion Scale-up vs Scale-out

  29. DISCUSSION Microsoft Word vs. online document editor like Google Docs

  30. DISCUSSION

  31. NEXT STEPS Next class: Storage Systems Assignment 1 out Thursday. Submit groups before that!

Recommend


More recommend