nucleus eight gpu platform for visual simulation
play

Nucleus: Eight GPU Platform for Visual Simulation David Morgan - PowerPoint PPT Presentation

Nucleus: Eight GPU Platform for Visual Simulation David Morgan Principal Engineer Aechelon Technology S9224 Session Trajectory Visual Simulation Background Monsters, Clusters, and Moores Law Nucleus Architecture Challenges


  1. Nucleus: Eight GPU Platform for Visual Simulation David Morgan Principal Engineer Aechelon Technology S9224

  2. Session Trajectory • Visual Simulation Background • Monsters, Clusters, and Moore’s Law • Nucleus Architecture • Challenges • Demo

  3. Aechelon Technology

  4. Image Generation

  5. 2000: RealityMonster • 5 Racks • 8 Graphics “Pipelines” (GPUs) • 24 CPUs • 9GB RAM (NUMA) • 140GB Storage • Single IRIX OS • 15kW • $2.8M

  6. Scalability Matters • 128 CPUs • 256GB RAM • 16 GPUs

  7. 2001: GeForce 3

  8. 2002-Today: PC Clusters • 1-3 Racks per IG • 1 GPU per node • 1U Diskless Renderers • 3U Pager w/88TB Storage • Windows OS Per Node • Ethernet Interconnect • 7000W (8ch) • Unlimited Scalability

  9. Moore’s Law is Dead 3.8 3.6 Cluster 3.4 3.2 Skylake 2017 3 Nucleus Broadwell 2016 GHz 2.8 Haswell 2014 Ivy Bridge 2013 2.6 Sandy Bridge 2012 2.4 2.2 2 4 6 8 10 12 14 16 18 20 22 24 26 28 Cores

  10. 2016: 8-GPU Support

  11. Multi-GPU is Hard

  12. Nucleus • 4U • 8 Quadro GPUs • One display per GPU • 36 CPU Cores • 192GB RAM • 36TB Storage • One Windows OS • $100-200K • Operates up to 35C • 2000W • Limited Scalability

  13. Dual Root Complex

  14. Single Root Complex

  15. GPU Affinity • Exposed in OpenGL through WGL_NV_gpu_affinity extension • Quadro feature necessary to address individual GPUs on Windows • pC-Nova Maps GPU device handles to screens in the Windows virtual desktop • Beware driver crashes enumerating more than 4 screens per GPU!

  16. EDID Management http://johnsciacca.webs.com/apps/blog/show/16852621-installation-nightmares-9-professional-horror-stories

  17. DWM Is… • Independent GPUs’ video timings phase shift. • Normally correctable by tracking the phase • “Full -Screen Exclusive Mode” is gone. • DWM intermediates all drawing on multi-display systems. • One display is Primary.

  18. DWM Is Evil GPU 1 GPU 2 https://www.pandza.xyz/article/16/dwm,-dxgi,-swap-chains,-latency,-throughput-and-you

  19. Workaround: Framelock • Quadro Sync II supports 8 GPUs per system • Shared oscillator ensures displays remain in phase with Primary • DWM placated! • Downside: Video timings must all match • Downside: Wiring is delicate

  20. Future Work • GPU Multicast • Or Dual Root Complex? • VR Direct? • Clusters of Nuclei

  21. Thanks • Doug Traill • John Chaney • Tim Woodard • Steve Nash • Ian Williams

  22. Demo

  23. Questions?

Recommend


More recommend