low latency gpgpu
play

LOW-LATENCY GPGPU A 5-minute intro and investigation Matheus Vitti - PowerPoint PPT Presentation

LOW-LATENCY GPGPU A 5-minute intro and investigation Matheus Vitti Santos @ Meeting C++ 2019 Disclaimer These findings reflect the point of view of someone whos been courting only CUDA in a hobbyist setting since 2010, and in a


  1. LOW-LATENCY GPGPU A 5-minute intro and investigation Matheus Vitti Santos @ Meeting C++ 2019

  2. Disclaimer ■ These findings reflect the point of view of someone who’s been courting only CUDA in a hobbyist setting since 2010, and in a (lightweight) professional setting since 2017 ■ I’d love to hear the viewpoints of AMD, Intel, Direct3D, Metal, and Vulkan folks on this - hit me up afterwards! – And maybe give me some hardware to play with…? Matheus Vitti Santos @ Meeting C++ 2019

  3. So, GPGPUs have latency issues… ■ Calling GPU functions ta takes ti time ■ Moving memory around ta takes a a l lot o t of ti time ■ The GPU Driver ta takes i its ts s sweet, s t, sweet ti t time ■ …right? Matheus Vitti Santos @ Meeting C++ 2019

  4. Matheus Vitti Santos @ Meeting C++ 2019

  5. Matheus Vitti Santos @ Meeting C++ 2019

  6. 10+ years of GPGPU ■ APIs are slimmer! – (D3D12 / Vulkan / Metal vs. OpenGL/D3D9) ■ GPUs are faster! ■ PCI Express is faster! ■ Most of all, Drivers a are f faster ! Matheus Vitti Santos @ Meeting C++ 2019

  7. Literally 2 weeks ago: NVIDIA introduces “Ultra Low Latency Mode” Matheus Vitti Santos @ Meeting C++ 2019

  8. “Latency” is relative Do Domain Accepta table L Latency Protein Folding Simulation Days Weather Simulation Hours Radar Signal Convolution 200~300ms Videogame 10~30ms Audio Processing 5~10ms High Frequency Stock Trading <1ms Matheus Vitti Santos @ Meeting C++ 2019

  9. “Latency” is relative Do Domain Accepta table L Latency Protein Folding Simulation Days Weather Simulation Hours Radar Signal Convolution 200~300ms Videogame 10~30ms Audio Processing 5~10ms High Frequency Stock Trading <1ms Matheus Vitti Santos @ Meeting C++ 2019

  10. Experiment: Real-time FM Synth Solar Modulation - Savaged Regime Matheus Vitti Santos @ Meeting C++ 2019

  11. Test subjects GeForce 640M Quadro P400 (this computer) (office workstation) ■ Kepler Architecture, SM 3.0, 2012 ■ Pascal Architecture, SM 6.1, 2017 ■ 2GB GDDR5 / 128bit / 900MHz ■ 2GB GDDR5 / 64bit / 2GHz ■ ~390 Gflops ■ ~630 Gflops – 2x PlayStation 3 – ½ Xbox One – Intel UHD 620 – 2x Intel Iris 5100 ■ ~25 Giops ■ ~200 Giops Matheus Vitti Santos @ Meeting C++ 2019

  12. Compute Time per Audio Frame 10.0 ms 1.0 ms 95% GeForce 95% Quadro 99.9% GeForce 99.9% Quadro 48kHz 96kHz 0.1 ms 32 64 128 256 512 1024 Matheus Vitti Santos @ Meeting C++ 2019

  13. DEMO TIME!

  14. THANK YOU! And go do something awesome with that GPU of yours!

  15. Image Sources ■ Screenshot: Marble Madness, c. Atari 1984 ■ Nvidia Control Panel: https://www.howtogeek.com/437761/how-to-enable-ultra-low- latency-mode-for-nvidia-graphics/ ■ FM Music Video: Savaged Regime Matheus Vitti Santos @ Meeting C++ 2019

Recommend


More recommend