tensor layers for compression of deep learning networks
play

TENSOR LAYERS FOR COMPRESSION OF DEEP LEARNING NETWORKS Cris Cecka - PowerPoint PPT Presentation

TENSOR LAYERS FOR COMPRESSION OF DEEP LEARNING NETWORKS Cris Cecka Senior Research Scientist, NVIDIA GTC 2018 Tensors Computations and the GPU AGENDA Tensor Networks and Decompositions Tensor Layers in Deep Learning 2 TENSOR COMPUTATIONS


  1. TENSOR LAYERS FOR COMPRESSION OF DEEP LEARNING NETWORKS Cris Cecka Senior Research Scientist, NVIDIA GTC 2018

  2. Tensors Computations and the GPU AGENDA Tensor Networks and Decompositions Tensor Layers in Deep Learning 2

  3. TENSOR COMPUTATIONS AND THE GPU Modern data is inherently multi-dimensional. 3

  4. TENSOR CONTRACTIONS • Core primitive of multilinear algebra • BLAS level 3 — unbounded compute intensity. X

  5. TENSOR LIBRARIES Explicit permutation dominates. Y . Shi, U. N. Niranjan, A. Anandkumar and C. Cecka , "Tensor Contractions with Extended BLAS Kernels on CPU and GPU," 2016 IEEE X 23rd International Conference on High Performance Computing (HiPC), Hyderabad, 2016, pp. 193-202.

  6. CONTRACTIONS : Single GEMM (Provided compact layout) X

  7. BATCHED MATRIX-MATRIX MULTIPLY cublas<T>gemmStridedBatched X

  8. CONTRACTIONS : Single SB-GEMM (Any layout) X

  9. APPLICATION: FFT : Tensor/FFT — vendor optimized cublas<T>gemmStridedBatched : Custom kernel : FMM Communication StridedBatchedGEMM: 75%+ of the runtime 1.5x over cuFFT on 2xV100 2.6x over cuFFT on 8xV100 Cris Cecka. “Low communication FMM-accelerated FFT on GPUs." In Proceedings of the International Conference for High X Performance Computing, Networking, Storage and Analysis (SC '17). ACM, New York, NY, USA.

  10. WHY TENSORS? 10

  11. DENSITY AND SPARSITY H. Anzt, S. Tomov, J. Dongarra , “Energy Efficiency and Performance Frontiers for Sparse Computations on GPU 11 Supercomputers," PMAM 2015.

  12. DENSITY AND SPARSITY In general, need < 5% sparsity for a computational win. Solutions Block-sparse — Locally dense and globally sparse 12

  13. TENSOR DECOMPOSITIONS Decompositions for data sparse representations. 13

Recommend


More recommend