scalable data processing at network transfer rates with
play

Scalable Data Processing at Network transfer rates with nCorium - PowerPoint PPT Presentation

Scalable Data Processing at Network transfer rates with nCorium Compute in Memory Modules Suresh Devalapalli, Brett Neuman, Arvindh Lalam 11/18/2019 1 OSS Layer Bottlenecks in HPC Facilities 10,000 000s s server ers 10s of Peta


  1. Scalable Data Processing at Network transfer rates with nCorium Compute in Memory Modules Suresh Devalapalli, Brett Neuman, Arvindh Lalam 11/18/2019 1

  2. OSS Layer Bottlenecks in HPC Facilities 10,000’ 000’s s server ers • 10s of Peta Bytes of data moved in and out of Storage layer Compu pute L e Layer er • Challenges: OSS SS 1. Data processing on CPU unable to keep up with increase in network and storage speeds 2. I/O throughput per node is limited Storage L e Layer er 11/18/2019 2

  3. nCorium Compute in Memory Module • DIMM form-factor • Reconfigurable compute cores on the DIMM • Off-load functions via libraries: • Compression • Erasure • Encryption • Video Transcoding • ….etc nCIMM Array in the system • Data Stream based processing 11/18/2019 3

  4. nCIMMs + nC-Grid = Scalable nCorium-OSS • nCIMM to off-load data processing • nC-Grid to open wider I/O 1 st Gen Server With 5 nCSTs • Scalable Architecture: nCST SSD SSD SSD SSD  Number of nCIMMs EDR (Storage Target) SSD SSD SSD SSD  Functions to support nCST SSD SSD SSD SSD CPU EDR (Storage Target) SSD SSD SSD SSD nCIMM nCIMM nCST SSD SSD SSD SSD EDR nCIMM (Storage Target) SSD SSD SSD SSD nCIMM nCIMM nCST SSD SSD SSD SSD EDR nCIMM (Storage Target) SSD SSD SSD SSD nCIMM nCIMM nCST SSD SSD SSD SSD nCIMM EDR (Storage Target) SSD SSD SSD SSD nCIMM Management Network Data Network 11/18/2019 4

  5. Performance of Gen-1 nCorium OSS: Q’1 19 RDMA 5x for RDMA + Erasure 60 3.4X RDMA RDMA + Encode 50.0 50 3.4x for RDMA+ Erasure RDMA + SSD 41.1 RDMA + Encode + SSD 40 Througbput (GBps) 34.3 30.8 28.6 30 26.3 22.5 22.2 21.1 20.7 Performance compared with Intel Gold CPU based 18.4 20 system 15.9 15.0 14.6 11 10.8 10.5 9.8 9 10 7.5 7.5 7.2 5.5 4.9 Gen-1 released in Q1’19 0 Ref. Platform 1-nc STs 2-nc STs 3-nc STs 4-nc STs 5-nc STs Number of nC-STs 11/18/2019 5

  6. Performance of Gen-2 nCorium OSS • Improved reconfigurable cores ~10x faster • More off-load functionality • Higher Power efficiency Ref. Platform vs nCorium G1 and nCorium G2 12.0 9.9 10.0 Normalized to Ref. Platform 8.0 7.5 7.2 Gen-2 now available 7.0 6.0 4.7 4.5 3.8 3.7 4.0 RDMA RDMA+ENC RDMA+ENC+SSD RDMA+COMP+ENC+SSD Reference 11 9 7.2 5 nCorium G1 50 34.3 26.3 23.5 2.0 1.0 1.0 1.0 1.0 nCorium G2 76.5 67.5 52 49.4 Numbers extrapolated from initial data 0.0 RDMA RDMA+ENC RDMA+ENC+SSD RDMA+COMP+ENC+SSD Reference nCorium G1 nCorium G2 11/18/2019 6

  7. Ongoing Work • Parallel file system integration • Other Applications • Lustre Key-Value Storage  • BeeGFS Video Transcoding  Inferencing  Graph Analytics   Cyber Security Content Distribution  11/18/2019 7

  8. Demos / Exhibits StarLight: Booth 993 Thank you! EchoStreams: Booth 1490 11/18/2019 8

Recommend


More recommend