openmpir
play

OpenMPIR Implementing OpenMP tasks with Tapir George Stelle 1 - PowerPoint PPT Presentation

OpenMPIR Implementing OpenMP tasks with Tapir George Stelle 1 William S. Moses 2 Patrick McCormick 1 Stephen L. Olivier 3 1 Los Alamos National Laboratory 2 MIT CSAIL 3 Sandia National Laboratories November 13, 2017 Operated by Los Alamos


  1. OpenMPIR Implementing OpenMP tasks with Tapir George Stelle 1 William S. Moses 2 Patrick McCormick 1 Stephen L. Olivier 3 1 Los Alamos National Laboratory 2 MIT CSAIL 3 Sandia National Laboratories November 13, 2017 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  2. Outline OpenMP Motivation Tapir Implementation Results Discussion Questions Slide 2 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  3. OpenMP int fib(int n){ if (n < 2) return n; else { int x, y; #pragma omp task x = fib(n-1); #pragma omp task y = fib(n-2); #pragma omp taskwait return x+y; } } Slide 3 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  4. OpenMP LLVM IR ... %19 = call i32 @__kmpc_omp_task(%ident_t* nonnull @0, i32 %5, %20 = call i8* @__kmpc_omp_task_alloc(%ident_t* nonnull @0, %21 = bitcast i8* %20 to i8** %22 = load i8*, i8** %21, align 8, !tbaa !8 ... %28 = load i32, i32* %2, align 4, !tbaa !4 store i32 %28, i32* %27, align 4, !tbaa !13 %29 = call i32 @__kmpc_omp_task(%ident_t* nonnull @0, i32 %5, %30 = call i32 @__kmpc_omp_taskwait(%ident_t* nonnull @0, i32 ... Slide 4 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  5. Slide 5 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  6. Tapir detach reattach sync Slide 6 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  7. ... if.end: detach label %det.achd, label %det.cont det.achd: %2 = load i32, i32* %n.addr, align 4 int fib(int n){ %sub = sub nsw i32 %2, 1 %call = call i32 @fib(i32 %sub) ... store i32 %call, i32* %x, align 4 #pragma omp task reattach label %det.cont x = fib(n-1); det.cont: #pragma omp task detach label %det.achd1, label %det.cont4 y = fib(n-2); det.achd1: #pragma omp taskwait %3 = load i32, i32* %n.addr, align 4 ... %sub2 = sub nsw i32 %3, 2 %call3 = call i32 @fib(i32 %sub2) } store i32 %call3, i32* %y, align 4 reattach label %det.cont4 det.cont4: sync label %sync.continue ... Slide 7 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  8. Overview Slide 8 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  9. Results Fibonacci 12 Time (Seconds, lower is better) 10 8 6 4 2 0 tapir icc clang tapir-omp Slide 9 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  10. Results FFT 7 6 Time (Seconds, lower is better) 5 4 3 2 1 0 tapir icc clang tapir-omp Slide 10 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  11. Results NQueens 50 40 Time (Seconds, lower is better) 30 20 10 0 tapir icc clang tapir-omp Slide 11 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  12. The other 355 pages? Slide 12 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  13. Questions? Slide 13 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

  14. Acknowledgements This research was supported by the Exascale Computing Project (17-SC-20-SC), a joint project of the U.S. Department of Energys Office of Science and National Nuclear Security Administration, responsible for delivering a capable exascale ecosystem, including software, applications, and hardware technology, to support the nations exascale computing imperative. Slide 14 Operated by Los Alamos National Security, LLC for the U.S. Department of Energy’s NNSA

Recommend


More recommend