Computational Reproducibility in Production Physics Applications - PowerPoint PPT Presentation

Computational Reproducibility in Production Physics Applications Numerical Reproducibility at Exascale Workshop Supercomputing 2015 November 20, 2015 Robert W. Robey Los Alamos National Laboratory LA-UR-15-28798 UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

The Problem • Finite precision arithmetic is not associative • Parallel global sums are non-reproducible on different numbers of processors – Hides programming errors – Can’t demonstrate that implementation conserves mass, etc. which means it is not verified and may not have the robustness properties guaranteed by the Lax-Wendroff theorem UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Importance at Exascale • Predictive simulation requires improved quality of simulations • New hardware with vectors and threads exacerbates the problem • As size of calculations increase, the global sum error increases proportionally UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Test Problem • Leblanc’s problem also known as shock tube from hell – 1.0e9 dynamic range in data – Compute sum and compare with correct sum calculated analytically UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Problem grows with size UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

The Insight • Reproducible global sums thought to require summation in a fixed order, but • It can also be addressed by enhancing precision because regular addition is associative => Can use both enhanced precision and order to reduce precision loss UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Possible Solution Components • Enhanced precision techniques – Kahan sum – accumulates error on one term – Knuth sum – accumulates error on both terms – Quadtype • Pair-wise summation • Precision truncation • MPI enhanced precision sum (covered in previous talks/papers) UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

The Results http://www.github.com/losalamos/GlobalSums Method Error Run-time (msecs) Double -1.99e-09 0.116 Double w/truncation 0.0 0.120 Long Double -1.31e-13 0.118 Long Double w/truncation 0.0 0.116 Kahan Sum 0.0 0.406 Knuth Sum 0.0 0.704 Pair-wise Sum 0.0 0.402 Quad Double 5.55e-17 3.010 Full Quad Double -4.81e-27 2.454 OpenMP double 2.465e-10 0.048 OpenMP Kahan 1.39e-16 0.063 UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Surprising Application • Automatic fault recovery in a shallow- water code tracks the mass conservation and automatically restarts if it changes by more than a small amount. The quality of the global mass sum needs to be high to avoid false positives. UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Open Source Playground http://www.github.com/losalamos/GlobalSums Apache 2 license – only restriction is to cite the use UNCLASSIFIED Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Computational Reproducibility in Production Physics Applications - PowerPoint PPT Presentation

Slide 1 Computational Reproducibility in Production Physics Applications Numerical Reproducibility at Exascale Workshop Supercomputing 2015 November 20, 2015 Robert W. Robey Los Alamos National Laboratory LA-UR-15-28798 UNCLASSIFIED

Computational Reproducibility Daniel S. Katz Jennifer Freeman Smith Computational

Research Reproducibility in Computational Social Science Aek Palakorn Achananuparp, SMU Research

Computational Physics What is Computational Physics? Basic Computer Hardware Operating Systems

Reproducibility "[An article about computational science in a scientific publication is

Reproducibility as a Community Effort Lessons from the Madagascar Project Sergey Fomel Jackson

Implementing reproducibility in phonetic research: a computational workfmow Stefano Coretta

New NIH requirements regarding Rigor and Reproducibility

Reproducibility & Generalizability @ Twitter Strengthening Reproducibility in Network Science

Reproducibility and Big (Omics) Data Nuno Bandeira, Ph.D. Associate Professor Dept. Computer

R and Reproducibility A Proposal David Smith Revolu0on

B: Data Reproducibility What are we doing in Singapore, Tim White and what should journals be

Everware - lowering reproducibility barriers Andrey Ustyuzhanin Yandex School of Data Analysis

A Computational Model of Natural Language Communication Interpretation, Inference, and Production

Rigor, Reproducibility, and Transparency David T. Redden, PhD Co-Director, CCTS BERD Chair,

"[An article about computational science in a scientific publication is not the

Reproducibility: failures & futures David A. C. Beck Chemical Engineering & eScience

Experiment Reproducibility in Planetlab RP 1.1 Project Presentation Sudesh Jethoe Experiment

REPRODUCIBILITY IN COMPUTER VISION: TOWARDS OPEN PUBLICATION OF IMAGE ANALYSIS EXPERIMENTS AS

Computational plasma physics extending legacy codes, computing functionals and other ideas

Computational challenges in experimental mathematics David H. Bailey http://www.davidhbailey.com

Physics and phase transitions in parallel computational complexity Jon Machta University of

Integrated computational physics and numerical optimization Matthew J. Zahr Luis W. Alvarez

Investigation on Diboson Production Ye Li Graduate Student UW - Madison Diboson physics at

67 Cu Production in Gallium Cu Production in Gallium 67 George Kharashvili Radiation

Computational Reproducibility in Production Physics Applications - PowerPoint PPT Presentation

Slide 1 Computational Reproducibility in Production Physics Applications Numerical Reproducibility at Exascale Workshop Supercomputing 2015 November 20, 2015 Robert W. Robey Los Alamos National Laboratory LA-UR-15-28798 UNCLASSIFIED

Computational Reproducibility Daniel S. Katz Jennifer Freeman Smith Computational

Research Reproducibility in Computational Social Science Aek Palakorn Achananuparp, SMU Research

Computational Physics What is Computational Physics? Basic Computer Hardware Operating Systems

Reproducibility &quot;[An article about computational science in a scientific publication is

Reproducibility as a Community Effort Lessons from the Madagascar Project Sergey Fomel Jackson

Implementing reproducibility in phonetic research: a computational workfmow Stefano Coretta

New NIH requirements regarding Rigor and Reproducibility

Reproducibility &amp; Generalizability @ Twitter Strengthening Reproducibility in Network Science

Reproducibility and Big (Omics) Data Nuno Bandeira, Ph.D. Associate Professor Dept. Computer

R and Reproducibility A Proposal David Smith Revolu0on

B: Data Reproducibility What are we doing in Singapore, Tim White and what should journals be

Everware - lowering reproducibility barriers Andrey Ustyuzhanin Yandex School of Data Analysis

A Computational Model of Natural Language Communication Interpretation, Inference, and Production

Rigor, Reproducibility, and Transparency David T. Redden, PhD Co-Director, CCTS BERD Chair,

&quot;[An article about computational science in a scientific publication is not the

Reproducibility: failures &amp; futures David A. C. Beck Chemical Engineering &amp; eScience

Experiment Reproducibility in Planetlab RP 1.1 Project Presentation Sudesh Jethoe Experiment

REPRODUCIBILITY IN COMPUTER VISION: TOWARDS OPEN PUBLICATION OF IMAGE ANALYSIS EXPERIMENTS AS

Computational plasma physics extending legacy codes, computing functionals and other ideas

Computational challenges in experimental mathematics David H. Bailey http://www.davidhbailey.com

Physics and phase transitions in parallel computational complexity Jon Machta University of

Integrated computational physics and numerical optimization Matthew J. Zahr Luis W. Alvarez

Investigation on Diboson Production Ye Li Graduate Student UW - Madison Diboson physics at

67 Cu Production in Gallium Cu Production in Gallium 67 George Kharashvili Radiation

Reproducibility "[An article about computational science in a scientific publication is

Reproducibility & Generalizability @ Twitter Strengthening Reproducibility in Network Science

"[An article about computational science in a scientific publication is not the

Reproducibility: failures & futures David A. C. Beck Chemical Engineering & eScience