ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH - PowerPoint PPT Presentation

ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH A RISC-V PROCESSOR Y.Durand, C.Fabre, A. Bocco, T. Trevisan | IMPRENUM Project | Oct 2019 | 1

USE CASES FOR (LARGE) VARIABLE PRECISION Applications Techniques & Kernels • • Computational Physics Dense/sparse linear algebra • • Solvers, eigenvalues Computational chemistry • • Numerical integration Computational statistics • RK, but not only … • Computational geometry • Monte Carlo • Spectral techniques • Large PDEs • FFT and others • Finite elements, finite • Interval arithmetics differences • ODE s • optimization Our main focus today: linear algebra solvers However, there are many other area in scientific computing where variable precision is sought Y.Durand | Oct 2019 | 2

VARIABLE PRECISION FOR SCIENTIFIC COMPUTATION JACOBI While error > tolerance augment precision while convergence not reached do Accumulation : for i := 1:n do Requires max Matrix coeffs: read-only,  =0 sparse doubles precision should be done Stay in remote memory for j := 1:n do inside the FPU if j ≠ i then (𝑙) 𝜏 += 𝑏 𝑗𝑘 𝑦 𝑘 Vector update : end • dense • Requires high precision end • should be kept in close (𝑙+1) = 1 𝑦 𝑗 𝑏 𝑗𝑗 (𝑐 𝑗 − 𝜏) memory end we need 1. extended precision operators, k=k+1 2. dedicated accumulators in registers inside end the FPU, end 3. Extended precision storage in close memory | 3

MORE IN DEPTH WITH JACOBI : EXECUTING ON THE V1 ACCELERATOR Input data, RO, in RAM, k = 0 double format (sparse) while convergence not reached do for i = 1:n do  =0 Rocket tile for j = 1:n do FPU if j ≠ i then Risc V (𝑙) $ 𝜏 += 𝑏 𝑗𝑘 𝑦 𝑘 L&S R R L1/ $ A A RoCC end L2/ L1 M M VP L3 end co-proc L&S (𝑙+1) = 1 𝑦 𝑗 𝑏 𝑗𝑗 (𝑐 𝑗 − 𝜏) Scratchpad Internal format, for end accumulation (high precision) k=k+1 Intermediate vector, end adjustable format (dense) Y.Durand | Oct 2019 | 4

VARIABLE PRECISION SYSTEM Large size registers for V.P Floating accumulation Point Unit (FPU) (eg 64 512b Standard core registers) + specialized  FPU registers scratchpad VP Specific access to memory hierarchy L1$ L1$ Large size (10s of MB) coherent close memory LLC$ Distant Shared memory Y.Durand | Oct 2019 | 5

PROGRAMMING MODEL: HARDWARE & SOFTWARE LAYERS application Domain Specific library Solver & algorithms i/f SOLVERS & VP SOLVERS & Variable precision is ALGORITHMS ALGORITHMS contained within calls to kernel Computation routines i/f (BLAS level) and Solver (LaPack level) calls Variable precision kernel kernel kernel Auxiliary support library Hardware Y.Durand | Oct 2018 | 6

RECAP: BENEFITS OF VARIABLE PRECISION • Augmenting accuracy inside the kernel reduces rounding errors  improves stability of the computation • Augmenting the mantissa during accumulation is not sufficient • Usual solution is to tweak the solver (pre-conditioning, etc.) but this is costly, hazardous and very limited • Another solution is to double precision (  quad !!) in the intermediate calculation  huge impact in memory and in calculation time • Using specialized data types (GMP, MPFR) has the same pitfalls • At even higher cost in memory • Our solution: • Variable precision, byte-aligned data format for intermediate data in memory • affordable memory footprint for intermediate data • Hardware support for variable precision in hardware co-processor • Up to 4x64 bits fractional part in internal accumulator Y.Durand | Oct 2019 | 7

PERSPECTIVES • Early investigation carried on by CEA • With support of other research projects • OPRECOMP, Imprenum, QUANTEX • First Use cases • Proof of concept = First FPGA prototype • Investigation on Compiler and library support • Mid-term Target : Proof of realization • Re-engineering with actual memory subsystem & infrastructure • Improve co-processor integration with processor • SW integration (libraries, execution model ?) • Main publications • Andrea Bocco, Yves Durand, and Florent de Dinechin. SMURF: Scalar multiple-precision unum Risc-V floating-point accelerator for scientific computing. In Conference on Next-Generation Arithmetic , March 2019 • Tiago Trevisan Jost, Andrea Bocco, Yves Durand, Christian Fabre, Florent De Dinechin, Anca Molnos, Albert Cohen:Variable Precision Capabilities in RISC-V Processors, RISC-V Workshop Zurich (June 11 – 13, 2019) • Andrea Bocco, Yves Durand, and Florent de Dinechin. Dynamic precision numerics using a variable-precision UNUM type I HW coprocessor. In 26th IEEE Symposium of Computer Arithmetic (ARITH-26) , June 2019 . Y.Durand | April 2019 | 8

ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH - PowerPoint PPT Presentation

ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH A RISC-V PROCESSOR Y.Durand, C.Fabre, A. Bocco, T. Trevisan | IMPRENUM Project | Oct 2019 | 1 USE CASES FOR (LARGE) VARIABLE PRECISION Applications Techniques & Kernels

Numberjack User Guide May 27, 2013 1 Variables Constructor for the class Variable : Constructor

Variable selection bias Bias in Ensemble Bias in Ensemble Methods Methods Variable selection

Formal Definition of Computation Formal Definition of Computation p.1/28 Computation

Variable Benefit Plans in Depth Kelly Coffing, FSA, EA, MAAA September 21, 2019 Agenda The

Measuring variable importance in random forests Variable Variable importance in RF importance

Variables in C++ The variable C++ Variables Kinds of Variables Memory storage

Variable & Value Ordering Heuristics Heuristics for backtracking algorithms Variable

Scientific report Mariusz ynel April 22, 2015 Scientific report 2 Contents 1 Scientific

The Scientific Method The Scientific Method The Scientific Method involves 6 steps: Problem

Enhancing Academic Enhancing Academic Advisement Using the Advisement Using the First-Year

SCIENCE SCIENCE Scientific Question Hypothesis Prediction Experimental Test Scientific

Scientific Programming in mpags-python.github.io Steven Bamford An introduction to scientific

DISCRETIZE: Command to Convert a Continuous Instrument into a Dummy Variable for Instrumental

DISCRETIZ: Command to Convert a Continuous Instrument into a Dummy Variable for Instrumental

Template Rendering DTL Usage Variables {{variable}} Notation: IF Variable Not Valid:

Math 2200-01 (Calculus I) Spring 2020 Book 1 - fail for example ( one input variable onion 27

Side-Channel Analysis on Blinded Regular Scalar Multiplications Benoit Feix Mylne Roussellet

A sampled sine wave 1.0 0.5

Contents 1. CARRIOCAS Challenges and Project presentation Cliquez pour modifier le style du titre

NEXTLEAP NeXt generation Technosocial and Legal Encryption Access and Privacy Co-ordinator:

AUTOMATED SOFTWARE PROTECTION FOR THE MASSES AGAINST SIDE-CHANNEL ATTACKS PHISIC 2018 |

X-Ray Magnetic Circular Dichroism: basic concepts and applications for 3d transition metals

Ins GAM Dr. Camille SALINESI Centre de Recherche en Informatique Universit Paris1

The Cost of Monotonicity in Distributed Graph Searching David Ilcinkas 1 Nicolas Nisse 2 David

ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH - PowerPoint PPT Presentation

ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH A RISC-V PROCESSOR Y.Durand, C.Fabre, A. Bocco, T. Trevisan | IMPRENUM Project | Oct 2019 | 1 USE CASES FOR (LARGE) VARIABLE PRECISION Applications Techniques & Kernels

Numberjack User Guide May 27, 2013 1 Variables Constructor for the class Variable : Constructor

Variable selection bias Bias in Ensemble Bias in Ensemble Methods Methods Variable selection

Formal Definition of Computation Formal Definition of Computation p.1/28 Computation

Variable Benefit Plans in Depth Kelly Coffing, FSA, EA, MAAA September 21, 2019 Agenda The

Measuring variable importance in random forests Variable Variable importance in RF importance

Variables in C++ The variable C++ Variables Kinds of Variables Memory storage

Variable &amp; Value Ordering Heuristics Heuristics for backtracking algorithms Variable

Scientific report Mariusz ynel April 22, 2015 Scientific report 2 Contents 1 Scientific

The Scientific Method The Scientific Method The Scientific Method involves 6 steps: Problem

Enhancing Academic Enhancing Academic Advisement Using the Advisement Using the First-Year

SCIENCE SCIENCE Scientific Question Hypothesis Prediction Experimental Test Scientific

Scientific Programming in mpags-python.github.io Steven Bamford An introduction to scientific

DISCRETIZE: Command to Convert a Continuous Instrument into a Dummy Variable for Instrumental

DISCRETIZ: Command to Convert a Continuous Instrument into a Dummy Variable for Instrumental

Template Rendering DTL Usage Variables {{variable}} Notation: IF Variable Not Valid:

Math 2200-01 (Calculus I) Spring 2020 Book 1 - fail for example ( one input variable onion 27

Side-Channel Analysis on Blinded Regular Scalar Multiplications Benoit Feix Mylne Roussellet

A sampled sine wave 1.0 0.5

Contents 1. CARRIOCAS Challenges and Project presentation Cliquez pour modifier le style du titre

NEXTLEAP NeXt generation Technosocial and Legal Encryption Access and Privacy Co-ordinator:

AUTOMATED SOFTWARE PROTECTION FOR THE MASSES AGAINST SIDE-CHANNEL ATTACKS PHISIC 2018 |

X-Ray Magnetic Circular Dichroism: basic concepts and applications for 3d transition metals

Ins GAM Dr. Camille SALINESI Centre de Recherche en Informatique Universit Paris1

The Cost of Monotonicity in Distributed Graph Searching David Ilcinkas 1 Nicolas Nisse 2 David

Variable & Value Ordering Heuristics Heuristics for backtracking algorithms Variable