Computing transcendental functions with error bounds (a progress report)
Fredrik Johansson
LFANT seminar, 2015-10-13
Overview
Recent work (last 12 months) on Arb – a C library for arbitrary-precision interval arithmetic
◮ Much faster elementary functions (published in ARITH22)
◮ Hypergeometric functions (in brief)
◮ Elliptic/modular functions (including an addendum to the joint work with Andreas Enge and William Hart, described in a previous talk)
Advertisement: http://nemocas.org/
Computer algebra package for the Julia programming language
Uses the C libraries FLINT, Antic, Pari, GMP/MPIR, MPFR, Arb.
Plus algorithms for generic rings, implemented in Julia.
William Hart, Tommy Hofmann, Claus Fieker, Oleksandr Motsak (Kaiserslautern) and FJ
Elementary functions
Functions: exp, log, sin, cos, atan
Input: floating-point number x = a · 2^b, precision p ≥ 2
Output: m, r with f(x) ∈ [m − r, m + r] and r ≈ 2^(−p) |f(x)|
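For concreteness, a minimal sketch of how this looks from user code, calling Arb's public arb_exp and arb_printn functions (the input 1.25 and the 256-bit working precision are arbitrary choices for the example):

#include "arb.h"

int main(void)
{
    slong prec = 256;               /* working precision p in bits */
    arb_t x, y;
    arb_init(x);
    arb_init(y);

    arb_set_str(x, "1.25", prec);   /* x = 1.25 (exact) */
    arb_exp(y, x, prec);            /* y = [m +/- r] enclosing exp(x) */
    arb_printn(y, 50, 0);           /* print midpoint and radius */
    flint_printf("\n");

    arb_clear(x);
    arb_clear(y);
    flint_cleanup();
    return 0;
}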
Precision ranges
Hardware precision (n ≈ 53 bits)
◮ Extensively studied - elsewhere!
Medium precision (n ≈ 100 - 10 000 bits)
◮ Multiplication costs M(n) = O(n^2) or O(n^1.6)
◮ Argument reduction + rectangular splitting: O(n^(1/3) M(n))
◮ In the lower range, software overhead is significant
Very high precision (n ≫ 10 000 bits)
◮ Multiplication costs M(n) = O(n log n log log n)
◮ Asymptotically fast algorithms: binary splitting, arithmetic-geometric mean (AGM) iteration: O(M(n) log(n))
Recipe for elementary functions
exp(x), sin(x), cos(x), log(1+x), atan(x)
↓ Domain reduction using π and log(2)
x ∈ [0, log(2)), x ∈ [0, π/4), x ∈ [0, 1), x ∈ [0, 1)
↓ Argument-halving r ≈ 8 times: exp(x) = [exp(x/2)]^2, log(1+x) = 2 log(√(1+x))
x ∈ [0, 2^(−r))
↓ Taylor series
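The argument-halving step can be illustrated in plain C double arithmetic (an illustrative sketch, not Arb's implementation; the reduction count r = 8 and the degree-6 truncated Taylor polynomial are choices made for the example):

#include <math.h>
#include <stdio.h>

/* exp(x) for x in [0, log 2): halve the argument r times, evaluate a short
   Taylor polynomial, then square the result r times. */
static double exp_by_halving(double x, int r)
{
    double y = ldexp(x, -r);      /* y = x / 2^r, so y < 2^(-r) */
    /* degree-6 Taylor polynomial of exp at 0, enough for double and small y */
    double s = 1 + y*(1 + y*(1.0/2 + y*(1.0/6 + y*(1.0/24 + y*(1.0/120 + y/720)))));
    for (int i = 0; i < r; i++)
        s *= s;                   /* exp(x) = (exp(x/2^r))^(2^r) */
    return s;
}

int main(void)
{
    double x = 0.6;
    printf("%.17g vs %.17g\n", exp_by_halving(x, 8), exp(x));
    return 0;
}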
Better recipe at medium precision
exp(x), sin(x), cos(x), log(1+x), atan(x)
↓ Domain reduction using π and log(2)
x ∈ [0, log(2)), x ∈ [0, π/4), x ∈ [0, 1), x ∈ [0, 1)
↓ Lookup table with 2^r ≈ 2^8 entries: exp(t + x) = exp(t) exp(x), log(1 + t + x) = log(1 + t) + log(1 + x/(1 + t))
x ∈ [0, 2^(−r))
↓ Taylor series
Argument reduction formulas
What we want to compute: f(x), x ∈ [0, 1)
Table size: q = 2^r
Precomputed value: f(t), t = i/q, i = ⌊2^r x⌋
Remaining value to compute: f(y), y ∈ [0, 2^(−r))
exp(x) = exp(t) exp(y),                  y = x − i/q
sin(x) = sin(t) cos(y) + cos(t) sin(y),  y = x − i/q
cos(x) = cos(t) cos(y) − sin(t) sin(y),  y = x − i/q
log(1+x) = log(1+t) + log(1+y),          y = (qx − i)/(i + q)
atan(x) = atan(t) + atan(y),             y = (qx − i)/(ix + q)
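A double-precision sketch of the table-based reduction for atan (the table size q = 256 is chosen for the example; in the library the table holds high-precision values and the reduced argument y is fed to the Taylor series rather than back to atan):

#include <math.h>
#include <stdio.h>

#define Q 256   /* table size q = 2^r with r = 8 */

static double atan_table[Q];

static void init_table(void)
{
    for (int i = 0; i < Q; i++)
        atan_table[i] = atan((double) i / Q);   /* precomputed atan(i/q) */
}

/* atan(x) for x in [0, 1) via atan(x) = atan(t) + atan(y), t = i/q,
   y = (q*x - i) / (i*x + q), so that y lies in [0, 2^(-r)). */
static double atan_reduced(double x)
{
    int i = (int) (x * Q);                      /* i = floor(2^r x) */
    double y = (Q * x - i) / (i * x + Q);
    return atan_table[i] + atan(y);             /* atan(y) would be the series */
}

int main(void)
{
    init_table();
    double x = 0.7071;
    printf("%.17g vs %.17g\n", atan_reduced(x), atan(x));
    return 0;
}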
Optimizing lookup tables
m = 2 tables with 2^5 + 2^5 entries gives the same reduction as m = 1 table with 2^10 entries

Function   Precision   m   r   Entries   Size (KiB)
exp        ≤ 512       1   8   178       11.125
exp        ≤ 4608      2   5   23+32     30.9375
sin        ≤ 512       1   8   203       12.6875
sin        ≤ 4608      2   5   26+32     32.625
cos        ≤ 512       1   8   203       12.6875
cos        ≤ 4608      2   5   26+32     32.625
log        ≤ 512       2   7   128+128   16
log        ≤ 4608      2   5   32+32     36
atan       ≤ 512       1   8   256       16
atan       ≤ 4608      2   5   32+32     36
Total                                    236.6875
Taylor series
Logarithmic series:
atan(x) = x − x^3/3 + x^5/5 − x^7/7 + ...
log(1+x) = 2 atanh(x/(x+2))
With x < 2^(−10), need 230 terms for 4600-bit precision
Exponential series:
exp(x) = 1 + x + x^2/2! + x^3/3! + x^4/4! + ...
sin(x) = x − x^3/3! + x^5/5! − ...,  cos(x) = 1 − x^2/2! + x^4/4! − ...
With x < 2^(−10), need 280 terms for 4600-bit precision
Above 300 bits: cos(x) = √(1 − sin^2(x))
Above 800 bits: exp(x) = sinh(x) + √(1 + sinh^2(x))
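The term counts can be checked with a short calculation: with x < 2^(−r), the k-th term of the atanh-type series is below 2^(−r(2k+1)) and the n-th term of the exponential series is below 2^(−rn)/n!. A sketch (the target p = 4600 and reduction r = 10 match the slide; the exact counts depend on rounding details):

#include <math.h>
#include <stdio.h>

int main(void)
{
    int p = 4600, r = 10;

    /* atanh-type series: odd powers only, term k is about 2^(-r(2k+1)) */
    int k = 0;
    while (r * (2 * k + 1) < p)
        k++;
    printf("logarithmic series: %d terms\n", k);       /* about 230 */

    /* exponential series: term n is about 2^(-r n) / n!,
       so count bits gained as r*n + log2(n!) */
    int n = 0;
    double log2_fact = 0.0;
    while (r * n + log2_fact < p)
    {
        n++;
        log2_fact += log2((double) n);
    }
    printf("exponential series: %d terms\n", n);       /* about 280 */
    return 0;
}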
Evaluating Taylor series using rectangular splitting
Paterson and Stockmeyer, 1973: evaluate Σ_{i=0}^{n} c_i x^i in O(n) cheap steps + O(n^(1/2)) expensive steps
       (c_0  + c_1 x  + c_2 x^2  + c_3 x^3)  +
x^4    (c_4  + c_5 x  + c_6 x^2  + c_7 x^3)  +
x^8    (c_8  + c_9 x  + c_10 x^2 + c_11 x^3) +
x^12   (c_12 + c_13 x + c_14 x^2 + c_15 x^3)
◮ Smith, 1989: elementary and hypergeometric functions
◮ Brent & Zimmermann, 2010: improvements to Smith
◮ FJ, 2014: generalization to D-finite functions
◮ New: optimized algorithm for elementary functions
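A sketch of the splitting in plain C with double coefficients (in the library the coefficients and x are multi-word fixed-point numbers, so the inner scalar multiplications and additions are the cheap steps and only the powers of x and the outer Horner steps cost full multiplications):

#include <math.h>
#include <stdio.h>

/* Evaluate sum_{i=0}^{len-1} c[i] x^i with block size m (Paterson-Stockmeyer):
   only about len/m + m full-width multiplications are needed. */
static double rect_split_eval(const double *c, int len, double x, int m)
{
    double xpow[m + 1];
    xpow[0] = 1.0;
    for (int j = 1; j <= m; j++)
        xpow[j] = xpow[j - 1] * x;          /* x, x^2, ..., x^m */

    double s = 0.0;
    for (int i = ((len - 1) / m) * m; i >= 0; i -= m)   /* blocks from the top */
    {
        double block = 0.0;
        for (int j = 0; j < m && i + j < len; j++)
            block += c[i + j] * xpow[j];    /* "cheap" scalar operations */
        s = s * xpow[m] + block;            /* "expensive" full multiplication */
    }
    return s;
}

int main(void)
{
    /* coefficients of exp(x) truncated to degree 15 */
    double c[16], f = 1.0;
    for (int i = 0; i < 16; i++) { c[i] = 1.0 / f; f *= (i + 1); }
    printf("%.17g vs %.17g\n", rect_split_eval(c, 16, 0.25, 4), exp(0.25));
    return 0;
}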
Logarithmic series
Rectangular splitting:
x + (1/2)x^2 + x^3 ( (1/3 + (1/4)x + (1/5)x^2) + x^3 (1/6 + (1/7)x + (1/8)x^2) )
Improved algorithm with fewer divisions:
x + (1/60) ( 30x^2 + x^3 ( 20 + 15x + 12x^2 + x^3 ( 10 + (60/56)(8x + 7x^2) ) ) )
Exponential series
Rectangular splitting:
1 + x + (1/2)x^2 + (1/3!)x^3 ( 1 + (1/4)x + (1/(4·5))x^2 + (1/(4·5·6))x^3 ( 1 + (1/7)x + (1/(7·8))x^2 ) )
Improved algorithm with fewer divisions:
1 + x + (1/24) ( 12x^2 + x^3 ( 4 + x + (1/30) ( 6x^2 + x^3 ( 1 + (1/56)(8x + x^2) ) ) ) )
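A quick numerical sanity check of the low-division form against the plain truncated series (double precision; this only verifies the algebraic regrouping, the real point of the rewriting is that most high-precision divisions become cheap operations on small integers):

#include <stdio.h>

int main(void)
{
    double x = 0.01;

    /* plain truncated exponential series, terms up to x^8 */
    double plain = 0.0, term = 1.0;
    for (int n = 0; n <= 8; n++) { plain += term; term *= x / (n + 1); }

    /* same polynomial, regrouped so that only divisions by 24, 30, 56 remain */
    double grouped = 1 + x + (1.0/24) * (12*x*x + x*x*x * (4 + x
                     + (1.0/30) * (6*x*x + x*x*x * (1 + (1.0/56) * (8*x + x*x)))));

    printf("%.17g\n%.17g\n", plain, grouped);   /* should agree to double precision */
    return 0;
}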
Taylor series evaluation using mpn arithmetic
We use n-word fixed-point numbers (ulp = 2^(−64n))
Negative numbers implicitly or using two's complement!
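A sketch of the basic fixed-point multiplication on top of GMP's low-level mpn layer, assuming 64-bit limbs (the fixed_mul name and the plain truncating strategy are just for illustration; the real code also has to account for the ulp error this introduces):

#include <gmp.h>

/* Fixed-point numbers: n limbs, value = (integer formed by the limbs) * 2^(-64n),
   so all values lie in [0, 1). Truncating product: keep the high n limbs of the
   2n-limb integer product, discarding less than 1 ulp. */
static void fixed_mul(mp_limb_t *res, const mp_limb_t *a, const mp_limb_t *b,
                      mp_size_t n)
{
    mp_limb_t tmp[2 * n];       /* full 2n-limb product (VLA for brevity) */
    mpn_mul_n(tmp, a, b, n);
    mpn_copyi(res, tmp + n, n); /* high n limbs = truncated fixed-point product */
}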