How to Write Fast Numerical Code Spring 2011 Lecture 21 Instructor: - PowerPoint PPT Presentation

How to Write Fast Numerical Code Spring 2011 Lecture 21 Instructor: Markus Püschel TA: Georg Ofenbeck

Schedule Today Lecture Project presentations • 10 minutes each • random order • random speaker Final project paper and code due: Friday, June 10 th

FFT References Complexity: Bürgisser, Clausen, Shokrollahi, Algebraic Complexity Theory , Springer, 1997  History: Heideman, Johnson, Burrus: Gauss and the History of the Fast Fourier Transform , Arch.  Hist. Sc. 34(3) 1985 FFTs:   Cooley and Tukey, An algorithm for the machine calculation of complex Fourier series ,” Math. of Computation, vol. 19, pp. 297 – 301, 1965  Nussbaumer, Fast Fourier Transform and Convolution Algorithms , 2nd ed., Springer, 1982  van Loan, Computational Frameworks for the Fast Fourier Transform , SIAM, 1992  Tolimieri, An, Lu, Algorithms for Discrete Fourier Transforms and Convolution , Springer, 2nd edition, 1997  Franchetti, Püschel, Voronenko, Chellappa and Moura, Discrete Fourier Transform on Multicore , IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009 FFTW: www.fftw.org   Frigo and Johnson, FFTW: An Adaptive Software Architecture for the FFT , Proc. ICASSP, vol. 3, pp. 1381-1384  M. Frigo, A fast Fourier transform compiler , in Proc. PLDI, 1999

Discrete Fourier Transform Defined for all sizes n:  y = DFT n x ! n = e ¡ 2 ¼i=n DFT n = [ ! k` n ] 0 · k;`<n ;

Complexity of the DFT Measure: L c , 2 ≤ c   Complex adds count 1  Complex mult by a constant a with |a| < c counts 1  L 2 is strictest, L ∞ the loosest (and most natural) Upper bounds:   n = 2 k : L 2 (DFT n ) ≤ 3/2 n log 2 (n) (using Cooley-Tukey FFT)  General n: L 2 (DFT n ) ≤ 8 n log 2 (n) (needs Bluestein FFT) Lower bound:   Theorem by Morgenstern: If c < ∞, then L c (DFT n ) ≥ ½ n log c (n)  Implies: in the measure L c , the DFT is Θ (n log(n))

History of FFTs The advent of digital signal processing is often attributed to the FFT  (Cooley-Tukey 1965) History:   Around 1805: FFT discovered by Gauss [1] (Fourier publishes the concept of Fourier analysis in 1807!)  1965: Rediscovered by Cooley-Tukey [1]: Heideman, Johnson, Burrus : “Gauss and the History of the Fast Fourier Transform” Arch. Hist. Sc. 34(3) 1985

Carl-Friedrich Gauss 1777 - 1855 Contender for the greatest mathematician of all times  Some contributions: Modular arithmetic, least square analysis, normal  distribution, fundamental theorem of algebra, Gauss elimination, Gauss quadrature, Gauss-Seidel, non-euclidean geometry, …

Example FFT, n = 4 Fast Fourier transform (FFT) 2 3 2 3 2 3 2 3 2 3 1 1 1 1 1 ¢ 1 ¢ 1 ¢ ¢ ¢ 1 1 ¢ ¢ 1 ¢ ¢ ¢ 6 7 6 7 6 7 6 7 6 7 1 i ¡ 1 ¡ i ¢ 1 ¢ 1 ¢ 1 ¢ ¢ 1 ¡ 1 ¢ ¢ ¢ ¢ 1 ¢ 6 7 6 7 6 7 6 7 6 7 5 x = 5 x 6 7 6 7 6 7 6 7 6 7 1 ¡ 1 1 ¡ 1 1 ¢ ¡ 1 ¢ ¢ ¢ 1 ¢ ¢ ¢ 1 1 ¢ 1 ¢ ¢ 4 4 5 4 5 4 5 4 1 ¡ i ¡ 1 ¢ 1 ¢ ¡ 1 ¢ ¢ ¢ ¢ ¢ 1 ¡ 1 ¢ ¢ ¢ 1 i i Representation using matrix algebra Data flow graph

Example FFT, n = 16 (Recursive, Radix 4)

FFTs Recursive, general radix, decimation-in-time/decimation-in-frequency  radix = k I m ) T km DFT m ) L km DFT km = ( DFT k - m (I k - k = L km DFT m ) T km DFT km m (I k - m ( DFT k - I m ) Iterative, radix 2, decimation-in-time/decimation-in-frequency  0 1 t Y T 2 t ¡ j +1 A ¢ R 2 t @ DFT 2 t = (I 2 j ¡ 1 - DFT 2 - I 2 t ¡ j ) ¢ (I 2 j ¡ 1 - ) 2 t ¡ j j =1 0 1 t Y T 2 j @ A DFT 2 t = R 2 t ¢ (I 2 t ¡ j - 2 j ¡ 1 ) ¢ (I 2 t ¡ j - DFT 2 - I 2 j ¡ 1 ) j =1

Radix 2, recursive Radix 2, iterative

Recursive vs. Iterative Iterative FFT computes in stages of butterflies =  log 2 (n) passes through the data Recursive FFT reduces passes through data =  better locality Same computation graph but different topological sorting  Rough analogy:  MMM DFT Triple loop Iterative FFT Blocked Recursive FFT

Fast Implementation (≈ FFTW 2.x) Choice of algorithm  Locality optimization  Constants  Fast basic blocks  Adaptivity  Blackboard 

How to Write Fast Numerical Code Spring 2011 Lecture 21 Instructor: - PowerPoint PPT Presentation

How to Write Fast Numerical Code Spring 2011 Lecture 21 Instructor: Markus Pschel TA: Georg Ofenbeck Schedule Today Lecture Project presentations 10 minutes each random order random speaker Final project paper and code due:

Write Through No Write Allocate Cache Write Reference Check tag and index Yes Tag AND

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

How to Write Fast Numerical Code Spring 2011 Lecture 22 Instructor: Markus Pschel TA: Georg

Read Write Inc. Phonics Parents Meeting Who is Read Write Inc. Phonics for? Read Write Inc.

How to Write Fast Numerical Code Spring 2011 Lecture 13 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 5 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 15 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 7 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 8 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 12 Instructor: Markus Pschel TA: Georg

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Numerical Differentiation & Integration Composite Numerical Integration I Numerical Analysis

Numerical Differentiation & Integration Numerical Differentiation I Numerical Analysis (9th

Numerical Semigroup Algebra Joint with Kee, Mee-Kyoung International meeting on numerical

Obstacles in Numerical Calculations Erik Schnetter Paris, November 2006 Obstacles in Numerical

CS 294-73 Software Engineering for Scientific Computing Lecture 11: Fourier

Concepts and Algorithms of Scientific and Visual Computing Fast Fourier Transform CS448J,

The Fast Fourier Transform (FFT) A top 10 Algorithm* Rubin H Landau Sally Haerer,

Polynomials and Fast Fourier Transform (FFT) Polynomials n-1 a i x i a polynomial of degree n-1

5.6 Convolution and FFT Fast Fourier Transform: Applications Applications. Optics,

Lecture 23: Fourier Transform, Convolution Theorem, and Linear Dynamical Systems April 28, 2016.

From Fourier to Koopman Spectral Methods for Long-term Time Series Prediction arXiv:2004.00574

Fast Fourier Transform Integer multiplication Multiplying two n-bit integers A and B: Grade

How to Write Fast Numerical Code Spring 2011 Lecture 21 Instructor: - PowerPoint PPT Presentation

How to Write Fast Numerical Code Spring 2011 Lecture 21 Instructor: Markus Pschel TA: Georg Ofenbeck Schedule Today Lecture Project presentations 10 minutes each random order random speaker Final project paper and code due:

Write Through No Write Allocate Cache Write Reference Check tag and index Yes Tag AND

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

How to Write Fast Numerical Code Spring 2011 Lecture 22 Instructor: Markus Pschel TA: Georg

Read Write Inc. Phonics Parents Meeting Who is Read Write Inc. Phonics for? Read Write Inc.

How to Write Fast Numerical Code Spring 2011 Lecture 13 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 5 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 15 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 7 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 8 Instructor: Markus Pschel TA: Georg

How to Write Fast Numerical Code Spring 2011 Lecture 12 Instructor: Markus Pschel TA: Georg

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Numerical Differentiation &amp; Integration Composite Numerical Integration I Numerical Analysis

Numerical Differentiation &amp; Integration Numerical Differentiation I Numerical Analysis (9th

Numerical Semigroup Algebra Joint with Kee, Mee-Kyoung International meeting on numerical

Obstacles in Numerical Calculations Erik Schnetter Paris, November 2006 Obstacles in Numerical

CS 294-73 Software Engineering for Scientific Computing Lecture 11: Fourier

Concepts and Algorithms of Scientific and Visual Computing Fast Fourier Transform CS448J,

The Fast Fourier Transform (FFT) A top 10 Algorithm* Rubin H Landau Sally Haerer,

Polynomials and Fast Fourier Transform (FFT) Polynomials n-1 a i x i a polynomial of degree n-1

5.6 Convolution and FFT Fast Fourier Transform: Applications Applications. Optics,

Lecture 23: Fourier Transform, Convolution Theorem, and Linear Dynamical Systems April 28, 2016.

From Fourier to Koopman Spectral Methods for Long-term Time Series Prediction arXiv:2004.00574

Fast Fourier Transform Integer multiplication Multiplying two n-bit integers A and B: Grade

Numerical Differentiation & Integration Composite Numerical Integration I Numerical Analysis

Numerical Differentiation & Integration Numerical Differentiation I Numerical Analysis (9th