Implementing autograd
Slides by Matthew Johnson
Autograd’s implementation
github.com/hips/autograd
Dougal Maclaurin, David Duvenaud, Matt Johnson
• differentiates native Python code
• handles most of Numpy + Scipy
• loops, branching, recursion, closures
• arrays, tuples, lists, dicts...
• derivatives of derivatives
• a one-function API!
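As a taste of that one-function API, here is a small usage example in the style of the autograd README; grad returns a function computing the derivative, and it can be applied again for higher derivatives.

    # A small usage example of autograd's one-function API, `grad`
    # (assumes autograd is installed: pip install autograd).
    import autograd.numpy as np   # thinly-wrapped NumPy
    from autograd import grad     # the one-function API

    def tanh(x):
        y = np.exp(-2.0 * x)
        return (1.0 - y) / (1.0 + y)

    dtanh = grad(tanh)       # a function computing d tanh / dx
    ddtanh = grad(dtanh)     # derivatives of derivatives just work
    print(dtanh(1.0), ddtanh(1.0))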
autodiff implementation options
A. direct specification of computation graph
B. source code inspection
C. monitoring function execution
ingredients:
1. tracing composition of primitive functions
2. vector-Jacobian product for each primitive
3. composing VJPs backward
[diagram: autograd.numpy.sum is a primitive wrapping numpy.sum. Called on a Node ã (value: a, parents: [x]), it unboxes the value a, calls numpy.sum on it, and boxes the result b into a new Node b̃ (value: b, function: anp.sum, parents: [ã]).]
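The diagram boils down to boxing and unboxing. Below is a minimal sketch of that idea; the names and signatures are simplified for illustration and are not autograd's actual classes.

    # Minimal sketch of tracing: a Node boxes a value together with the
    # function that produced it and its parent Nodes; `primitive` wraps a
    # raw NumPy function so it unboxes arguments, calls the original
    # function, and boxes the result.
    import numpy as np

    class Node:
        def __init__(self, value, function, parents):
            self.value = value        # the raw (unboxed) value, e.g. an ndarray
            self.function = function  # the primitive that produced this value
            self.parents = parents    # Nodes this value was computed from

    def primitive(raw_fun):
        def wrapped(*args):
            # unbox any Node arguments to their raw values
            values = [a.value if isinstance(a, Node) else a for a in args]
            parents = [a for a in args if isinstance(a, Node)]
            result = raw_fun(*values)               # call the real NumPy function
            return Node(result, wrapped, parents)   # box the result
        return wrapped

    anp_sum = primitive(np.sum)   # plays the role of autograd.numpy.sum here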
[diagram: tracing the forward pass builds a chain of nodes]
start_node: x
a = A(x)
b = B(a)
c = C(b)
y = D(c)   (end_node)
No control flow!
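Continuing the sketch above: each primitive call appends one Node, so Python control flow never appears in the trace; a loop simply unrolls into a straight chain.

    # Continuing the toy sketch: tracing sees only the primitive calls that
    # actually ran, so a Python loop unrolls into a linear chain of Nodes.
    anp_exp = primitive(np.exp)

    x = Node(np.array([1.0, 2.0, 3.0]), None, [])   # start_node
    z = x
    for _ in range(3):        # control flow is invisible to the trace
        z = anp_exp(z)        # each iteration just appends one more Node
    y = anp_sum(z)            # end_node
    # y.parents[0] is the last exp Node; following .parents leads back to x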
ingredients:
1. tracing composition of primitive functions
2. vector-Jacobian product for each primitive
3. composing VJPs backward
x, a = A(x)
given ∂y/∂a, what is ∂y/∂x?
∂y/∂x = ∂y/∂a · ∂a/∂x
vector-Jacobian product: ∂y/∂x = ∂y/∂a · A′(x)
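Concretely, a VJP maps the incoming gradient ∂y/∂a to ∂y/∂x without ever forming the Jacobian. Here are example VJPs for the two toy primitives from the sketch above (in autograd proper these are registered with defvjp).

    # Each VJP takes the incoming gradient g = dy/d(output) and the forward
    # input, and returns dy/d(input) = g * A'(x).
    def sum_vjp(g, x):
        # y = np.sum(x): every dy/dx_i equals g, broadcast to x's shape
        return g * np.ones_like(x)

    def exp_vjp(g, x):
        # a = np.exp(x): the Jacobian is diag(exp(x)), so multiply elementwise
        return g * np.exp(x)

    vjps = {anp_sum: sum_vjp, anp_exp: exp_vjp}   # primitive -> its VJP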
ingredients:
1. tracing composition of primitive functions
2. vector-Jacobian product for each primitive
3. composing VJPs backward
[diagram: the traced chain start_node x → a = A(x) → b = B(a) → c = C(b) → y = D(c) → end_node]
backward pass: start from ∂y/∂y = 1, then apply VJPs in reverse to get
∂y/∂c, then ∂y/∂b, then ∂y/∂a, and finally ∂y/∂x
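A minimal backward pass over the traced chain, composing those VJPs from the end_node back to the start_node; this is a sketch assuming the toy Node/primitive/vjps objects defined above and a single parent per node.

    # Toy backward pass (not autograd's actual implementation).
    def backward_pass(end_node, start_node):
        g = 1.0                          # dy/dy = 1
        node = end_node
        while node is not start_node:
            parent = node.parents[0]     # single-parent chain for simplicity
            vjp = vjps[node.function]    # look up this primitive's VJP
            g = vjp(g, parent.value)     # dy/dparent from dy/dnode
            node = parent
        return g                         # dy/dx at the start_node

    # e.g. gradient of y = sum(exp(x)) with respect to x:
    x = Node(np.array([0.0, 1.0]), None, [])
    y = anp_sum(anp_exp(x))
    print(backward_pass(y, x))           # elementwise exp([0.0, 1.0])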
higher-order autodiff just works: the backward pass can itself be traced
[diagram repeated: running the backward pass under the tracer builds a new graph from ∂y/∂y = 1 through ∂y/∂c, ∂y/∂b, ∂y/∂a down to ∂y/∂x, so its output can itself be differentiated]
ingredients:
1. tracing composition of primitive functions: Node, primitive, forward_pass
2. vector-Jacobian product for each primitive: defvjp
3. composing VJPs backward: backward_pass, make_vjp, grad
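Tying the toy pieces together into a grad-like wrapper; this is a sketch built from the snippets above, whereas autograd's real forward_pass, make_vjp, and grad also handle fan-out, multiple arguments, and nested containers.

    # How the pieces might fit together (toy version, not autograd's grad).
    def toy_grad(fun):
        def gradfun(x_value):
            start = Node(x_value, None, [])    # box the input (forward pass)
            end = fun(start)                   # trace the user's function
            return backward_pass(end, start)   # compose VJPs in reverse
        return gradfun

    f = lambda x: anp_sum(anp_exp(x))
    print(toy_grad(f)(np.array([0.0, 1.0])))   # same as exp([0.0, 1.0])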
what’s the point? easy to extend!
- develop autograd!
- forward mode
- log joint densities from sampler programs