Simon Pabst (Double Negative VFX) Talk Overview 1. The need in - PowerPoint PPT Presentation

Jeff Clifford (Double Negative VFX) Lukáš Polok (Brno University of Technology) Simon Pabst (Double Negative VFX)

Talk Overview 1. The need in production (Jeff) 2. The algorithm on the GPU ( Lukáš ) 3. Integration into DNeg’s pipeline (Simon)

About DNeg • Started in 1998 with a team of 30 people. Now 1250 people approx. • Latest film work was Interstellar ฀ Offices in London, Singapore & Vancouver ฀ R&D challenges have changed ฀ Unique challenges for handling of on-set data appropriate for GPU

IMPART • Intelligent Management Platform for Advanced Real-time Media Processes • EU Research Project • Two Industrial Partners • Four Universities

On-set Data Capture • Data captured on-set vital for digital feature film post production • Reference Photos, HDRIs, Panoramas, LIDAR, GPS, witness cameras, … • One use-case: Photogrammetry • FF6 required 8 hours to process on CPU • IMPART provided opportunity to accelerate that as a POC initially in OpenCL • Latest CUDA prototype means we can process same data in 1h on a laptop • Allows for processing of material on-set!

Bundle Adjustment (BA) • 3D reconstruction from stills (N cameras) • Optimization problem, solvable using MLE • Strives to reduce reprojection errors (in 2D) • Related problems in computer vision • Subtly different from SfM (one camera) • Different from SLAM (reduces errors in 3D)

Bundle Adjustment as a Graph • Vertices: • 3D point positions • Camera poses • Camera parameters • Edges: • 3D point observations • Any other constraints

Graph Representation c 0 c 1 c 2 c 3 p 1 p 2 p 3 p 4 p 5 p 6 p 7 • Represented by a sparse matrix • edges Incidence (Jacobian) matrix A • Adjacency ( Hessian ) matrix Λ • Has a block structure vertices p 1 p 2 p 3 p 4 p 5 p 6 p 7 vertices c 0 c 1 c 2 c 3

Variable Block Structure • Size of blocks in a single matrix • Decompose camera blocks [Jeong12] • Solved on a GPU [Rennich12, Tawara12] • Variable block size schemes • Known at compile-time [Polok13] • Applies to GPUs as well Yekeun Jeong et. al., „Pushing the Envelope of Modern Methods for Bundle Adjustment,“ PAMI, 2012 Steve Rennich , „ Leveraging Matrix Block Structure In Sparse Matrix- Vector Multiplication,“ talk on GTC 2012 Tetsuo Tawara , „ Levenberg- Marquardt Using Block Sparse Matrices on CUDA,“ talk on GTC 2012 Lukas Polok et. al., "Cache efficient implementation for block matrix operations," HPC, 2013

Solving Bundle Adjustment • while 1 (Damped) Gauss-Newton methods • Repeatedly solve for build linearized system ( Λ , r) • Serial direct methods [Kummerle11, Kaess11] solve u = Λ / r • Serial sparse factorization, backsubstitution if norm(u) < thresh • Or parallel gradient descent [Wu2013] done • Easy to implement, less numerically robust update x = x Θ u + • Implemented a parallel direct solver Kummerle , Rainer, et al., „g2o: A general framework for graph optimization," ICRA, 2011 Kaess , Michael, et al. „iSAM2: Incremental smoothing and mapping using the Bayes tree,“ IJRR, 2011 Wu, Changchang . „Towards linear -time incremental structure from motion," 3DV, 2013

Solving Bundle Adjustment Quickly • A bipartite graph: 3D points not interrelated • Can use Schur complement • Maps well to GPU • Parallel matrix multiplication [Polok15] • Parallel factorization of reduced camera system • Can be nested • Can use maximum independent set for explicit ordering Lukas Polok et. al., „Fast Sparse Matrix Multiplication on GPU," to appear at HPC, 2015

Solving Time Breakdown all in double precision

Matrix Factorization Time Comparison 5226 x 5226, 40.06% dense

Matrix Multiplication Time Comparison

Fast Matrix Multiplication in SW BlockMatrix A, B, C, D; // lambda sections typedef TypeList(Size<6, 3>, Size<5, 3>) BS; typedef TransposeSizes<BS>::Result BS_T; typedef TypeList(Size<3, 3>) D_invS; // block sizes specifications BlockMatrix BD_inv, SC; // the results BD_inv = SpDGEMM<BS, D_invS>(B, D_invS); // calculate BD -1 SC = SpDGEMM<BS, BS_T>(BD_inv, C); // calculate BD -1 C Lukas Polok et. al., "Cache efficient implementation for block matrix operations," HPC, 2013

Fast Matrix Multiplication in HW • ESC algorithm [Dalton13, Polok15] • Expansion • Sorting • Compression Steven Dalton et. al., "Optimizing sparse matrix-matrix multiplication for the GPU," 2013 Lukas Polok et. al., „Fast Sparse Matrix Multiplication on GPU," to appear at HPC, 2015

Fast Matrix Multiplication in HW • ESC algorithm [Dalton13, Polok15] • Expansion • Sorting • Compression • 480 MFLOP/s (0.0336%) • Blocks to the rescue! Steven Dalton et. al., "Optimizing sparse matrix-matrix multiplication for the GPU," 2013 Lukas Polok et. al., „Fast Sparse Matrix Multiplication on GPU," to appear at HPC, 2015

Block Matrix Multiplication Time

Estimating 3D reconstruction errors • Important for practical use on-set • Involves system matrix inverse (fully dense!)

Estimating 3D reconstruction errors Can calculate parts of the inverse [Björck96] Difficult to parallelize A. Björck , „Numerical methods for least squares problems,“ SIAM, 1996

Estimating 3D reconstruction errors Can update it incrementally very fast! [Ila15] Viorela Ila et. al, „Fast Covariance Recovery in Incremental Nonlinear Least Square Solvers“, to appear at ICRA, 2015

Jigsaw • DNeg’s in-house tool to ingest and process data captured on-set • Handles photos, LIDAR, witness cameras, HDRIs, … • Can dispatch processing jobs to the farm or locally (on-set) • Easy to extend

Questions ? DNeg is hiring!!! Join our teams in London, Singapore and Vancouver (event next week!)

Simon Pabst (Double Negative VFX) Talk Overview 1. The need in - PowerPoint PPT Presentation

Jeff Clifford (Double Negative VFX) Luk Polok (Brno University of Technology) Simon Pabst (Double Negative VFX) Talk Overview 1. The need in production (Jeff) 2. The algorithm on the GPU ( Luk ) 3. Integration into DNegs pipeline

VFX What jobs are needed to make VFX? The learning objective To understand the wide variety

Warm Welcome Matrix SETU VFX Multi-port VoIP-FXO-FXS Gateways SETU VFX Presentation Overview

A GPU-Accelerated Node Based Framework for Hair Simulation and Rendering Francesco Giordana

More Java Graphics Shape Classes: Face Check out Faces from SVN Finish Java Graphics: text and

CG & VFX 4 th bangalore ANIMAYA shorts film 2015 festival AWARDS back to main VES

Names Quattro S Double A Double S Double C Triple C Quattro C Variations All Boxer models

Double Chooz Experiment Status Double Chooz Experiment Status Jelena Maricic, Drexel University

The Negative Marker in Romanian Negative Concord Gianina Iord achioaia Seminar f ur

Standards Based Grading In Language Arts Alyssa Jones (KJ), Rebecca Pabst (PJ), & Alyssa

TAILORING YOUR DOUBLE OPT-IN EMAILS 14 February 2018 AGENDA DOUBLE OPT-IN MECHANISM 1. 2.

Double, Multiple, and Sequential Sampling Double-sampling In a double-sampling plan, a first

Double Entry System Session 04 Session Outline Double Entry System Normal Account

Double Dose of Double-Negation Translations Olivier Hermant CRI, MINES ParisTech June 2, 2014

Annual General Meeting 27 June 2019 Simon Thompson Chairman Todays agenda Simon Thompson

Negative Equity Taxation and Disclosure October 19, 2009 Negative Equity History: State of Ohio

Signed numbers Goals unsigned numbers - non-negative integers signed numbers - positive/negative

An approach to limit states in An approach to limit states in advanced materials advanced

High-precision lane-level road map building for vehicle navigation Anning Chen Jay A. Farrell

SPEEDING UP SIMULATIONS OF STELLAR EXPLOSIONS Tom Papatheodore, May 9, 2017 OAK RIDGE LEADERSHIP

Comparative study of the use of C 1 continuous finite Conference on Computational Mechanics,

Comparison of Local Feature Descriptors Subhransu Maji Department of EECS, University of

Ipopt Tutorial Andreas W achter IBM T.J. Watson Research Center andreasw@watson.ibm.com

Relevance Vector Machines Jukka Lankinen LUT February 21, 2011 Jukka Lankinen Relevance Vector

Download the notebook for this section from the CS109 repo or here: http://bit.ly/109_S6 1

Sambuz

Useful Links

Newsletter

Mail Us

Simon Pabst (Double Negative VFX) Talk Overview 1. The need in - PowerPoint PPT Presentation

Jeff Clifford (Double Negative VFX) Luk Polok (Brno University of Technology) Simon Pabst (Double Negative VFX) Talk Overview 1. The need in production (Jeff) 2. The algorithm on the GPU ( Luk ) 3. Integration into DNegs pipeline

VFX What jobs are needed to make VFX? The learning objective To understand the wide variety

Warm Welcome Matrix SETU VFX Multi-port VoIP-FXO-FXS Gateways SETU VFX Presentation Overview

A GPU-Accelerated Node Based Framework for Hair Simulation and Rendering Francesco Giordana

More Java Graphics Shape Classes: Face Check out Faces from SVN Finish Java Graphics: text and

CG &amp; VFX 4 th bangalore ANIMAYA shorts film 2015 festival AWARDS back to main VES

Names Quattro S Double A Double S Double C Triple C Quattro C Variations All Boxer models

Double Chooz Experiment Status Double Chooz Experiment Status Jelena Maricic, Drexel University

The Negative Marker in Romanian Negative Concord Gianina Iord achioaia Seminar f ur

Standards Based Grading In Language Arts Alyssa Jones (KJ), Rebecca Pabst (PJ), &amp; Alyssa

TAILORING YOUR DOUBLE OPT-IN EMAILS 14 February 2018 AGENDA DOUBLE OPT-IN MECHANISM 1. 2.

Double, Multiple, and Sequential Sampling Double-sampling In a double-sampling plan, a first

Double Entry System Session 04 Session Outline Double Entry System Normal Account

Double Dose of Double-Negation Translations Olivier Hermant CRI, MINES ParisTech June 2, 2014

Annual General Meeting 27 June 2019 Simon Thompson Chairman Todays agenda Simon Thompson

Negative Equity Taxation and Disclosure October 19, 2009 Negative Equity History: State of Ohio

Signed numbers Goals unsigned numbers - non-negative integers signed numbers - positive/negative

An approach to limit states in An approach to limit states in advanced materials advanced

High-precision lane-level road map building for vehicle navigation Anning Chen Jay A. Farrell

SPEEDING UP SIMULATIONS OF STELLAR EXPLOSIONS Tom Papatheodore, May 9, 2017 OAK RIDGE LEADERSHIP

Comparative study of the use of C 1 continuous finite Conference on Computational Mechanics,

Comparison of Local Feature Descriptors Subhransu Maji Department of EECS, University of

Ipopt Tutorial Andreas W achter IBM T.J. Watson Research Center andreasw@watson.ibm.com

Relevance Vector Machines Jukka Lankinen LUT February 21, 2011 Jukka Lankinen Relevance Vector

Download the notebook for this section from the CS109 repo or here: http://bit.ly/109_S6 1

Sambuz

Useful Links

Newsletter

Mail Us

CG & VFX 4 th bangalore ANIMAYA shorts film 2015 festival AWARDS back to main VES

Standards Based Grading In Language Arts Alyssa Jones (KJ), Rebecca Pabst (PJ), & Alyssa