Algorithm-Hardware Co-design for Deformable Convolution Qijing Huang - PowerPoint PPT Presentation

Algorithm-Hardware Co-design for Deformable Convolution
Qijing Huang*, Dequan Wang*, Yizhao Gao†, Yaohui Cai‡, Zhen Dong, Bichen Wu, Kurt Keutzer, John Wawrzynek
University of California, Berkeley
† University of Chinese Academy of Science
‡ Peking University
EMC2 Workshop @ NeurIPS 2019

Motivation
• Deformable convolution is an input-adaptive dynamic operation that samples inputs from variable spatial locations
• Its sampling locations vary with:
  • Different input images
  • Different output pixel locations
• It captures the spatial variance of objects with different:
  • Scales
  • Aspect ratios
  • Rotation angles
• Challenges:
  • Increased compute and memory requirements
  • Irregular, input-dependent memory access patterns
  • Not friendly for dataflows that leverage spatial reuse
[Figure: 1. Generate offsets; 2. Sample from input feature map]
[Figure: Sampling Locations (in red) for Different Output Pixels (in green); Variable Receptive Fields]
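The two steps named on this slide (generate offsets, then sample from the input feature map) can be sketched for a single output pixel as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a single-channel feature map, a 3x3 kernel, zero padding outside the map, and offsets that are already given rather than produced by an offset-generating layer. The function names `bilinear_sample` and `deformable_conv_pixel` are hypothetical.

```python
import math


def bilinear_sample(feature, y, x):
    """Sample a 2-D feature map at a fractional (y, x) location.

    Neighbors that fall outside the map contribute 0 (zero padding, assumed).
    """
    h, w = len(feature), len(feature[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yi in (y0, y0 + 1):
        for xi in (x0, x0 + 1):
            if 0 <= yi < h and 0 <= xi < w:
                # Weight falls off linearly with distance to the neighbor.
                weight = (1.0 - abs(y - yi)) * (1.0 - abs(x - xi))
                value += weight * feature[yi][xi]
    return value


def deformable_conv_pixel(feature, weights, offsets, oy, ox):
    """Compute one output pixel of a single-channel 3x3 deformable convolution.

    offsets[k] = (dy, dx) shifts the k-th kernel tap away from its regular
    grid position, so the sampling location depends on the input.
    """
    acc = 0.0
    k = 0
    for ky in (-1, 0, 1):
        for kx in (-1, 0, 1):
            dy, dx = offsets[k]
            acc += weights[k] * bilinear_sample(feature, oy + ky + dy, ox + kx + dx)
            k += 1
    return acc


# Example: a 5x5 ramp f[y][x] = 5*y + x, an all-ones kernel, output pixel (2, 2).
feature = [[5.0 * y + x for x in range(5)] for y in range(5)]
weights = [1.0] * 9
regular = deformable_conv_pixel(feature, weights, [(0.0, 0.0)] * 9, 2, 2)
shifted = deformable_conv_pixel(feature, weights, [(0.5, 0.5)] * 9, 2, 2)
```

With all offsets set to zero the result matches a regular 3x3 convolution. Each fractional offset touches up to four input pixels, and those pixels are only known once the offsets are computed, which is the irregular, input-dependent access pattern listed under Challenges.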

Algorithm-Hardware Codesign
Algorithm Modification:
0. Original Deformable
• Accuracy¹ (mIoU ↑): 79.9
Hardware Optimization:
• Preloads weights to on-chip buffer
• Loads input and offsets directly from DRAM
[Figure labels: (-2, 2), (2, 0.75); Input Buffer]
¹ Accuracy for Semantic Segmentation on CityScapes

Algorithm-Hardware Codesign
Algorithm Modification:
1. Rounded Offsets
• Accuracy¹ (mIoU ↑): 79.6 (↓ 0.3)
Hardware Optimization:
• Reduces the computation for bilinear interpolation
[Figure labels: (-2, 2.4), (2, 1); Input Buffer]
¹ Accuracy for Semantic Segmentation on CityScapes
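To make the rounded-offset idea concrete, the sketch below rounds each offset component to an integer so that a sample becomes a single pixel read instead of an interpolation over up to four neighbors. The rounding rule (halves round up, via `floor(v + 0.5)`) is an assumption, since the slides do not state one, and the helper names `round_offset` and `sample_rounded` are hypothetical. The example values are the offset pairs that appear as figure labels on these slides; how they pair up before and after rounding is inferred, not stated.

```python
import math


def round_offset(offset):
    """Round each offset component to the nearest integer (halves round up)."""
    dy, dx = offset
    return (math.floor(dy + 0.5), math.floor(dx + 0.5))


def sample_rounded(feature, y, x):
    """Read one pixel at an integer location; out-of-range reads return 0."""
    if 0 <= y < len(feature) and 0 <= x < len(feature[0]):
        return feature[y][x]
    return 0.0


# Fractional offsets become integer offsets, so no interpolation is needed.
rounded_a = round_offset((-2, 2.4))
rounded_b = round_offset((2, 0.75))

# One read per tap: output pixel (2, 2), center tap, offset rounded_b.
feature = [[5.0 * y + x for x in range(5)] for y in range(5)]
value = sample_rounded(feature, 2 + rounded_b[0], 2 + rounded_b[1])
```

Python's built-in `round` uses round-half-to-even, which is why the sketch uses `math.floor(v + 0.5)` to keep the behavior at exact halves predictable.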

Algorithm-Hardware Codesign
Algorithm Modification:
2. Bounded Range
• Δx ≤ 2, Δy ≤ 2
• Accuracy¹ (mIoU ↑): 79.4 (↓ 0.2)
Hardware Optimization:
• Buffers inputs in the on-chip line buffer to allow spatial reuse
¹ Accuracy for Semantic Segmentation on CityScapes
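A bounded offset range is what makes a fixed-size line buffer sufficient: if no tap can move more than a known distance, the set of input rows an output row can touch is known in advance. The sketch below assumes the bound on the slide is a bound on magnitude (each component stays within [-2, 2]) and assumes a 3x3 kernel with unit stride and dilation; neither is stated in the slides. `clamp_offset` and `line_buffer_rows` are hypothetical helper names.

```python
def clamp_offset(offset, bound=2):
    """Clamp each offset component to the range [-bound, bound]."""
    dy, dx = offset
    return (max(-bound, min(bound, dy)), max(-bound, min(bound, dx)))


def line_buffer_rows(kernel_size=3, bound=2):
    """Number of input rows one output row can touch.

    Regular taps span kernel_size rows; each tap may move up to `bound`
    rows further up or down.
    """
    return kernel_size + 2 * bound


clamped = clamp_offset((-3, 1))
rows = line_buffer_rows()
```

Under these assumptions, seven buffered input rows cover every possible sample for one output row, so inputs can be reused from on-chip memory rather than fetched again from DRAM.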

Algorithm-Hardware Codesign
Algorithm Modification:
3. Rectangular Shape
• Accuracy¹ (mIoU ↑): 78.7 (↓ 0.7)
4. Efficient Feature Extractor
5. Depthwise Convolution
Hardware Optimization:
• Improves on-chip memory bandwidth
• Reduce the total MACs
¹ Accuracy for Semantic Segmentation on CityScapes

Results: Hardware Performance
• Our algorithm-hardware co-design methodology for the deformable convolution achieves a 1.36× and 9.76× speedup respectively for the full and depthwise deformable convolution on FPGA

Email: qijing.huang@berkeley.edu
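The "reduce the total MACs" point can be illustrated by counting multiply-accumulates for a full and a depthwise 3x3 convolution. This is only the standard counting argument with hypothetical layer sizes (64x64 output, 256 channels); it ignores the cost of offset generation and sampling, and it is not meant to reproduce or explain the speedups reported on the slide.

```python
def full_conv_macs(h, w, c_in, c_out, k=3):
    """MACs for a full k x k convolution producing an h x w x c_out output."""
    return h * w * c_in * c_out * k * k


def depthwise_conv_macs(h, w, channels, k=3):
    """MACs for a depthwise k x k convolution: one filter per channel."""
    return h * w * channels * k * k


# Hypothetical layer: 64x64 output, 256 input and output channels.
full = full_conv_macs(64, 64, 256, 256)
depthwise = depthwise_conv_macs(64, 64, 256)
ratio = full // depthwise
```

For equal input and output channel counts, the ratio equals the channel count, because each depthwise output channel reads a single input channel instead of all of them.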
