Deep Learning Basics Lecture 5: Convolution Princeton University COS 495 Instructor: Yingyu Liang
Convolutional neural networks • Strong empirical application performance • Convolutional networks: neural networks that use convolution in place of general matrix multiplication in at least one of their layers: ℎ = 𝜏(𝑋ᵀ𝑦 + 𝑐) for a specific kind of weight matrix 𝑋
Convolution
Convolution: math formula • Given functions 𝑣(𝑢) and 𝑥(𝑢), their convolution is a function 𝑡(𝑢): 𝑡(𝑢) = ∫ 𝑣(𝑏) 𝑥(𝑢 − 𝑏) 𝑑𝑏 • Written as 𝑡 = 𝑣 ∗ 𝑥 or 𝑡(𝑢) = (𝑣 ∗ 𝑥)(𝑢)
Convolution: discrete version • Given arrays 𝑣[𝑢] and 𝑥[𝑢], their convolution is a function 𝑡[𝑢]: 𝑡[𝑢] = Σ_{𝑏=−∞}^{+∞} 𝑣[𝑏] 𝑥[𝑢 − 𝑏] • Written as 𝑡 = 𝑣 ∗ 𝑥 or 𝑡[𝑢] = (𝑣 ∗ 𝑥)[𝑢] • When 𝑣[𝑢] or 𝑥[𝑢] is not defined, it is assumed to be 0
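Below is a minimal NumPy sketch of the discrete formula (the function name and example arrays are my own illustrative choices, not from the slides): a direct double loop over 𝑣[𝑏] 𝑥[𝑢 − 𝑏] with out-of-range entries treated as 0, checked against NumPy's built-in np.convolve.

```python
import numpy as np

def conv1d(v, x):
    """Discrete convolution t[u] = sum_b v[b] * x[u - b],
    with out-of-range entries treated as 0 ("full" convolution)."""
    t = np.zeros(len(v) + len(x) - 1)
    for u in range(len(t)):
        for b in range(len(v)):
            if 0 <= u - b < len(x):
                t[u] += v[b] * x[u - b]
    return t

v = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])  # input array
x = np.array([0.5, 0.25, 0.25])               # kernel
print(conv1d(v, x))
print(np.convolve(v, x))  # NumPy's built-in gives the same values
```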
Illustration 1 • Kernel 𝑥 = [z, y, x], input 𝑣 = [a, b, c, d, e, f] • Sliding the kernel across the input gives successive output elements: xb + yc + zd, then xc + yd + ze, then xd + ye + zf
Illustration 1: boundary case • At the boundary only part of the kernel overlaps the input, giving xe + yf
Illustration 1 as matrix multiplication • The same convolution can be written as multiplying the input vector [a, b, c, d, e, f] by a banded matrix whose rows contain shifted copies of the kernel entries x, y, z
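A small sketch of this idea (the numbers and variable names are illustrative assumptions): build the banded matrix whose rows hold shifted copies of the flipped kernel and check that multiplying by it reproduces np.convolve.

```python
import numpy as np

kernel = np.array([2.0, -1.0, 3.0])              # plays the role of [x, y, z]
inp = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])   # plays the role of [a..f]

# Build the "full" convolution matrix: row i holds the flipped kernel
# shifted by i positions, so C @ inp == np.convolve(inp, kernel).
n_out = len(inp) + len(kernel) - 1
C = np.zeros((n_out, len(inp)))
for i in range(n_out):
    for j in range(len(inp)):
        k = i - j                      # index into the kernel
        if 0 <= k < len(kernel):
            C[i, j] = kernel[k]

print(C @ inp)
print(np.convolve(inp, kernel))        # same values
```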
Illustration 2: two-dimensional case • Input: the 3×4 grid [a b c d; e f g h; i j k l]; kernel (or filter): the 2×2 grid [w x; y z] • Sliding the kernel over the input produces the feature map, e.g. wa + bx + ey + fz at the first position and bw + cx + fy + gz at the next
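A minimal NumPy sketch of the 2D case (the function name and array values are my own illustrative choices): slide the kernel over the input and take an elementwise product and sum at each position. As in the slide's wa + bx + ey + fz computation, the kernel is not flipped, i.e. this is cross-correlation, which is what most deep learning libraries implement under the name "convolution".

```python
import numpy as np

def conv2d_valid(inp, kernel):
    """Slide the kernel over the input and take dot products at each
    position (cross-correlation, i.e. no kernel flip)."""
    H, W = inp.shape
    kH, kW = kernel.shape
    out = np.zeros((H - kH + 1, W - kW + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(inp[i:i+kH, j:j+kW] * kernel)
    return out

inp = np.arange(12, dtype=float).reshape(3, 4)   # the 3x4 grid a..l
kernel = np.array([[1.0, 2.0],
                   [3.0, 4.0]])                  # the 2x2 kernel [w x; y z]
print(conv2d_valid(inp, kernel))                 # 2x3 feature map
```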
Advantage: sparse interaction • Fully connected layer: 𝑛 × 𝑜 edges (𝑛 output nodes, 𝑜 input nodes) Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Advantage: sparse interaction • Convolutional layer: ≤ 𝑛 × 𝑙 edges (𝑛 output nodes, 𝑜 input nodes, kernel size 𝑙) Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Advantage: sparse interaction Multiple convolutional layers: larger receptive field Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Advantage: parameter sharing • The same kernel is used repeatedly across positions. E.g., the black edges all correspond to the same weight in the kernel. Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Advantage: equivariant representations • Equivariant: transforming the input and then applying convolution gives the same result as applying convolution and then transforming the output • Example: input is an image, transformation is shifting • Convolution(shift(input)) = shift(Convolution(input)) • Useful when we care only about whether a pattern exists, rather than exactly where it is
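A quick numerical check of this identity (my own example values; circular convolution is used so the equality is exact even at the boundary, whereas zero-padded convolution satisfies it only away from the edges):

```python
import numpy as np

def circular_conv(v, kernel):
    """Circular (wrap-around) convolution, so shifting is exactly equivariant."""
    n = len(v)
    out = np.zeros(n)
    for u in range(n):
        for b in range(len(kernel)):
            out[u] += kernel[b] * v[(u - b) % n]
    return out

v = np.array([1.0, 4.0, 2.0, 8.0, 5.0, 7.0])
k = np.array([0.5, 0.25, 0.25])

shift = lambda a: np.roll(a, 1)          # shift by one position
lhs = circular_conv(shift(v), k)         # Convolution(shift(input))
rhs = shift(circular_conv(v, k))         # shift(Convolution(input))
print(np.allclose(lhs, rhs))             # True
```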
Pooling
Terminology Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Pooling • Summarizing the input (e.g., outputting the max of each local region) Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Advantage • Induces invariance to small translations of the input Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Motivation from neuroscience • David Hubel and Torsten Wiesel studied the early visual system (V1, the primary visual cortex) and won the Nobel Prize for this work • V1 properties • 2D spatial arrangement • Simple cells: inspire convolution layers • Complex cells: inspire pooling layers
Variants of convolution and pooling
Variants of convolutional layers • Multi-dimensional convolution • Input and kernel can be 3D • E.g., images have (width, height, RGB channels) • Multiple kernels lead to multiple feature maps (also called channels) • A mini-batch of images is a 4D tensor: (image_id, width, height, RGB channels)
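A rough sketch of the multi-channel case (the shapes, names, and sizes are illustrative assumptions): each kernel spans all input channels and produces one feature map, so a bank of K kernels yields K output channels; a mini-batch would simply add a leading image_id dimension.

```python
import numpy as np

def conv2d_multichannel(inp, kernels):
    """inp: (H, W, C); kernels: (K, kH, kW, C) -> output (H-kH+1, W-kW+1, K).
    Each kernel spans all input channels and yields one feature map."""
    H, W, C = inp.shape
    K, kH, kW, _ = kernels.shape
    out = np.zeros((H - kH + 1, W - kW + 1, K))
    for k in range(K):
        for i in range(out.shape[0]):
            for j in range(out.shape[1]):
                out[i, j, k] = np.sum(inp[i:i+kH, j:j+kW, :] * kernels[k])
    return out

inp = np.random.rand(32, 32, 3)         # one RGB image: (height, width, channels)
kernels = np.random.rand(8, 5, 5, 3)    # 8 kernels of size 5x5 spanning 3 channels
print(conv2d_multichannel(inp, kernels).shape)   # (28, 28, 8)
```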
Variants of convolutional layers • Padding: valid — the kernel is applied only where it fully overlaps the input (last output element: xd + ye + zf)
Variants of convolutional layers • Padding: same — the output has the same length as the input; missing inputs at the boundary are treated as 0 (boundary output: xe + yf)
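A small demo of the two padding modes using np.convolve (the example arrays are my own); the "full" mode is included for comparison:

```python
import numpy as np

v = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])   # input of length 6
x = np.array([0.5, 0.25, 0.25])                 # kernel of length 3

print(np.convolve(v, x, mode='valid'))  # length 4: only fully-overlapping positions
print(np.convolve(v, x, mode='same'))   # length 6: out-of-range inputs treated as 0
print(np.convolve(v, x, mode='full'))   # length 8: every position with any overlap
```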
Variants of convolutional layers • Stride: the kernel is moved by more than one position between output elements Figure from Deep Learning, by Goodfellow, Bengio, and Courville
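One way to view stride (a sketch under that assumption, not necessarily how a library implements it): compute the ordinary convolution and keep only every s-th output.

```python
import numpy as np

def strided_conv1d(v, kernel, stride=2):
    """Valid convolution evaluated only at every `stride`-th position."""
    full = np.convolve(v, kernel, mode='valid')
    return full[::stride]

v = np.arange(10, dtype=float)
k = np.array([1.0, 0.5, 0.25])
print(np.convolve(v, k, mode='valid'))   # 8 outputs with stride 1
print(strided_conv1d(v, k, stride=2))    # 4 outputs: every second one
```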
Variants of convolutional layers • Others: • Tiled convolution • Channel specific convolution • ……
Variants of pooling • Stride and padding Figure from Deep Learning, by Goodfellow, Bengio, and Courville
Variants of pooling • Max pooling: 𝑧 = max{𝑦_1, 𝑦_2, … , 𝑦_𝑙} • Average pooling: 𝑧 = mean{𝑦_1, 𝑦_2, … , 𝑦_𝑙} • Others like max-out
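A minimal NumPy sketch of max and average pooling over non-overlapping 1D windows (the function name and example values are illustrative assumptions):

```python
import numpy as np

def pool1d(y, width, mode="max"):
    """Pool non-overlapping windows of `width` values (stride = width)."""
    n = (len(y) // width) * width             # drop any incomplete final window
    windows = y[:n].reshape(-1, width)
    return windows.max(axis=1) if mode == "max" else windows.mean(axis=1)

y = np.array([1.0, 3.0, 2.0, 8.0, 5.0, 4.0, 7.0, 6.0])
print(pool1d(y, 2, "max"))    # [3. 8. 5. 7.]
print(pool1d(y, 2, "mean"))   # [2.  5.  4.5 6.5]
```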