SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY - PowerPoint PPT Presentation

SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED CRFS
Paper by Chen, Papandreou, Kokkinos, Murphy, Yuille
Slides by Josh Kelle (with graphics from the paper)

Semantic Segmentation
Goal: Partition the image into semantically meaningful parts, and classify each part.
[Figure: example semantic segmentation with regions labeled car, person, horse, and background]

Main Idea
1. Use a CNN to generate a rough prediction of the segmentation (a smooth, blurry heat map).
2. Refine this prediction with a conditional random field (CRF).
[Figure panels: image, CNN output, CRF output]

Why are CNNs insufficient?
Too much invariance. Good for high-level vision tasks like classification, bad for low-level tasks like segmentation.
• Problem: subsampling. Solution: 'atrous' algorithm (hole algorithm).
• Problem: spatial invariance (shared kernel weights). Solution: fully connected CRF.

Example
[Figure panels: image, ground truth, DCNN output, CRF after 1 iteration, CRF after 2 iterations, CRF after 10 iterations]

Part 1: CNN

CNNs for Dense Feature Extraction
• Construct "DeepLab" by modifying VGG-16 (a 16-layer CNN pre-trained on ImageNet, publicly available).
• Convert the fully-connected layers of VGG-16 into convolutional layers.
• Skip subsampling after the last two max-pooling layers.
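The second bullet can be illustrated with a toy example. This is only a sketch under stated assumptions, not the DeepLab code: the weights, the 2 x 2 window size, and the helper names `fc_layer` and `fc_as_convolution` are hypothetical. It shows that a fully-connected layer trained on k x k inputs can be slid over a larger image like a convolution, producing a map of scores instead of a single score vector.

```python
def fc_layer(weights, patch):
    """Fully-connected layer on a flattened k x k patch: one output per weight row."""
    flat = [v for row in patch for v in row]
    return [sum(w * v for w, v in zip(weight_row, flat)) for weight_row in weights]

def fc_as_convolution(weights, image, k):
    """Apply the same weights to every k x k window, giving a dense score map."""
    h, w = len(image), len(image[0])
    score_map = []
    for r in range(h - k + 1):
        out_row = []
        for c in range(w - k + 1):
            patch = [image[r + dr][c:c + k] for dr in range(k)]
            out_row.append(fc_layer(weights, patch))
        score_map.append(out_row)
    return score_map

# Hypothetical weights: 2 output units, each looking at a 2 x 2 patch.
weights = [[1, 0, 0, 1],
           [0, 1, 1, 0]]
image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
score_map = fc_as_convolution(weights, image, k=2)
```

On an input exactly k x k in size the result is a single position, identical to the original fully-connected output; on larger inputs each position holds the scores for one window.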

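Hole Algorithm
• How can we skip the subsampling after max pooling, but keep the learned kernels the same?
• We could introduce zeros into the kernels, but that's slow.
• The hole algorithm is faster: it leaves the kernels intact and samples the input feature map sparsely, using an input stride.
[Figure: hole algorithm, with the input stride labeled]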
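The two options on this slide can be compared in one dimension. The sketch below is illustrative only (the function names and the sample signal are assumptions, not taken from the paper): `insert_zeros` builds the larger kernel with holes, while `atrous_correlate` keeps the original kernel and reads the input with a stride. Both give the same output, but the second never multiplies by the inserted zeros.

```python
def correlate_valid(signal, kernel):
    """Plain 1-D sliding-window filter (no padding)."""
    k = len(kernel)
    return [sum(kernel[j] * signal[i + j] for j in range(k))
            for i in range(len(signal) - k + 1)]

def insert_zeros(kernel, rate):
    """Slow option: put (rate - 1) zeros between neighbouring kernel taps."""
    dilated = [0.0] * ((len(kernel) - 1) * rate + 1)
    for j, w in enumerate(kernel):
        dilated[j * rate] = w
    return dilated

def atrous_correlate(signal, kernel, rate):
    """Hole algorithm: kernel unchanged, input sampled with stride `rate`."""
    k = len(kernel)
    span = (k - 1) * rate + 1  # extent of the input covered by one output
    return [sum(kernel[j] * signal[i + j * rate] for j in range(k))
            for i in range(len(signal) - span + 1)]

signal = [1.0, 2.0, 4.0, 7.0, 11.0, 16.0, 22.0, 29.0]
kernel = [1.0, -2.0, 1.0]

slow = correlate_valid(signal, insert_zeros(kernel, 2))
fast = atrous_correlate(signal, kernel, 2)
```

With a 3-tap kernel and rate 2, the zero-inserted version does 5 multiplications per output and the strided version does 3; the gap grows with the rate and with 2-D kernels.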

Image Resolution
• The CNN shrinks the image, but we need the segmentation at the original resolution.
• Skipping the subsampling in the last two max-pooling phases helps, but the CNN output is still 8x too small.
• Since the score maps are smooth, simply use bilinear interpolation to upsample them.
[Figure: Input → Deep Convolutional Neural Network → Coarse Score Map (Aeroplane) → Bilinear Interpolation]
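A rough sketch of the upsampling step follows. The 8x factor comes from the slide: VGG-16 has five 2x max-pooling stages (overall stride 32), and skipping subsampling in the last two leaves 2^3 = 8. Everything else here is an assumption for illustration: the function name, the tiny 2 x 2 score map, and the "align corners" sampling convention, which the slides do not specify.

```python
def bilinear_upsample(score_map, factor):
    """Upsample a 2-D list of scores by an integer factor.

    Assumes 'align corners' sampling: input corners land on output corners.
    """
    h, w = len(score_map), len(score_map[0])
    out_h, out_w = h * factor, w * factor
    out = []
    for r in range(out_h):
        y = r * (h - 1) / (out_h - 1) if out_h > 1 else 0.0
        y0 = int(y)
        y1 = min(y0 + 1, h - 1)
        dy = y - y0
        row = []
        for c in range(out_w):
            x = c * (w - 1) / (out_w - 1) if out_w > 1 else 0.0
            x0 = int(x)
            x1 = min(x0 + 1, w - 1)
            dx = x - x0
            # Interpolate along x on the two nearest rows, then along y.
            top = (1 - dx) * score_map[y0][x0] + dx * score_map[y0][x1]
            bottom = (1 - dx) * score_map[y1][x0] + dx * score_map[y1][x1]
            row.append((1 - dy) * top + dy * bottom)
        out.append(row)
    return out

output_stride = 2 ** (5 - 2)      # five pooling stages, two without subsampling
coarse = [[0.0, 1.0],
          [1.0, 2.0]]             # hypothetical coarse score map for one class
fine = bilinear_upsample(coarse, output_stride)
```

In the full pipeline this would be applied to the score map of every class before the CRF step.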

Part 2: CRF

Fully Connected CRF
• Traditionally, short-range CRFs are used to smooth noisy segmentation.
• The CNN output is already very smooth. A short-range CRF would make it worse.
• Use a fully connected CRF. The graphical model has every pixel connected to every other pixel.

CRF Energy Function
$$E(x) = \sum_i \theta_i(x_i) + \sum_{ij} \theta_{ij}(x_i, x_j)$$
where $x_i$ is the label assignment of pixel $i$, and
$$\theta_i(x_i) = -\log P(x_i)$$
with $P(x_i)$ the label assignment probability computed by the CNN.
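The unary term can be computed directly from the CNN's per-pixel probabilities. The snippet below is a minimal sketch: the probability values are made up, and the `eps` guard against log(0) is an added assumption that does not appear on the slide.

```python
import math

def unary_potentials(probs, eps=1e-12):
    """theta_i(l) = -log P(x_i = l), where probs[i][l] is the CNN probability.

    eps is a hypothetical guard so that a zero probability does not break log.
    """
    return [[-math.log(max(p, eps)) for p in pixel] for pixel in probs]

def unary_energy(theta, labels):
    """First sum of E(x): add theta_i(x_i) over all pixels."""
    return sum(theta[i][x_i] for i, x_i in enumerate(labels))

# Made-up CNN output for 3 pixels and 2 labels.
probs = [[0.9, 0.1],
         [0.2, 0.8],
         [0.5, 0.5]]
theta = unary_potentials(probs)
likely = unary_energy(theta, [0, 1, 0])    # follows the CNN's top choice
unlikely = unary_energy(theta, [1, 0, 0])  # contradicts the CNN
```

Confident, correct-looking labels get low energy; labels the CNN considers improbable get high energy.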

CRF Energy Function
$$\theta_{ij}(x_i, x_j) = \mu(x_i, x_j) \sum_{m=1}^{K} w_m \cdot k_m(f_i, f_j)$$
where $\mu(x_i, x_j) = 1$ if $x_i \neq x_j$, and zero otherwise (indicator function).
With $p$ = pixel position and $I$ = pixel color intensities, the Gaussian kernels are
$$\sum_{m=1}^{K} w_m \cdot k_m(f_i, f_j) = w_1 \exp\left(-\frac{\|p_i - p_j\|^2}{2\sigma_\alpha^2} - \frac{\|I_i - I_j\|^2}{2\sigma_\beta^2}\right) + w_2 \exp\left(-\frac{\|p_i - p_j\|^2}{2\sigma_\gamma^2}\right)$$
($w$ and $\sigma$ are hyperparameters fit with cross-validation.)
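To make the pairwise term concrete, here is a brute-force sketch of the energy for a handful of pixels. It is not an inference algorithm and not the authors' code. The hyperparameter values in `PARAMS`, the pixel data, and the choice to count each unordered pair once are all illustrative assumptions.

```python
import math

def kernel_sum(p_i, p_j, I_i, I_j, w1, w2, sigma_alpha, sigma_beta, sigma_gamma):
    """Weighted sum of the two Gaussian kernels for one pixel pair."""
    d_pos = sum((a - b) ** 2 for a, b in zip(p_i, p_j))   # ||p_i - p_j||^2
    d_col = sum((a - b) ** 2 for a, b in zip(I_i, I_j))   # ||I_i - I_j||^2
    appearance = w1 * math.exp(-d_pos / (2 * sigma_alpha ** 2)
                               - d_col / (2 * sigma_beta ** 2))
    smoothness = w2 * math.exp(-d_pos / (2 * sigma_gamma ** 2))
    return appearance + smoothness

def pairwise_potential(x_i, x_j, p_i, p_j, I_i, I_j, **params):
    """theta_ij: zero for equal labels (mu = 0), kernel sum otherwise (mu = 1)."""
    if x_i == x_j:
        return 0.0
    return kernel_sum(p_i, p_j, I_i, I_j, **params)

def crf_energy(labels, unary, positions, colors, **params):
    """E(x) = unary terms + pairwise terms over every pair of pixels."""
    n = len(labels)
    energy = sum(unary[i][labels[i]] for i in range(n))
    for i in range(n):
        for j in range(i + 1, n):  # assumption: each unordered pair counted once
            energy += pairwise_potential(labels[i], labels[j],
                                         positions[i], positions[j],
                                         colors[i], colors[j], **params)
    return energy

# Illustrative hyperparameters only; the slide says they are cross-validated.
PARAMS = dict(w1=4.0, w2=3.0, sigma_alpha=50.0, sigma_beta=3.0, sigma_gamma=3.0)
positions = [(0, 0), (0, 1), (0, 2)]
colors = [(200, 30, 30), (198, 32, 29), (20, 20, 220)]  # two reddish, one blue
unary = [[0.1, 2.3], [0.7, 0.7], [2.3, 0.1]]            # made-up -log P values

along_edge = crf_energy([0, 0, 1], unary, positions, colors, **PARAMS)
across_color = crf_energy([0, 1, 1], unary, positions, colors, **PARAMS)
```

Both labelings have the same unary cost, but placing the label boundary between the two similar reddish pixels (`across_color`) is penalized more than placing it at the color change (`along_edge`), which is how the CRF pulls boundaries toward image edges. The double loop is quadratic in the number of pixels, so it is only usable on toy inputs.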

Full Pipeline "DeepLab-CRF"
Input → Deep Convolutional Neural Network → Coarse Score Map (Aeroplane) → Bilinear Interpolation → Fully Connected CRF → Final Output

Comparison to state-of-the-art
Method                      mean IOU (%)
MSRA-CFM                    61.8
FCN-8s                      62.2
TTI-Zoomout-16              64.4
DeepLab-CRF                 66.4
DeepLab-MSc-CRF             67.1
DeepLab-MSc-CRF-LargeFOV    71.6
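For readers unfamiliar with the metric in the table, mean IOU (intersection over union) can be sketched as follows. This is a simplified, single-image version with made-up labels; benchmark evaluations typically accumulate the counts over a whole dataset, and the handling of absent classes here is an assumption.

```python
def mean_iou(pred, truth, num_classes):
    """Mean intersection-over-union in percent for flat lists of pixel labels.

    Classes that appear in neither list are skipped (an assumption).
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, truth) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, truth) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return 100.0 * sum(ious) / len(ious)

truth = [0, 0, 1, 1, 1, 2]   # made-up ground-truth labels for 6 pixels
pred = [0, 1, 1, 1, 2, 2]    # made-up predicted labels
score = mean_iou(pred, truth, num_classes=3)
```

Each class here has an IOU of 0.5, so the mean is 50%.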

Comparison to state-of-the-art
[Figure panels: image, ground truth, FCN-8s, DeepLab-CRF]

Comparison to state-of-the-art
[Figure panels: image, ground truth, TTI-Zoomout-16, DeepLab-CRF]

Success Cases
[Figure panels: image, ground truth, DeepLab, DeepLab-CRF]

Failure Cases
[Figure panels: image, ground truth, DeepLab, DeepLab-CRF]

Conclusion
• Modify the CNN architecture to become less spatially invariant.
• Use the CNN to compute a rough score map.
• Use a fully connected CRF to sharpen the score map.
