GCNet: Non-local Networks Meet Squeeze- Excitation Networks and - PowerPoint PPT Presentation

GCNet: Non-local Networks Meet Squeeze- Excitation Networks and Beyond Yue Cao*, Jiarui Xu*, Stephen Lin, Fangyun Wei, Han Hu MSRA & HKUST Code available at: https://github.com/xvjiarui/GCNet

Rela lated Work rks: : Se Self lf Attentio ion Mechanis ism • Transformer is a milestone for machine translation, which applies a self- attention mechanism to model long-range dependencies. A. Vaswni et al . Attention is all your need . NIPS’2017

Rela lated Work rks: : Non-lo local l Neural l Netw tworks Each query pixel ( 𝑦 𝑗 ) will aggregate values from each key pixel ( 𝑦 𝑘 ) by attention weight averaging. • 1 y i = 𝐷(𝑦) σ ∀𝑘 𝑔(𝑦 𝑗 , 𝑦 𝑘 ) 𝑕(𝑦 𝑘 ) query keys Model dependency between distant pixels (long range dependency) • Complementary to convolution, which prove to work well on many visual understanding tasks. • X. Wang et al . Non-local Neural Networks . CVPR’2018

What Is Is Exp xpected To Be Be Le Learnt • Different query pixels impacted by different sets of key pixels query pixels key pixels X. Wang et al . Non-local Neural Networks . CVPR’2018

What Is Is Actu tuall lly Le Learnt • Different query pixels impacted by the same set of key pixels query pixels key pixels X. Wang et al . Non-local Neural Networks . CVPR’2018

Attentio ion Maps For r Dif ifferent Query ry Pix ixels ls query pixels The effectiveness of non-local neural networks do not come from the modeling of dependency between distant pixels, but from the global context modeling .

St Statis istic ical l Analy lysis is On COCO, Im ImageNet, Kin Kinetic ics • We computed the cosine distance of different parts of Non-Local Network to verify the dependency. • It turns out that what Non-Local modeling is query independent, namely global context from statistical perspective. Cosine distance Dataset AP(bbox) AP(mask) Input Attention Map ouput COCO 38.0 34.7 0.401 0.020 0.012 Dataset Top-1 Top-5 ImageNet 77.2 91.9 0.358 0.004 0.003 Dataset Top-1 Top-5 Kinetics 75.9 92.2 0.301 0.115 0.074

St Statis istic ical l Analy lysis is On Cit Cityscapes (E (Exceptio ion) • However, compared with aforementioned 3 datasets, Cityscapes seems to be an exception. Cosine distance Dataset AP(bbox) AP(mask) Input Attention Map ouput COCO 38.0 34.7 0.401 0.020 0.012 Dataset Top-1 Top-5 ImageNet 77.2 91.9 0.358 0.004 0.003 Dataset Top-1 Top-5 Kinetics 75.9 92.2 0.301 0.115 0.074 Dataset mIoU Cityscapes 77.59 0.315 0.383 0.354

Exp xpli licit itly ly Use se Th The Sa Same Attentio ion Map Non-Local Block Simplification X X FLOPs 9.3G 5.0M model size 2.1M 1.0M 38.0 38.1 accuracy (mAP)

Exp xpli licit itly ly Use se Th The Sa Same Attentio ion Map Non-Local Block Simplification Global Context Block 4.0M FLOPs 9.3G 5.0M model size 2.1M 1.0M 0.1M 38.0 38.1 38.1 accuracy (mAP) borrowed from SE-Net (champion of 2017 ImageNet Challenge)

Exp xpli licit itly ly Use se Th The Sa Same Attentio ion Map Non-Local Block Simplification Reduction Global Context Block 4.0M FLOPs 2,300x 9.3G 5.0M model size 20x 2.1M 1.0M 0.1M 38.0 38.1 unchanged 38.1 accuracy (mAP)

Abla latio ion stu tudy of f Glo lobal l Co Context xt Netw twork Tasks Backbone Dataset Evaluation Image Classification ResNet-50 ImageNet Top Acc Object Detection Faster R-CNN+FPN+ResNet-50 COCO Mean AP Action Recognition ResNet-50 Slow only Kinetics 500 Top Acc Semantic Segmentation Dilated ResNet-101 Cityscapes Mean IoU

COCO Objec ject Detectio ion Resu sult lts • Baseline: Mask R-CNN + ResNet50 + FPN method AP (bbox) AP (mask) #param FLOPs baseline 37.2 33.8 44.4M 279.4G NL-Net 38.0 34.7 46.5M 288.7G SNL-Net 38.1 35.0 45.4M 279.4G GC-Net (1 38.1 34.9 44.5M 279.4G block) GC-Net (all 39.4 35.7 46.9M 279.6G layers) +2.2 mAP +1.9 mAP with little computation and model size overhead!

ImageNet Im Im Image Cla Classif ific icatio ion Resu sult lts • Baseline: ResNet-50 method Top-1 Acc Top-5 Acc #param FLOPs baseline 76.51 93.35 25.56M 3.86G NL-Net 77.21 93.64 27.66M 4.11G SNL-Net 77.10 93.56 26.61M 3.86G GC-Net (1 77.20 93.47 25.69M 3.86G layer) GC-Net (all 77.49 93.67 28.08M 3.87G layers)

Kin Kinetic ics Actio tion Recognit itio ion Resu sult lts • Baseline: ResNet-50 Slow-only method Top-1 Acc Top-5 Acc #param FLOPs baseline 74.94 91.90 32.45M 39.29G NL-Net(5 75.95 92.29 39.81M 59.60G blocks) SNL-Net(5 75.76 92.44 36.13M 39.32G blocks GC-Net (5 75.85 92.25 34.30M 39.31G blocks) GC-Net (all 76.00 92.34 42.45M 39.35G layers)

Cit Cityscapes Se Semantic ics Se Segmentatio ion Resu sult lts • Baseline: ResNet101 Dilated method mIoU #param FLOPs baseline 75.42% 70.96M 646.88G NL-Head 77.59% 71.22M 649.36G SNL-Head 77.22% 71.22M 646.86G GC-Head 78.55% 71.09M 646.89G

COCO Objec ject Detectio ion Resu sult lts • Stronger backbone backbone method AP (bbox) AP (mask) #param FLOPs ResNet-50 Baseline 37.2 33.8 44.4M 279.4G +GC r16 39.4 35.7 46.9M 279.5G +GC r4 39.9 36.2 54.4M 279.6G ResNet-101 Baseline 39.8 36.0 63.4M 354.1G +GC r16 41.1 37.4 68.1M 354.2G +GC r4 41.7 37.6 82.4M 354.3G ResNeXt-101 Baseline 41.2 37.3 63.0M 357.8G +GC r16 42.4 38.0 67.8M 358.1G +GC r4 42.9 38.5 81.9M 358.2G

COCO Objec ject Detectio ion Resu sult lts • Stronger method backbone method AP (bbox) AP (mask) #param FLOPs ResNeXt-101 Baseline 41.2 37.3 63.0M 357.8G +GC r16 42.4 38.0 67.8M 358.1G +GC r4 42.9 38.5 81.9M 358.2G ResNeXt-101 Baseline 44.7 38.3 95.9M 536.9G +Cascade +GC r16 45.9 39.3 100.7M 537.2G +GC r4 46.5 39.7 114.9M 537.3G ResNeXt-101 Baseline 47.1 40.4 98.5M 547.5G +DCN +GC r16 47.9 40.9 103.3M 547.7G +Cascade +GC r4 47.9 40.8 117.5M 547.8G

Conclu lusio ion • We have found empirically that non-local network only models query-independent context on several important visual recognition tasks. • We simplify non-local networks while preserve the long-range dependency modeling capability and performance. • We proposed a novel Global Context Network which can effectively model long-range dependency with light computation, which shows consistent improvements on four fundamental benchmarks.

GCNet: Non-local Networks Meet Squeeze- Excitation Networks and - PowerPoint PPT Presentation

GCNet: Non-local Networks Meet Squeeze- Excitation Networks and Beyond Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu MSRA & HKUST Code available at: https://github.com/xvjiarui/GCNet Rela lated Work rks: : Se Self lf Attentio

Margin Squeeze Hartwig Tauber FTTH Council Europe Margin Squeeze 1 Recognise BEREC are

Squeeze-and-Excitation Networks Jie Hu 1,* Li Shen 2,* Gang Sun 1 2 Department of Engineering

Squeeze Out Stress Mindful Progressive Muscle Relaxation Modified from

Fresh Squeeze Full Design Project Details Client Brief and Brand Promise Client Brief Fresh

Checki king in and Treating High-Achievi ving Students Meet Meet you your r Doctor Doctor

What your Team needs to know and do at a CARA Meet PRIOR TO THE MEET 1. OBTAIN MEET SCHEDULE

Growing Growing Vegetables To Meet Vegetables To Meet Local Needs Local Needs Duwayne Sipper

Snap it! Every brilliant concept comes from a simple idea!* Just one squeeze is all you need to

Strategies to keep people and goods moving in and through Seattle The Seattle Squeeze and the SR

Driving simulated Machine Learning and Humans Training Seb Loze - Simulations Industry Manager,

Female farmers responses to the cost-price squeeze of Australian agriculture AARES NE Branch

Religion and Socio-demographic reproduction crisis: Sikh marriage squeeze in India and Diaspora

The Squeeze on the Sales Tax Presented by Matthew N. Murray The University of Tennessee

Climate Change, Biodiversity & Species Extinction www.eastdevon.gov.uk Coastal Squeeze

Studio 10 Squeeze Week4 Presentation Jinhui Zhu 756359 Case Study & Environmental Analysis

Programme Challenges and opportunities delivering coastal squeeze compensation habitat in

Wednesday's slides TUTORIAL (n) Security (2): The security considerations for ECRIT are put

PALINDROMES IN PURE MORPHIC WORDS Michelangelo Bucci with Elise Vaslet a ab b

Spotlight on Transparency: Secondary Market Disclosure Hot Topics June 16, 2020 SEC Rule 15c2-12

Design for simple application profiles Tom Baker and Karen Coyle Dublin Core Metadata Initiative

How Distance Transform Maps Boost Segmentation CNNs: An Empirical Study Jun Ma Department of

Properties of maximum and minimum factorization length in numerical semigroups By Gilad

Sparse Matrix-Matrix Mul/plica/on for Modern Manycore Architectures

Packet Information IMPORTANT LABEL ON RIGHT HAND SIDE OF PACKET Shoe, Gym 11111 (Student ID #)

GCNet: Non-local Networks Meet Squeeze- Excitation Networks and - PowerPoint PPT Presentation

GCNet: Non-local Networks Meet Squeeze- Excitation Networks and Beyond Yue Cao*, Jiarui Xu*, Stephen Lin, Fangyun Wei, Han Hu MSRA & HKUST Code available at: https://github.com/xvjiarui/GCNet Rela lated Work rks: : Se Self lf Attentio

Margin Squeeze Hartwig Tauber FTTH Council Europe Margin Squeeze 1 Recognise BEREC are

Squeeze-and-Excitation Networks Jie Hu 1,* Li Shen 2,* Gang Sun 1 2 Department of Engineering

Squeeze Out Stress Mindful Progressive Muscle Relaxation Modified from

Fresh Squeeze Full Design Project Details Client Brief and Brand Promise Client Brief Fresh

Checki king in and Treating High-Achievi ving Students Meet Meet you your r Doctor Doctor

What your Team needs to know and do at a CARA Meet PRIOR TO THE MEET 1. OBTAIN MEET SCHEDULE

Growing Growing Vegetables To Meet Vegetables To Meet Local Needs Local Needs Duwayne Sipper

Snap it! Every brilliant concept comes from a simple idea!* Just one squeeze is all you need to

Strategies to keep people and goods moving in and through Seattle The Seattle Squeeze and the SR

Driving simulated Machine Learning and Humans Training Seb Loze - Simulations Industry Manager,

Female farmers responses to the cost-price squeeze of Australian agriculture AARES NE Branch

Religion and Socio-demographic reproduction crisis: Sikh marriage squeeze in India and Diaspora

The Squeeze on the Sales Tax Presented by Matthew N. Murray The University of Tennessee

Climate Change, Biodiversity &amp; Species Extinction www.eastdevon.gov.uk Coastal Squeeze

Studio 10 Squeeze Week4 Presentation Jinhui Zhu 756359 Case Study &amp; Environmental Analysis

Programme Challenges and opportunities delivering coastal squeeze compensation habitat in

Wednesday's slides TUTORIAL (n) Security (2): The security considerations for ECRIT are put

PALINDROMES IN PURE MORPHIC WORDS Michelangelo Bucci with Elise Vaslet a ab b

Spotlight on Transparency: Secondary Market Disclosure Hot Topics June 16, 2020 SEC Rule 15c2-12

Design for simple application profiles Tom Baker and Karen Coyle Dublin Core Metadata Initiative

How Distance Transform Maps Boost Segmentation CNNs: An Empirical Study Jun Ma Department of

Properties of maximum and minimum factorization length in numerical semigroups By Gilad

Sparse Matrix-Matrix Mul/plica/on for Modern Manycore Architectures

Packet Information IMPORTANT LABEL ON RIGHT HAND SIDE OF PACKET Shoe, Gym 11111 (Student ID #)

GCNet: Non-local Networks Meet Squeeze- Excitation Networks and Beyond Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu MSRA & HKUST Code available at: https://github.com/xvjiarui/GCNet Rela lated Work rks: : Se Self lf Attentio

Climate Change, Biodiversity & Species Extinction www.eastdevon.gov.uk Coastal Squeeze

Studio 10 Squeeze Week4 Presentation Jinhui Zhu 756359 Case Study & Environmental Analysis