1 / 10
Dive Deeper Into Box for Object Detection Ran Chen 1 , Yong Liu 2 , Mengdan Zhang 2 , Shu Liu 3 , Bei Yu 1 , Yu-Wing Tai 4 1 The Chinese University of Hong Kong 2 Tencent Youtu Lab 3 SmartMore 4 The Hong Kong University of Science and Technology 2 / 10
Introduction Box Decomposition and Recombination ◮ Reorganizing boundaries of boxes during training. ◮ Optimal boxes with tightening instances provide better localization. Semantic inconsistency in annotations ◮ Backgrounds regarded as positive pixels are the noise for the training. ◮ A self adaptive module is approached to tackle this problem. 3 / 10
<latexit sha1_base64="GAFXR/1Om5fGSedTSPnsn0bcvsE=">ACFHicbZBNSwMxEIazftb6terRS7AIglB2a0XxIXjypWhbaUbHZWQ7PZJZkVy9If4cW/4sWDIl49ePfmNaK2vpC4OGdGSbzBqkUBj3vwxkbn5icmi7MFGfn5hcW3aXlc5NkmkONJzLRlwEzIWCGgqUcJlqYHEg4SJoH/bqFzegjUjUGXZSaMbsSolIcIbWarmbDYRbzE+Zanf3aCMEiay1Rfe/0f/BSsteWvLzoK/gBKZKDjlveCBOexaCQS2ZM3fdSbOZMo+ASusVGZiBlvM2uoG5RsRhM+8f1aXr1glplGj7FNK+3siZ7ExnTiwnTHDazNc65n/1eoZRrvNXKg0Q1D8a1GUSYoJ7SVEQ6GBo+xYFwL+1fKr5lmHG2ORuCP3zyKJxXyn61vH1SLR1UBnEUyCpZIxvEJzvkgByRY1IjnNyRB/JEnp1759F5cV6/WsecwcwK+SPn7ROSH50x</latexit> <latexit sha1_base64="m+AOn2kXJDEyvOaMqmUcRt7XPis=">AB9XicbVDLSgNBEOyNrxhfUY9eBoMYQcJuiOgx4MVjJOYBybrMTmaTIbOzy8ysGpb8hxcPinj1X7z5N04eB0saCiqunu8mPOlLbtbyuzsrq2vpHdzG1t7+zu5fcPmipKJKENEvFItn2sKGeCNjTnLZjSXHoc9ryh9cTv/VApWKRuNOjmLoh7gsWMIK1ke67IX4q1j3nvH7q2WdevmCX7CnQMnHmpABz1Lz8V7cXkSkQhOleo4dqzdFEvNCKfjXDdRNMZkiPu0Y6jAIVuOr16jE6M0kNBJE0Jjabq74kUh0qNQt90hlgP1KI3Ef/zOokOrtyUiTjRVJDZoiDhSEdoEgHqMUmJ5iNDMJHM3IrIAEtMtAkqZ0JwFl9eJs1yamULm4rhWp5HkcWjuAYiuDAJVThBmrQAISnuEV3qxH68V6tz5mrRlrPnMIf2B9/gBNYpEJ</latexit> Box Decomposition and Recombination Decomposition: Ranking: Recombination: Assignment: S 0 S 0 max( S 1 , S 0 0 ) 1 1 S 1 S 1 S 0 S 0 1 1 S 0 S 0 0 0 S 0 S 0 1 1 S 0 S 0 S 0 S 0 S 0 S 0 0 0 2 2 S 0 S 0 S 0 S 0 2 2 1 1 S 0 S 0 S 0 S 0 S 2 S 2 S 0 S 0 1 1 S 0 S 0 2 2 1 1 S 0 S 0 S 2 S 2 S 2 S 2 S 0 S 0 S 0 S 0 S 2 S 2 0 0 1 1 δ 3 S 2 S 2 S 0 S 0 S 0 S 0 δ 1 S 0 S 0 S 0 S 0 S 0 S 0 2 2 S 2 S 2 S 0 S 0 S 1 S 1 0 0 0 0 S 0 S 0 S 1 S 1 δ 2 2 2 S 0 S 0 S 0 S 0 Rank : δ 3 > δ 1 > δ 2 2 2 S 0 S 0 right I S 1 S 1 S 1 S 1 0 0 S 0 S 0 2 2 (a) (b) (c) (d) S 1 S 1 n L IoU = − 1 � � log( IoU ( p i , p ∗ I )) , (1) N pos I i IoU = 1 � I > S I } L IoU ( B ′ L D & R ( 1 { S ′ I , T I ) N pos (2) I + 1 { S I � S ′ I } L IoU ( B I , T I )) , 4 / 10
Box Decomposition and Recombination 0 . 9 0 . 7 IoU 0 . 5 IoU w/ D&R IoU w/o D&R 0 . 3 0 2 4 6 8 10 12 epoch ◮ Boxes optimized by D&R have higher IoU scores and lower variances. 5 / 10
Box Decomposition and Recombination 6 / 10
Semantic Consistency � R I ↓ ← negative , C I ↓ � R I ↑ ← positive , C I ↑ (3) x g c i = max j = 0 ( c j ) ∈ C I , Settings AP AP 50 AP 75 AP S AP M AP L None 33.6 53.1 35.0 18.9 38.2 43.7 PN 34.2 53.2 36.3 20.8 38.9 44.2 PNI 33.7 53.0 35.5 17.9 38.3 44.1 Ours 35.3 55.4 37.1 20.9 39.6 45.9 C I ↑ R I ↑ positive set negative set 7 / 10
Semantic Consistency 8 / 10
Results Modules AP AP 50 AP 75 AP S AP M AP L Baseline D&R Consistency � 33.6 53.1 35.0 18.9 38.2 43.7 � � 34.8 54.0 36.4 19.7 39.0 44.9 � � 37.2 55.4 39.5 21.0 41.7 48.6 � � � 38.0 56.5 40.8 21.6 42.4 50.4 9 / 10
Results Method Backbone AP AP 50 AP 75 AP S AP M AP L Two-stage methods: Faster R-CNN w/ FPN ResNet-101-FPN 36.2 59.1 39.0 18.2 39.0 48.2 Faster R-CNN w/ TDM Inception-ResNet-v2-TDM 36.8 57.7 39.2 16.2 39.8 52.1 Faster R-CNN by G-RMI Inception-ResNet-v2 34.7 55.5 36.7 13.5 38.1 52.0 RPDet ResNet-101-DCN 42.8 65.0 46.3 24.9 46.2 54.7 Cascade R-CNN ResNet-101 42.8 62.1 46.3 23.7 45.5 55.2 One-stage methods: YOLOv2 DarkNet-19 21.6 44.0 19.2 5.0 22.4 35.5 SSD ResNet-101 31.2 50.4 33.3 10.2 34.5 49.8 DSSD ResNet-101 33.2 53.3 35.2 13.0 35.4 51.1 FSAF ResNet-101 40.9 61.5 44.0 24.0 44.2 51.3 RetinaNet ResNet-101-FPN 39.1 59.1 42.3 21.8 42.7 53.9 CornerNet Hourglass-104 40.5 56.5 43.1 19.4 42.7 53.9 ExtremeNet Hourglass-104 40.1 55.3 43.2 20.3 43.2 53.1 FCOS † ResNet-101-FPN 41.5 60.7 45.0 24.4 44.8 51.6 FCOS † ResNeXt-64x4d-101-FPN 43.2 62.8 46.6 26.5 46.2 53.3 FCOS † w/improvements ResNeXt-64x4d-101-FPN 44.7 64.1 48.4 27.6 47.5 55.6 DDBNet (Ours) ResNet-101-FPN 42.0 61.0 45.1 24.2 45.0 53.3 DDBNet (Ours) ResNeXt-64x4d-101-FPN 43.9 63.1 46.7 26.3 46.5 55.1 DDBNet (Ours) § ResNeXt-64x4d-101-FPN 45.5 64.5 48.5 27.8 47.7 57.1 10 / 10
Recommend
More recommend