geirhos et al 2019 introduction
play

Geirhos et al. (2019) Introduction ImageNet classifjcation with - PowerPoint PPT Presentation

IMAGENET-TRAINED CNNS ARE BIASED TOWARDS TEXTURE; INCREASING SHAPE BIAS IMPROVES ACCURACY AND ROBUSTNESS Geirhos et al. (2019) Introduction ImageNet classifjcation with CNNs Which image cues are learned How infmuential they are


  1. IMAGENET-TRAINED CNNS ARE BIASED TOWARDS TEXTURE; INCREASING SHAPE BIAS IMPROVES ACCURACY AND ROBUSTNESS Geirhos et al. (2019)

  2. Introduction ● ImageNet classifjcation with CNNs ● Which image cues are learned ● How infmuential they are ● Comparison with humans 2

  3. Testing Hypothesis Shape Hypothesis “The network acquires complex knowledge about the kinds of shapes associated with each category. [...] High-level units appear to learn representations of shapes occurring in natural images ” Kriegeskorte (2015) Intermediate CNN layers recognize “ parts of familiar objects, and subsequent layers [...] detect objects as combinations of these parts” LeCun et al. (2015) 3

  4. Testing Hypothesis Shape Hypothesis “The network acquires complex knowledge about the kinds of shapes associated with each category. [...] High-level units appear to learn representations of shapes occurring in natural images ” Kriegeskorte (2015) Intermediate CNN layers recognize “ parts of familiar objects, and subsequent layers [...] detect objects as combinations of these parts” LeCun et al. (2015) Texture Hypothesis CNNs can still classify texturised images perfectly well, even if the global shape structure is completely destroyed Gatys et al. (2017) and Brendel & Bethge (2019) Standard CNNs are bad at recognizing object sketches where object shapes are preserved yet all texture cues are missing Ballester & de Araújo (2016) 4

  5. Set-up Psychophysical Model experiments 97 observers AlexNet ● ● 48,560 trials GoogLeNet ● ● 300 ms fjxation square VGG-16 ● ● + 200 ms image ResNet-50 ● + 200 ms pink noise + 1500 ms category selection Breaks after every 256 trials ● Practice session of 320 trials ● 5

  6. Experiments Original 160 color images 10 per category white background 6

  7. Experiments Original Greyscale 160 color images As original but 10 per category greyscale white background 7

  8. Experiments Original Greyscale Silhouette 160 color images As original but As original but 10 per category greyscale only a manually white background created black mask 8

  9. Experiments Original Greyscale Silhouette Edge 160 color images As original but As original but Canny edge 10 per category greyscale only a manually extractor on white background created black original dataset mask 9

  10. Experiments Original Greyscale Silhouette Edge T exture 160 color images As original but As original but Canny edge For items with no 10 per category greyscale only a manually extractor on textured areas, eg white background created black original dataset “bottles” a cluster mask of those objects are considered as texture 10

  11. Experiments Original content images Original texture images Filled silhouette experiment Masked texture images inside The silhouettes. The textures had 360 degrees data augmentation Cue confmict experiment Using iterative style transfer Gatys et al. (2016) 11

  12. Cue Confmict Results Human observers ( red circles ) AlexNet ( purple diamonds ) VGG-16 ( blue triangles ) GoogLeNet ( turquoise circles ) ResNet-50 ( grey squares ) 12

  13. Overcoming the texture bias Stylized-ImageNet (SIN) Created by applying AdaIN style transfer to ImageNet images Huang et al. (2017) 13

  14. Model Metrics T op-5 Accuracy of the stylized-ImageNet trained models compared to the ImageNet trained models 14

  15. Model Metrics T op-5 Accuracy of the stylized-ImageNet trained models compared to the ImageNet trained models Shape-ResNet is the model trained jointly on SIN and IN and fjne-tuned on IN 15

  16. Bias Results Human observers ( red circles ) ResNet-50 on Stylized-Imagenet ( orange squares ) ResNet-50 on Imagenet ( grey squares ) 16

  17. Distortion Robustness Results 17

  18. Questions?

Recommend


More recommend