high resolution breast cancer screening with multi view
play

High-Resolution Breast Cancer Screening with Multi-View Deep - PowerPoint PPT Presentation

High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks Krzysztof J. Geras Joint work with Kyunghyun Cho , Linda Moy , Gene Kim , Stacey Wolfson and Artie Shen . GTC 2017 W HERE DEEP LEARNING IS USEFUL amount


  1. High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks Krzysztof J. Geras Joint work with Kyunghyun Cho , Linda Moy , Gene Kim , Stacey Wolfson and Artie Shen . GTC 2017

  2. W HERE DEEP LEARNING IS USEFUL amount of data available difficulty of the task

  3. W HERE DEEP LEARNING IS USEFUL natural image recognition speech machine recognition translation game playing amount of data available the learning tasks for which deep learning makes a difference difficulty of the task

  4. Can we save the world?

  5. W HERE DEEP LEARNING IS USEFUL natural image recognition speech machine recognition translation game playing amount of data available medical image analysis? the learning tasks for which deep learning makes a difference difficulty of the task

  6. B REAST CANCER SCREENING

  7. B REAST CANCER SCREENING About 40 million exams performed yearly in the US.

  8. B REAST CANCER SCREENING About 40 million exams performed yearly in the US. About 250 thousand women are diagnosed with cancer.

  9. B REAST CANCER SCREENING About 40 million exams performed yearly in the US. About 250 thousand women are diagnosed with cancer. About 40 thousand die.

  10. B REAST CANCER SCREENING R-MLO L-MLO R-CC L-CC (left cranial caudal) (right cranial caudal) (left mediolateral oblique) (right mediolateral oblique)

  11. B REAST CANCER SCREENING We try to mimic predictions of a radiologist. ◮ Class 0: incomplete ( ≈ 15%). ◮ Class 1: negative ( ≈ 50%). ◮ Class 2: bening findings ( ≈ 35%).

  12. B REAST CANCER SCREENING We try to mimic predictions of a radiologist. ◮ Class 0: incomplete ( ≈ 15%). ◮ Class 1: negative ( ≈ 50%). ◮ Class 2: bening findings ( ≈ 35%). Radiologists call these classes BI-RADS (short for Breast Imaging-Reporting and Data System).

  13. C HALLENGES (1)

  14. C HALLENGES (1) You need a lot of data to do deep learning.

  15. C HALLENGES (1) You need a lot of data to do deep learning. Publicly available data sets contain about 1k images.

  16. C HALLENGES (1) You need a lot of data to do deep learning. Publicly available data sets contain about 1k images. We build our own data set: ◮ 23k exams, ◮ 103k images. ◮ Each image is at least 2600 × 2000 pixels.

  17. C HALLENGES (2)

  18. C HALLENGES (2)

  19. C HALLENGES (2)

  20. C HALLENGES (2)

  21. C HALLENGES (2)

  22. C HALLENGES (2)

  23. C HALLENGES (2)

  24. C HALLENGES (2) High resolution necessary - computational and engineering challenge.

  25. C HALLENGES (3) Multi-view data. How to integrate information?

  26. O UR MODEL Classifier p ( y | x ) Concatenation (256 × 4 dim) DCN DCN DCN DCN L-CC R-CC L-MLO R-MLO

  27. O UR MODEL layer kernel size stride #maps repetition global average pooling 256 convolution 3 × 3 1 × 1 256 × 3 max pooling 2 × 2 2 × 2 128 Classifier p ( y | x ) convolution 3 × 3 1 × 1 128 × 3 Concatenation (256 × 4 dim) max pooling 2 × 2 2 × 2 128 convolution 3 × 3 1 × 1 128 × 3 DCN DCN DCN DCN max pooling 2 × 2 2 × 2 64 L-CC R-CC L-MLO R-MLO 3 × 3 1 × 1 × 2 convolution 64 convolution 3 × 3 2 × 2 64 max pooling 3 × 3 3 × 3 32 3 × 3 2 × 2 convolution 32 input 1

  28. R ESULTS 1.0 0.8 AUC true positive rate 0.6 0 vs. others: 0.609 1 vs. others: 0.717 0.4 2 vs. others: 0.728 BI-RADS 0 0.2 Average: 0.685 BI-RADS 1 BI-RADS 2 0.0 0.0 0.2 0.4 0.6 0.8 1.0 false positive rate

  29. C ONFIDENT TEST DATA We can compute the entropy of predictions, � p ( y ′ | x ) log p ( y ′ | x ) , H ( y | x ) = − y ′ ∈C and sort examples according to it.

  30. C ONFIDENT TEST DATA We can compute the entropy of predictions, � p ( y ′ | x ) log p ( y ′ | x ) , H ( y | x ) = − y ′ ∈C and sort examples according to it. We will consider the 30% with the lowest entropy to be “confident”.

  31. R ESULTS FOR CONFIDENT TEST DATA 1.0 0.8 AUC true positive rate 0.6 0 vs. others: 0.636 1 vs. others: 0.816 0.4 2 vs. others: 0.844 BI-RADS 0 0.2 Average: 0.765 BI-RADS 1 BI-RADS 2 0.0 0.0 0.2 0.4 0.6 0.8 1.0 false positive rate

  32. I MPACT OF DOWNSCALING 0.80 0.75 0.70 AUC 0.65 0.60 average AUC average AUC (confident) 0.55 1/8 1/4 1/2 1 resolution fraction

  33. I MPACT OF THE DATA SET SIZE 0.80 0.75 0.70 AUC 0.65 0.60 0.55 average AUC average AUC (confident) 0.50 1/10 1/5 1/2 1 data set size fraction

  34. V ISUALISATION � � ∂ H ( y | x ) � � We visualise � , � � ∂ x v � � ( i , j ) � � p ( y ′ | x ) log p ( y ′ | x ) . where H ( y | x ) = − y ′ ∈C

  35. V ISUALISATION

  36. V ISUALISATION

  37. C ONCLUSIONS ◮ We made a first step in the direction of end-to-end breast cancer screening with neural networks.

  38. C ONCLUSIONS ◮ We made a first step in the direction of end-to-end breast cancer screening with neural networks. ◮ It is much harder to learn the “incomplete” (0) class than other classes.

  39. C ONCLUSIONS ◮ We made a first step in the direction of end-to-end breast cancer screening with neural networks. ◮ It is much harder to learn the “incomplete” (0) class than other classes. ◮ We need to use the full resolution. A lot more effort is necessary to develop archiectures appropriate for data of large dimensionality.

  40. C ONCLUSIONS ◮ We made a first step in the direction of end-to-end breast cancer screening with neural networks. ◮ It is much harder to learn the “incomplete” (0) class than other classes. ◮ We need to use the full resolution. A lot more effort is necessary to develop archiectures appropriate for data of large dimensionality. ◮ We need more data.

  41. C ONCLUSIONS ◮ We made a first step in the direction of end-to-end breast cancer screening with neural networks. ◮ It is much harder to learn the “incomplete” (0) class than other classes. ◮ We need to use the full resolution. A lot more effort is necessary to develop archiectures appropriate for data of large dimensionality. ◮ We need more data. (We are currently processing a 10 times bigger data set).

  42. Thank you! High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks K. J. Geras, S. Wolfson, S. G. Kim, L. Moy, K. Cho arXiv:1703.07047

Recommend


More recommend