interpretable multimodal deep learning for objective
play

Interpretable Multimodal Deep Learning for Objective Diagnosis, - PowerPoint PPT Presentation

Interpretable Multimodal Deep Learning for Objective Diagnosis, Prognosis and Biomarker Discovery Faisal Mahmood, PhD Postdoctoral Fellow Department of Biomedical Engineering Johns Hopkins University faisalm@jhu.edu | http://faisal.ai March


  1. Interpretable Multimodal Deep Learning for Objective Diagnosis, Prognosis and Biomarker Discovery Faisal Mahmood, PhD Postdoctoral Fellow Department of Biomedical Engineering Johns Hopkins University faisalm@jhu.edu | http://faisal.ai March 20, 2019 faisalm@jhu.edu | faisal.ai 1

  2. Deep Learning for Medical Imaging – Major Challenges Johns Hopkins 1. Limited Annotated Data Electronic Medical Records - Under representation of rare conditions. - Limited experts available for annotation. mmmmmmmmmm Pathology - Privacy Issues - Cost 65% Unlabeled Transfer Learning Unstructured Trained Radiology Pre- 15% Does not capture 70% clinical diversity. ... Faisal Mahmood , Nicholas J. Durr et al."Unsupervised Reverse Domain Adaptation for Synthetic Medical Images via Adversarial Training." IEEE Transactions on Medical Imaging (2018).

  3. Deep Learning for Medical Imaging – Major Challenges 2. Domain Adaptation - Diversity in data, different sensors, cites and patients. - Patient specific texture and color information. Train Test How can we train AI systems robust to variability in the data?

  4. Deep Learning for Medical Imaging – Major Challenges 2. Domain Adaptation - Diversity in data, different sensors, cites and patients. - Patient specific texture and color information. Test Train (F. Mahmood et al., 2018)

  5. Deep Learning for Medical Imaging – Major Challenges 3. Structured Prediction - Global vs Local features. Modify Compare Ground Output Truth AI System Input F. Mahmood , 2018 – ( EN.580.142.13 )

  6. Deep Learning for Medical Imaging – Major Challenges 3. Structured Prediction - Global vs Local features. Compare Per-pixel classification or regression is unstructured . Each pixel is considered Ground conditionally independent. Output Truth Ai System Input How can we develop conditionally dependent deep learning models?

  7. Deep Learning for Medical Imaging – Major Challenges Multi-omics Data 4. Incorporating Multimodal Information - Subjective diagnosis is multimodal. Genomics Proteomics Transcriptomic miRNAomics Imaging Data How Can we Fuse s Metabolomics Histopathology Unregistered, … Uncorrelated and Noisy Data? Immunohistochemistry Patient Specific Data How can we Patient History Familial History Endoscopy Radiology separate patient Clinical Phenotyping specific and … general Pharmacologic Data … … information?

  8. Deep Learning for Medical Imaging – Major Challenges 1. Limited Annotated Data - Under representation of rare conditions. - Limited experts available for annotation. - Privacy Issues, Cost 2. Domain Adaptation - Diversity in data, different sensors, cites and patients. - Patient specific texture and color information. 3. Structured Prediction - Global vs Local features. 4. Incorporating Multimodal Information - Subjective diagnosis is multimodal.

  9. Computational Endoscopy Computational Pathology

  10. Computational Endoscopy Computational Pathology

  11. Endoscopic Depth and Topography Application : Depth Estimation for Endoscopy Purpose : Predict Topography from Monocular Images Topography Matters Colonoscopy Gives 2D Images Faisal Mahmood , Nicholas J. Durr et al." Deep learning and conditional random fields-based depth estimation and topographical reconstruction from conventional endoscopy" Medical Image Analysis (2018).

  12. Endoscopic Depth and Topography Application : Depth Estimation for Endoscopy Purpose : Predict Topography from Monocular Images 60% of colorectal cancer cases detected after optical colonoscopy are associated with missed lesions. Topography Matters How do gastroenterologists predict the presence of a polyp? Colonoscopy Gives 2D Images Predict the size of the perforations. Predict surface topography. Faisal Mahmood , Nicholas J. Durr et al." Deep learning and conditional random fields-based depth estimation and topographical reconstruction from conventional endoscopy" Medical Image Analysis (2018).

  13. Depth Estimation from Monocular Endoscopy Images No Ground Truth Depth Data: - Limited real estate on an endoscope. - Regulatory approvals required to add depth sensor. 8.8mm Solution : Generate Synthetic Endoscopy Data Faisal Mahmood , Nicholas J. Durr et al." Deep learning and conditional random fields-based depth estimation and topographical reconstruction from conventional endoscopy" Medical Image Analysis (2018).

  14. Generating Synthetic Endoscopy Data with GT Depth (F. Mahmood et al., 2018)

  15. Generating Synthetic Endoscopy Data with GT Depth (F. Mahmood et al., 2018)

  16. Deep Learning for Medical Imaging – Major Challenges 1. Limited Annotated Data - Under representation of rare conditions. - Limited experts available for annotation. - Privacy Issues - Cost 2. Domain Adaptation - Diversity in data, different sensors, cites and patients. - Patient specific texture and color information. 3. Structured Prediction - Global vs Local features. 4. Incorporating Multimodal Information - Subjective diagnosis is multimodal.

  17. Training with Endoscopy Synthetic Data Problem: Standard Deep Learning Networks are not sufficiently context aware. Solution: Add non-local information using a joint CNN-Graphical Model Setup. (F. Mahmood et al., 2018)

  18. Solution: Joint CNN-CRF Model (F. Mahmood et al., 2018)

  19. Solution: Joint CNN-CRF Model Typical Deep Learning Flow Adds Non-local Information (F. Mahmood et al., 2018)

  20. Deep Learning for Medical Imaging – Major Challenges 1. Limited Annotated Data - Under representation of rare conditions. - Limited experts available for annotation. - Privacy Issues - Cost 2. Domain Adaptation - Diversity in data, different sensors, cites and patients. - Patient specific texture and color information. 3. Structured Prediction - Global vs Local features. 4. Incorporating Multimodal Information - Subjective diagnosis is multimodal.

  21. Adapting Synthetic Networks to Real Data Problem: Network trained on synthetic data does not work with real data. Solution: Adversarial Reverse Domain Adaptation. Train Test (Mahmood et al., 2018)

  22. Adversarial Reverse Domain Adaptation Typical Flow : Transfer Synthetic Data to Real-like Domain. Proposed Flow : Transfer Real Data to Synthetic-like Domain. Remove Patient Specific Features while preserving features necessary for depth estimation. F. Mahmood , Nicholas J. Durr et al."Unsupervised Reverse Domain Adaptation for Synthetic Medical Images via Adversarial Training." IEEE Transactions on Medical Imaging (2018).

  23. Adversarial Reverse Domain Adaptation Endoscopy Synthetic-like Shape, Shading, Intensity Images Representation Preserved Patient Specific Details Removed Faisal Mahmood , Nicholas J. Durr et al."Unsupervised Reverse Domain Adaptation for Synthetic Medical Images via Adversarial Training." (Mahmood et al., 2018) IEEE Transactions on Medical Imaging (2018).

  24. Endoscopy Depth Estimation Colonoscopy Video Colonoscopy Video Depth Estimate Depth Estimate (Mahmood et al., 2018)

  25. Validation – Endoscopy Depth Estimation (Mahmood et al., 2018)

  26. Estimated Depth to Topography Faisal Mahmood , Nicholas J. Durr et al."Unsupervised Reverse Domain Adaptation for Synthetic Medical Images via Adversarial Training." IEEE Transactions on Medical Imaging (2018).

  27. Polyp Charcterization Serrated Adenoma Hyperplastic Can we predict the type of polyp without a biopsy only from RGB Image using limited data?

  28. Polyp Charcterization Adenoma Serrated Hyperplastic - 76 Videos - All videos labeled by 4 Senior Gastroenterologists & 3 Fellows - Average GI Accuracy: Senior: 63.4% Fellow: 53.7%

  29. Deep Learning for Medical Imaging – Major Challenges 1. Limited Annotated Data - Under representation of rare conditions. - Limited experts available for annotation. - Privacy Issues - Cost 2. Domain Adaptation - Diversity in data, different sensors, cites and patients. - Patient specific texture and color information. 3. Structured Prediction - Global vs Local features. 4. Incorporating Multimodal Information - Subjective diagnosis is multimodal.

  30. Multimodal Data Fusion Depth is an additional predicted modality. Can we fuse depth and RGB Fusion to get better polyp Network classification results? Does depth fusion help train a network that requires less data? Faisal Mahmood , Nicholas J. Durr et al."Unsupervised Reverse Domain Adaptation for Synthetic Medical Images via Adversarial Training." IEEE Transactions on Medical Imaging (2018).

  31. RGB-D Classification via Depth Fusion Data Fusion in Feature Space is better than Concatenation. (Mahmood et al., 2018)

  32. Multimodal Densenet Error (F. Mahmood et al., 2018)

  33. RGB-D Classification RGB vs RGB-D Error Average Accuracy = 93% using RGB-D

  34. RGB vs RGB-D Classification 0.7 DenseNet DenseNet Fit 0.6 MultiDense MultiDense Fit 0.5 test error RGB 0.4 Classificatio 0.3 n RGB+Depth 0.2 Classificatio 0.1 n 0 20 40 60 80 100 120 140 160 180 200 epochs (F. Mahmood et al., 2018)

  35. Gradient Class Activation Maps RGB-D RGB Adenoma Classification Classification (F. Mahmood et al., 2018)

Recommend


More recommend