advisors robert chang jeff ullman andreas paepcke all
play

Advisors: Robert Chang, Jeff Ullman, Andreas Paepcke *All - PowerPoint PPT Presentation

Apaar Sadhwani, Leo Tam, Jason Su Advisors: Robert Chang, Jeff Ullman, Andreas Paepcke *All contributors are/were affiliated with Stanford University at time of their contributions. Leo Tam now works at Nvidia. Address correspondence to Apaar


  1. Apaar Sadhwani, Leo Tam, Jason Su Advisors: Robert Chang, Jeff Ullman, Andreas Paepcke *All contributors are/were affiliated with Stanford University at time of their contributions. Leo Tam now works at Nvidia. Address correspondence to Apaar Sadhwani at apaars@stanford.edu or Jason Su at sujason@stanford.edu.

  2.  Motivation:  Affects ~100M, many in developed, ~45% of diabetics  Make process faster, assist ophthalmologist, self-help  Widespread disease, enable early diagnosis/care  Given fundus image  Rate severity of Diabetic Retinopathy  5 Classes: 0 (Normal), 1, 2, 3, 4 (Severe)  Hard classification (may solve as ordinal though)  Metric: quadratic weighted kappa, (pred – real) 2 penalty  Data from Kaggle  ~35,000 training images, ~54,000 test images  High resolution: variable, more than 2560 x 1920  Other unlabeled data from Stanford

  3.  Motivation:  Affects ~100M, many in developed, ~45% of diabetics  Make process faster, assist ophthalmologist, self-help  Widespread disease, enable early diagnosis/care  Given fundus image  Rate severity of Diabetic Retinopathy  5 Classes: 0 (Normal), 1, 2, 3, 4 (Severe)  Hard classification (may solve as ordinal though)  Metric: quadratic weighted kappa, (pred – real) 2 penalty  Data from Kaggle  ~35,000 training images, ~54,000 test images  High resolution: variable, more than 2560 x 1920  Other unlabeled data from Stanford

  4. Class 0 (normal) Class 4 (severe)

  5.  Motivation:  Affects ~100M, many in developed, ~45% of diabetics  Make process faster, assist ophthalmologist, self-help  Widespread disease, enable early diagnosis/care  Given fundus image  Rate severity of Diabetic Retinopathy  5 Classes: 0 (Normal), 1, 2, 3, 4 (Severe)  Hard classification (may solve as ordinal though)  Metric: quadratic weighted kappa, (pred – real) 2 penalty  Data from Kaggle  ~35,000 training images, ~54,000 test images  High resolution: variable, more than 2560 x 1920  Other unlabeled data from Stanford

  6.  High resolution images Image size Batch Size Atypical in vision, GPU batch size issues  224 x 224 128  Discriminative features small 2K x 2K 2  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  7.  High resolution images 0 1 Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data  2 3  Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples 4

  8.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples Class 2

  9.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  10.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling - Mentioned in problem statement  Artifacts in ~40% images - Confirmed with doctors  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  11.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  12. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  13. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small Predict  Grading criteria: 1 not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  14. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small Predict 2  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  15. Penalty/Loss Predict  High resolution images 3 Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  16. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  17. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 2.5 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Squared error approximation? class 0 dominates  - Differentiable  Too few training examples

  18.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates  - Naïve: 3 class problem, or all zeros!  Too few training examples - Learn all classes separately: 1 vs All? - Balanced while training - At test time?

  19.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates  - Big learning models take more data!  Too few training examples - Harness test set?

  20.  Literature survey:  Hand-designed features to pick each component  Clean images, small datasets  Optic disk, exudate segmentation: fail due to artifacts  SVM: poor performance

  21.  Literature survey:  Hand-designed features to pick each component  Clean images, small datasets  Optic disk, exudate segmentation: fail due to artifacts  SVM: poor performance

  22. 1. Registration, Pre-processing 2. Convolutional Neural Nets (CNNs) 3. Hybrid Architecture

  23.  Registration  Hough circles, remove outside portion  Downsize to common size (224 x 224, 1K x 1K)  Color correction  Normalization (mean, variance)

  24. Class probabilities  Network in Network architecture AvgPool  7.5M parameters MaxPool (stride2)  No FC layers, spatial average pooling 3 Conv layers (depth 1024) instead  Transfer learning (ImageNet) MaxPool (stride2) 3 Conv layers  Variable learning rates (depth 384)  Low for “ ImageNet ” layers MaxPool (stride2) 3 Conv layers  Schedule (depth 256)  Combat lack of data, over-fitting MaxPool (stride2)  Dropout, Early stopping 3 Conv layers (depth 96)  Data augmentation (flips, rotation) Input Image

Recommend


More recommend