IBM Research
Automatic Diagnosis of Pulmonary Embolism Using an Attention-guided Framework: A Large-scale Study
Luyao Shi 1, Deepta Rajan 2, Shafiq Abedin 2, Manikanta Srikar Yellapragada 3, David Beymer 2, and Ehsan Dehghan 2
1 Yale University  2 IBM Research  3 New York University
About pulmonary embolism (PE)
• Causes: a clump of material, most often a blood clot, gets wedged into an artery in the lungs. These blood clots most commonly come from the deep veins of the legs.
• Mortality:
  – About 100,000 deaths/year in the US.
  – 1 in 4 people who have a PE die without warning.
  – 10 to 30% of people die within one month of diagnosis.
• Prompt recognition of the diagnosis and immediate initiation of therapy are important.
About pulmonary embolism (PE)
• Contrast-enhanced chest CT is the preferred method of diagnostic imaging in patients with a clinical risk score indicative of PE.
• PE can be visualized as perfusion defects.
Abdel-Razzak M. Al-hinnawi. Computer-Aided Detection, Pulmonary Embolism, Computerized Tomography Pulmonary Angiography: Current Status (2018).
Motivation
• Challenges:
  – Increased probability of false-positive findings when the lesions involve peripheral pulmonary vascular regions.
  – Confounding factors:
    o Veins poorly filled with contrast media
    o Impacted bronchi or parenchymal disease
    o Lymphoid tissues around the vessels
    o Respiratory/cardiac motion artifacts
    o Image noise
  – PE detection/exclusion is time-consuming and dependent on the experience of the radiologist.
• GOAL: a deep learning-based computer-aided diagnosis (CAD) platform to detect PE with high accuracy.
In-Hye Jung et al. Clinical outcome of fiducial-less CyberKnife radiosurgery for stage I non-small cell lung cancer (2015).
Training strategy
• Training with pixel-level annotated data
  – Pros: better interpretability; higher accuracy
  – Cons: time-consuming for radiologists; limited availability; less scalability
• End-to-end training with patient-level labels ("PE or not?")
  – Pros: largely available training data; better scalability
  – Cons: less interpretability; potentially worse performance
• Hybrid training: combine pixel-level annotated data with patient-level labels
Hybrid Training Overview
• Stage 1: train Network A on image slabs with pixel-level annotations; the loss compares the network output against the dense annotation.
• Stage 2: fix the weights of Network A (feature encoder); feed the encodings of image slabs 1…N into Network B, which is trained with a patient-level loss to produce a binary output ("PE or not").
Stage 1: training with pixel-level annotated data
• Pixel-level annotations every 10mm
• Goal: train an image encoding network that focuses its attention on PE
Attention map
• Class activation map (CAM): indicates the discriminative image regions used by the CNN to identify a particular class.
  Zhou et al. Learning Deep Features for Discriminative Localization (2016).
• Guided attention inference networks (GAIN): supervise the attention maps while training the network.
  Li et al. Tell Me Where to Look: Guided Attention Inference Network (2018).
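The CAM idea above can be sketched in a few lines: the map for a class is the classifier's per-channel weights applied to the last conv feature maps. A minimal numpy sketch; the array shapes and function names are illustrative assumptions, not the deck's implementation:

```python
import numpy as np

def class_activation_map(features, fc_weights, class_idx):
    """Compute a CAM as the fc-weighted sum of the final conv feature maps.

    features   : (H, W, C) activations from the last conv layer
    fc_weights : (C, num_classes) weights of the final fully connected layer
    class_idx  : index of the class to visualize
    """
    cam = features @ fc_weights[:, class_idx]   # (H, W) weighted channel sum
    cam -= cam.min()
    if cam.max() > 0:
        cam /= cam.max()                        # normalize to [0, 1]
    return cam

# Toy example: 24x24 feature maps with 512 channels, 2 classes (no PE / PE)
rng = np.random.default_rng(0)
feats = rng.random((24, 24, 512))
w = rng.random((512, 2))
cam = class_activation_map(feats, w, class_idx=1)
print(cam.shape)  # (24, 24)
```

In practice the low-resolution map is upsampled back to the input size before it is overlaid on the CT slice.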
Stage 1: attention-guided training
• Backbone: ResNet18 on 2.5D inputs (384 × 384 × 5 slabs).
• Classification branch: avg pool → fc → "PE or not"; classification loss is categorical cross entropy against the slab label.
• Attention branch: 24 × 24 attention maps are compared against the down-sampled annotation mask with a Dice coefficient loss.
• Total loss = classification loss + attention loss.
• Resample volumetric images (bilinear interpolation): slice thickness [0.5mm, 5mm] → 2.5mm
• 10,388 slabs (5 slices each) of annotated pairs from 1,670 positive volumetric images
• The same number of negative slabs randomly sampled from 593 negative volumetric images
• Image cropped to center 384 × 384; intensities windowed from [-1024HU, 500HU] to [0, 255]
• 80% training, 20% validation
• Training epochs: 100 (save the model with the highest validation accuracy)
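The Stage 1 objective (categorical cross entropy on the slab label plus a Dice coefficient loss on the attention map, weighted 1:1) can be written out in numpy. The toy inputs below are illustrative only; the deck gives no reference implementation:

```python
import numpy as np

def dice_loss(attn, mask, eps=1e-6):
    """Dice coefficient loss between a predicted attention map and the
    down-sampled annotation mask (both shape (24, 24), values in [0, 1])."""
    inter = (attn * mask).sum()
    return 1.0 - (2.0 * inter + eps) / (attn.sum() + mask.sum() + eps)

def cross_entropy(probs, label):
    """Categorical cross entropy for the slab-level 'PE or not' prediction."""
    return -np.log(probs[label] + 1e-12)

# Toy slab: predicted attention map, annotation mask, class probabilities
rng = np.random.default_rng(1)
attn = rng.random((24, 24))
mask = (rng.random((24, 24)) > 0.5).astype(float)
probs = np.array([0.2, 0.8])          # softmax output: [no PE, PE]

# 1:1 weighting of the two losses, as on the slide
total = cross_entropy(probs, label=1) + dice_loss(attn, mask)
print(round(float(total), 4))
```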
Stage 1: results on the validation set
[Figure: ResNet's slab-level PE prediction results on the validation data; example attention maps (annotation mask vs. down-sampled attention map), with vs. without attention training.]
Stage 2: training data and image pre-processing
• Data pre-processing:
  – Image cropped to center 384 × 384; intensities windowed from [-1024HU, 500HU] to [0, 255]
  – Identify lung regions using a lung mask (produced by an in-house lung segmentation method), resize to 200 slices, then sample 50 slabs
  – ? × 512 × 512 → 50 × 384 × 384 × 5 (slab sampling + resize)
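A minimal numpy sketch of the pre-processing steps above: HU windowing, center cropping, and slab sampling along z. The helper names and the even-spacing sampling strategy are assumptions; the deck does not specify how the 50 slabs are chosen:

```python
import numpy as np

def window_hu(volume, lo=-1024.0, hi=500.0):
    """Clip HU values to [-1024, 500] and rescale linearly to [0, 255]."""
    v = np.clip(volume, lo, hi)
    return (v - lo) / (hi - lo) * 255.0

def center_crop(volume, size=384):
    """Crop each 512x512 slice to the central size x size region."""
    _, h, w = volume.shape
    top, left = (h - size) // 2, (w - size) // 2
    return volume[:, top:top + size, left:left + size]

def sample_slabs(volume, n_slabs=50, slab_depth=5):
    """Sample n_slabs 5-slice slabs at evenly spaced start positions."""
    starts = np.linspace(0, volume.shape[0] - slab_depth, n_slabs).astype(int)
    return np.stack([volume[s:s + slab_depth] for s in starts])

# Toy volume: 200 slices of 512x512 raw HU values
vol = np.random.default_rng(2).integers(-2000, 2000, (200, 512, 512)).astype(float)
slabs = sample_slabs(center_crop(window_hu(vol)))
print(slabs.shape)  # (50, 5, 384, 384)
```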
Stage 2: training with patient-level labeled data
• Each of the 50 slabs is encoded by the (frozen) ResNet; a neural network aggregates the 50 slab encodings into a single patient-level prediction ("PE or not").
Stage 2: training with patient-level labeled data
Recurrent framework: Conv-LSTM
• Input: last conv layer features from the ResNet for each slab: 50 × 384 × 384 × 5 images → 50 × 24 × 24 × 512 features.
• Per slab: BN → Conv-LSTM layers scanning the slab sequence downwards and upwards (× 2) → BN → max pool → flatten, giving 50 × 6 × 6 × 96 → 50 × 3456.
• Aggregation: average pooling over slabs (1 × 3456) → dropout → dense → 1 × 1 output ("PE or not").
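As a simplified illustration of the recurrent aggregation, the sketch below runs a plain (fully connected) LSTM over the 50 flattened slab feature vectors and maps the final hidden state to a PE probability. A real Conv-LSTM keeps the gates convolutional and preserves the 24 × 24 spatial grid; this numpy stand-in only shows the gating and sequence-scan logic, and all shapes and names are assumptions:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_aggregate(slab_feats, params):
    """Scan a single plain LSTM over the slab sequence and return the
    final hidden state. Wx: (4h, d), Wh: (4h, h), b: (4h,)."""
    Wx, Wh, b = params
    hdim = Wh.shape[1]
    h, c = np.zeros(hdim), np.zeros(hdim)
    for x in slab_feats:                       # slabs 1..N in scan order
        z = Wx @ x + Wh @ h + b
        i, f, o, g = np.split(z, 4)            # input/forget/output/candidate
        i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
        c = f * c + i * g                      # cell state update
        h = o * np.tanh(c)                     # hidden state
    return h

rng = np.random.default_rng(3)
feats = rng.standard_normal((50, 3456)) * 0.01   # 50 slabs x 3456 features
d, hdim = 3456, 64
params = (rng.standard_normal((4 * hdim, d)) * 0.01,
          rng.standard_normal((4 * hdim, hdim)) * 0.01,
          np.zeros(4 * hdim))
h_final = lstm_aggregate(feats, params)
w_out = rng.standard_normal(hdim) * 0.01
p_pe = sigmoid(w_out @ h_final)                  # patient-level PE probability
print(h_final.shape, 0.0 < p_pe < 1.0)           # (64,) True
```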
Stage 2: training parameters
• Classification loss: binary cross entropy (BCE)
• Optimizer: Adam
• Learning rate: 10^-4
• Training epochs: 50 (save the model with the highest validation accuracy)
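The binary cross entropy loss named above, written out in numpy for a single patient-level prediction (illustrative values only):

```python
import numpy as np

def bce(p, y, eps=1e-12):
    """Binary cross entropy between predicted probability p and label y."""
    return -(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))

print(round(float(bce(0.9, 1)), 4))  # 0.1054 -- confident correct prediction
print(round(float(bce(0.9, 0)), 4))  # 2.3026 -- confident wrong prediction
```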
Stage 2: patient-level inference results on testing data
• Training data:
  – Annotated studies: 1670+, 593-
  – Labeled volumetric images: 4186+, 4603-
  – 80% training, 20% validation
• Testing data (2160 total): 517+, 1643-

Scenario     Stage 1 data    Stage 1 loss              Stage 2 data                  AUC
Scenario 1   1670+, 593-     Atten. Loss + Cls. Loss   1670+, 593-                   0.739
Scenario 2   1670+, 593-     Cls. Loss                 1670+, 593- & 4186+, 4603-    0.643
Scenario 3   1670+, 593-     Atten. Loss + Cls. Loss   1670+, 593- & 4186+, 4603-    0.812
Comparison with state-of-the-art
• PENet: training data was labeled on a slice level for the presence/absence of a PE.
  SC Huang, et al. PENet - a scalable deep-learning model for automated diagnosis of pulmonary embolism using volumetric CT imaging (2019).
• 3D CNN: starts with an I3D model (3D CNN pretrained on a video action recognition dataset); demonstrated success in acute aortic syndrome detection; trained only on our patient-level labeled PE data.
  MS Yellapragada, et al. Deep Learning Based Detection of Acute Aortic Syndrome in Contrast CT Images (2020).
Comparison with state-of-the-art

Approach       Testset size   Clinical sites   Img. protocols   AUC     Accuracy
PENet (int.)   198            Single           Single           0.79    0.74
PENet (ext.)   227            Single           Single           0.77    0.67
3D CNN         2160           Multiple         Mixed            0.787   0.727
Proposed       2160           Multiple         Mixed            0.812   0.781

Mixed protocols:
▪ Contrast-enhanced chest CT vs. CT pulmonary angiogram
▪ Different dose levels (noise levels)
▪ Different image reconstruction kernels
▪ Slice thickness: 0.5mm-5mm
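AUC values like those in the table can be computed from patient-level scores via the rank-sum (Mann-Whitney U) formulation of ROC AUC: the probability that a random positive scores higher than a random negative. A small numpy sketch, not the authors' evaluation code:

```python
import numpy as np

def roc_auc(scores, labels):
    """ROC AUC from raw scores (labels are 0/1), with tie handling."""
    scores, labels = np.asarray(scores, float), np.asarray(labels)
    order = np.argsort(scores)
    ranks = np.empty(len(scores))
    ranks[order] = np.arange(1, len(scores) + 1)
    for s in np.unique(scores):        # average the ranks of tied scores
        idx = scores == s
        ranks[idx] = ranks[idx].mean()
    n_pos, n_neg = (labels == 1).sum(), (labels == 0).sum()
    return (ranks[labels == 1].sum() - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)

# Toy check: perfectly separated scores give AUC = 1.0
print(roc_auc([0.1, 0.2, 0.8, 0.9], [0, 0, 1, 1]))  # 1.0
```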
Auxiliary output – PE localization
Auxiliary output – PE localization
[Figure: contrast-enhanced CT slices with overlaid attention maps, two examples.]
Future Work
• Use more efficient network structures (e.g., DenseNet) to replace ResNet18.
• In Stage 1, the weights of the classification loss and attention loss can be optimized (currently 1:1).
• Fully end-to-end training, where the weights of the ResNet can also be updated.
Summary
• We introduced a deep learning model to detect PE on volumetric contrast-enhanced CT scans using a two-stage hybrid training strategy.
  – Training with an attention loss on pixel-level annotated data improves the network's localization ability.
  – The second-stage Conv-LSTM network reduces false positives in patient-level prediction.
• Our evaluation involves the largest number of patient studies among all research studies on automatic PE detection.
• Achieved state-of-the-art PE detection while providing attention maps for radiologists as references.
• Applicable to other detection problems where the availability of volumetric imaging data exceeds radiologists' capacity to manually delineate ground truth.
Acknowledgement
We would like to thank Yiting Xie, Benedikt Graf and Arkadiusz Sitek from IBM Watson Health Imaging for helping us generate the 3D CNN results.
Thank you! Q&A