Deep Recurrent Survival Analysis Kan Ren, Jiarui Qin, Lei Zheng, - PowerPoint PPT Presentation

Deep Recurrent Survival Analysis Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Weinan Zhang, Lin Qiu, Yong Yu Apex Data & Knowledge Management Lab Shanghai Jiao Tong University

Table of Contents • Background • Deep Recurrent Model • Loss Functions • Experiments

Background • Time-to-event data analysis • The probability of the event over time. • May have different meanings in different areas. Area Time Event Event Probability Medicine Research Survival time Disease Survival rate Information System Duration time Next visit Visiting rate Second-price Auction Bid price Winning the Losing rate auction

Survival Analysis (SA) • Survival Analysis • To analyze the expected duration of time until one or more events happen.

Task of SA • Given the feature of the sample, forecast • the probability of event happening at each time: p(#) • the probability of event happened at that time: W & • the probability of event not happened at the time: ' & • 2 goals • Probability density function (P.D.F.) of the event prob. over time. • Cumulative distribution function (C.D.F.) of the event at the time . • 2 relationships between the three prob. functions , - # .# • Event Rate: ( & = ∫ + / - # .# = 1 − ((&) • Survival Rate: S & = ∫ ,

Challenges in SA • No ground truth • For the form of the event probability distribution • For the value of the event probability • Sparsity • Event is sparse, rare to happen • Censorship • Some clues are censored (without the true event time)

Censorship http://www.karlin.mff.cuni.cz/~pesta/NMFM404/survival.html

Censorship (cont.) • For the censored samples: • Observing time ! • True event time z is unknown • Only knows that • Right censored: ! < $ • Left censored: ! > $ • Interval censored: $ ∈ [! ( , ! * ]

Task Formulation • Data format ( (", $, %) ' • • " : sample feature • $ : observing time • % : true event time • % is known for uncensored data ( $ > % ); • % is unknown for censored data ( $ < % ). • Input: • Sample features " • Output • P.D.F. of event probability + , % • C.D.F. of event rate -($) & survival rate . $ = 1 − -($)

Existing Methods • Statistical methods • Kaplan-Meier method • Coarse-grained, counting-based, low generalization Kaplan and Meier 1958.

Existing Methods (cont.) • Statistical methods • Cox proportional hazard (CPH) model • Hazard function • The probability of event occurring at time 8 given not occurred before . • : 8 ; = : = 8 > ?@ • The base hazard function has some assumptions, e.g., Weibull distribution. • Drawback: not flexible in practice. Cox 1992; Zhang and Lu 2007.

Existing Methods (cont.) • Machine learning methods • Survival tree model • Drawback: • based on segmented data • coarse-grained Wang et al. 2016.

Existing Methods (cont.) • Deep learning method • DeepSurv 1 • bases on CPH method using deep learning as enhanced feature extraction. • DeepHit 2 • directly predicts ! " at each time • calculates #(%) by summing ! " over [1, %] 1. Katzman et al. 2018; 2. Lee et al. 2018.

Cons of the Existing Methods • Statistical methods • Counting-based statistics, loss of generality • Kaplan-Meier • Specific form of the probability distribution • CPH, Lasso-cox • Machine learning methods • Based on segmented data, too coarse-grained • Survival Trees • Assumption of the specific form of distribution • DeepSurv • No consideration about sequential patterns over time!

Deep Recurrent Survival Analysis (DRSA) • No assumption about distributional forms • Captures sequential patterns in the feature-time space • First work ever, utilizes auto-regressive model for SA • Handling censorship with unbiased learning • Significant improvement against both stat. methods and ML methods

Our method • Discrete time model # $ … , . , < , $-< , $-. , $ , $=. , $=< • ! ∈ # $ means event occurs at time % • ! ∉ # $ means event not occurs at time % • Hazard rate function, means the event probability at that time given not happened before . • ℎ $ = Pr ! ∈ # $ ! > , $-. , 0; 2 = 3 2 (0, , $ |6 $-. ) • Use the recurrent cell 3 8 to model cond. probability ℎ $ • 9 $-. is the transmitted information through time • : ; , , $ are the input to the unit

Relationships among Probability Functions • ! " # $; & = Pr " # < + $; & = Pr + ∉ - . , + ∉ - 0 , … , + ∉ - # $; & = Pr + ∉ - . $; & 2 Pr + ∉ - 0 + ∉ - . , $; & ⋯ 2 Pr + ∉ - # + ∉ - . , … , + ∉ - #4. , $; & = 5 1 − Pr(+ ∈ - 6 |+ > " 64. , $; &) 6:68# Probability chain rule = 5 (1 − ℎ 6 ) E F . , F 0 , F G = E F G F . , F 0 E F 0 F . E(F . ) 6:68# • A " # $; & = 1 − ! " $; & = 1- ∏ 6:68# (1 − ℎ 6 ) # $; & = ℎ # ∏ 6:6D# (1 − ℎ 6 ) • C # = Pr + ∈ -

The Recurrent Model

Loss Functions (1/3) • Uncensored data • P.D.F. loss on the true event time ! • Maximize the log likelihood

Loss Functions (2/3) • Uncensored data ( ! < # ) • C.D.F. loss on the observing time # • Maximize the log partial likelihood

Loss Functions (3/3) • Censored data ( ! is unknown since ! > # ) • C.D.F. loss on the observing time # • Maximize the log partial likelihood • Unbiased learning

Loss Functions (cont.) • Three losses Uncensored Data Censored Data ! = ! # + ! %&'(&)*+(, + ! '(&)*+(, P.D.F. Loss C.D.F. Loss

Intuition behind C.D.F. Losses Uncensored Case Censored Case (z has been known) (z is unknown) We need to • Push down ↓ the survival curve "($) when • event occurred before $ , i.e., z < $ for uncensored data. • Pull up ↑ the survival curve "($) when • event not occurs before $ , i.e., z > $ for censored data. •

Experiments • 3 real-world large-scale datasets • 2 evaluation metrics • 6 compared baseline models

Datasets • 3 real-world large-scale datasets • Download link of the processed data: • https://goo.gl/nUFND4. • CLINIC from medicine research • MUSIC from information systems • BIDDING from economics

Evaluation Metrics • ANLP • Averaged negative log probability • of the true event time ! • C-index • Time-dependent concordance index • measures the ranking performance of the censorship prediction at the given time. • The same as Area under ROC Curve in IR

Experiment Results Performance comparison on C-index (the higher, the better) and ANLP (the lower, the better). (* indicates p- value < 10 − 6 in significance test)

Learning Curves

Survival Curves 6urvLvDl CurvH Rf DLffHrHnt 0RGHls PrREDELlLty CurvH Rf DLffHrHnt 0RGHls 1.0 0.25 .0 .0 LDssR-CRx LDssR-CRx GDPPD GDPPD 0.8 0.20 PrREDELlLty Rf (vHnt S ( z ) 670 670 6urvLvDl 5DtH S ( t ) DHHS6urv DHHS6urv 0.6 DHHSHLt DHHSHLt 0.15 D56A D56A 0.4 0.10 0.2 0.05 0.0 0.00 z i 67 z i 67 tLPH tLPH

Conclusion • Thank you for attention! • We argued that, in survival analysis, • Sequential patterns over time should be considered. • More supervision over [", $] should be made. • We proposed • 1 st work using auto-regressive model for survival analysis. • DRSA (https://github.com/rk2900/drsa) • Utilizes recurrent neural cell predicting the conditional hazard rate; • Estimates the true event ratio and survival rate through probability chain rule; • Achieves significant improvements against strong baselines.

Deep Recurrent Survival Analysis Kan Ren, Jiarui Qin, Lei Zheng, - PowerPoint PPT Presentation

Deep Recurrent Survival Analysis Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Weinan Zhang, Lin Qiu, Yong Yu Apex Data & Knowledge Management Lab Shanghai Jiao Tong University Table of Contents Background Deep Recurrent Model

Survival Analysis / Time-to- Event Analysis in R Heidi Seibold Statistician at LMU Munich

CHAPTER VII VII CHAPTER Learning in Recurrent Networks Learning in Recurrent Networks CHAPTER

CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I :

CS6501: Deep Learning for Visual Recognition Recurrent Neural Networks (RNNs) Todays Class

CS6501: Deep Learning for Visual Recognition Recurrent Neural Networks (RNNs) Todays Class

Survival Analysis Mark Lunt Centre for Epidemiology Versus Arthritis University of Manchester

Why use the Weibull model? Heidi Seibold Statistician at LMU Munich DataCamp Survival Analysis

Kaplan-Meier estimate Heidi Seibold Statistician at LMU Munich DataCamp Survival Analysis in R

Survival curve showing cohorts Overall Survival Survival Frequency Time (%) 1 year 53.7 2

Novocure Announces the Presentation of Subgroup Survival Data for Recurrent Glioblastoma Patients

RcmdrPlugin.survival : An R Commander Plug-in Package for Survival Analysis John Fox McMaster

Recurrent Neural Network Xiaogang Wang xgwang@ee.cuhk.edu.hk February 26, 2019 cuhk Xiaogang

Introduction CSCE CSCE 496/896 496/896 Lecture 6: Lecture 6: Recurrent Recurrent CSCE

Recurrent Neural Networks Greg Mori - CMPT 419/726 Goodfellow, Bengio, and Courville: Deep

Recurrent Neural Networks CS60010: Deep Learning Abir Das IIT Kharagpur Mar 11, 2020

Survival Analysis: Introduction Survival Analysis typically focuses on time to event data. In the

Stochastic Processes Will Perkins March 7, 2013 Stochastic Processes Q: What is a Stochastic

IN5550 Neural Methods in Natural Language Processing Applications of Recurrent Neural Networks

Non-projective Dependency-based Pre-Reordering with Recurrent Neural Network for Machine

A Comprehensive Survey on Deep Future Frame Video Prediction by Javier Selva Castell

Machine Learning for Computational Linguistics Recurrent neural networks (RNNs) ar

Machine Translation 2 Wikipedia Machine translation, often referred to by the acronym MT, is a

Vicsek Model Vicsek et al., 1995 YouTube: The Greatest Bird Show on Earth Characteristics such as

3 7 6 4 1 5 2 9 7 1 5 8 6 4 Meal Counting 5 3 1 0 7 9 Do's & Don'ts 8 5

Sambuz

Useful Links

Newsletter

Mail Us

Deep Recurrent Survival Analysis Kan Ren, Jiarui Qin, Lei Zheng, - PowerPoint PPT Presentation

Deep Recurrent Survival Analysis Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Weinan Zhang, Lin Qiu, Yong Yu Apex Data & Knowledge Management Lab Shanghai Jiao Tong University Table of Contents Background Deep Recurrent Model

Survival Analysis / Time-to- Event Analysis in R Heidi Seibold Statistician at LMU Munich

CHAPTER VII VII CHAPTER Learning in Recurrent Networks Learning in Recurrent Networks CHAPTER

CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I :

CS6501: Deep Learning for Visual Recognition Recurrent Neural Networks (RNNs) Todays Class

CS6501: Deep Learning for Visual Recognition Recurrent Neural Networks (RNNs) Todays Class

Survival Analysis Mark Lunt Centre for Epidemiology Versus Arthritis University of Manchester

Why use the Weibull model? Heidi Seibold Statistician at LMU Munich DataCamp Survival Analysis

Kaplan-Meier estimate Heidi Seibold Statistician at LMU Munich DataCamp Survival Analysis in R

Survival curve showing cohorts Overall Survival Survival Frequency Time (%) 1 year 53.7 2

Novocure Announces the Presentation of Subgroup Survival Data for Recurrent Glioblastoma Patients

RcmdrPlugin.survival : An R Commander Plug-in Package for Survival Analysis John Fox McMaster

Recurrent Neural Network Xiaogang Wang xgwang@ee.cuhk.edu.hk February 26, 2019 cuhk Xiaogang

Introduction CSCE CSCE 496/896 496/896 Lecture 6: Lecture 6: Recurrent Recurrent CSCE

Recurrent Neural Networks Greg Mori - CMPT 419/726 Goodfellow, Bengio, and Courville: Deep

Recurrent Neural Networks CS60010: Deep Learning Abir Das IIT Kharagpur Mar 11, 2020

Survival Analysis: Introduction Survival Analysis typically focuses on time to event data. In the

Stochastic Processes Will Perkins March 7, 2013 Stochastic Processes Q: What is a Stochastic

IN5550 Neural Methods in Natural Language Processing Applications of Recurrent Neural Networks

Non-projective Dependency-based Pre-Reordering with Recurrent Neural Network for Machine

A Comprehensive Survey on Deep Future Frame Video Prediction by Javier Selva Castell

Machine Learning for Computational Linguistics Recurrent neural networks (RNNs) ar

Machine Translation 2 Wikipedia Machine translation, often referred to by the acronym MT, is a

Vicsek Model Vicsek et al., 1995 YouTube: The Greatest Bird Show on Earth Characteristics such as

3 7 6 4 1 5 2 9 7 1 5 8 6 4 Meal Counting 5 3 1 0 7 9 Do's &amp; Don'ts 8 5

Sambuz

Useful Links

Newsletter

Mail Us

3 7 6 4 1 5 2 9 7 1 5 8 6 4 Meal Counting 5 3 1 0 7 9 Do's & Don'ts 8 5