Getting Started

About Me

  • Third-year Ph.D. student in Statistics, UConn.
  • Research Interests:
    • Bayesian biostatistics
    • Modelling informative dropout process
    • Machine learning and deep learning
    • Microbiome data analysis
  • Future goals:
    • Contribute to the broader field of biostatistics and statistical learning.

Assumptions about the audience

  • Have a good understanding of time-to-event data.
  • Are familiar with well-known survival analysis methods.

Aims of the lecture

  • By the end of this lecture, the participants should have a basic understanding of
    • the use of machine learning and deep learning techniques
    • the possible impacts of these techniques on the field of survival analysis

Contents

  • Basics of machine learning and deep learning
  • Illustration of machine learning and deep learning using R
  • Applications of machine learning in survival analysis
  • Recent developments
  • Future directions

Machine Learning (ML)

  • Humans learn from past experiences, whereas machines follow instructions given by humans.
  • What if humans could train machines?

Traditional vs ML algorithm

Basic paradigm of ML algorithm

An example of ML: ‘Get the cake’

An example of ML: ‘Get the cake’ cont…

An example of ML: ‘Get the cake’ cont…

Popularly used ML techniques

  • Classification trees:
    • bagging (Breiman and others 1998; Breiman 1996),
    • random forest (RF) (Breiman 2001).
  • Support vector machine
  • Neural network: shallow or deep neural network (DNN)
  • Others

An example of Bagging and RF using R

  • R Packages required: randomForestSRC, ipred, MASS, survival
data(breast, package = "randomForestSRC")
breast <- na.omit(breast)
names(breast)[1:10] # Displaying only ten variable names
##  [1] "status"             "mean_radius"        "mean_texture"      
##  [4] "mean_perimeter"     "mean_area"          "mean_smoothness"   
##  [7] "mean_compactness"   "mean_concavity"     "mean_concavepoints"
## [10] "mean_symmetry"
  • The breast dataset is from randomForestSRC R package.
  • For more details about the data: visit this link

An example of Bagging and RF using R cont…

  • The goal is to classify the status using decision trees
library(randomForestSRC)  # provides rfsrc()
library(ipred)            # provides bagging()
mod1 <- rfsrc(status ~ ., data = breast, nsplit = 10)
mod2 <- bagging(status ~ ., data = breast, coob=TRUE)
res <- as.data.frame(c(mean(mod1$err.rate[, 1], na.rm = TRUE), 
            mod2$err))
colnames(res) <- "Error Rate"
rownames(res) <- c("RSF", "Bagging")
  • The misclassification error for the two approaches
##         Error Rate
## RSF      0.2371134
## Bagging  0.2731959

Notes on Bagging and RF

  • Both are based on resampling (bootstrap) techniques, as sketched below.
  • Both are widely used for classification.
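  • A minimal sketch of the shared resampling idea, assuming the breast data from the previous slides is still loaded: each bootstrap sample draws the rows with replacement, and one classifier would be grown per sample.
B <- 5  # number of bootstrap resamples (illustrative)
boot_samples <- lapply(1:B, function(b) {
  idx <- sample(nrow(breast), replace = TRUE)  # rows drawn with replacement
  breast[idx, ]
})
sapply(boot_samples, nrow)  # each resample keeps the original sample size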

Basics of Deep Learning

  • Deep Learning is inspired by the human neural system.
    • It is a special kind of Machine Learning (Goodfellow, Bengio, and Courville 2016).
  • It is especially useful for generalizing complicated functions in high-dimensional spaces.
  • Synonymous terms:
    • deep neural network (DNN)
    • deep feed-forward networks
    • feed-forward neural networks
    • multi-layer perceptrons (MLPs).

Understanding neural system

DNN example 1

DNN example 1 cont…

DNN example 1 cont…

DNN example 2

Visualization of one-layer DNN

Visualization of two-layer DNN

Visualization of multi-layer DNN

An example of DNN

  • R Packages required: neuralnet, MASS.
  • Dataset: Boston from MASS package in R.
set.seed(500)
data(Boston, package = "MASS")
names(Boston)
##  [1] "crim"    "zn"      "indus"   "chas"    "nox"     "rm"      "age"    
##  [8] "dis"     "rad"     "tax"     "ptratio" "black"   "lstat"   "medv"
index <- sample(1:nrow(Boston), round(0.75*nrow(Boston)))
train <- Boston[index, ]
test <- Boston[-index, ]
lm.fit <- glm(medv~., data=train)
pr.lm <- predict(lm.fit, test)
MSE.lm <- sum((pr.lm - test$medv)^2)/nrow(test)

An example of DNN cont…

  • For more details about the data: see ?Boston
  • Fitting a neural network model to the Boston data
library(neuralnet)  # provides neuralnet() and compute()
maxs <- apply(Boston, 2, max)
mins <- apply(Boston, 2, min)
scaled <- as.data.frame(scale(Boston, center = mins, scale = maxs - mins))
train_ <- scaled[index, ]
test_ <- scaled[-index, ]
n <- names(train_)
f <- as.formula(paste("medv ~", paste(n[!n %in% "medv"], collapse = " + ")))
nn <- neuralnet(f, data=train_, hidden=c(5,3), linear.output=T)

An example of DNN cont…

An example of DNN cont…

pr.nn <- compute(nn,test_[,1:13])
pr.nn_ <- pr.nn$net.result*(max(Boston$medv)
          -min(Boston$medv))+min(Boston$medv)
test.r <- (test_$medv)*(max(Boston$medv)
              -min(Boston$medv))+min(Boston$medv)
MSE.nn <- sum((test.r - pr.nn_)^2)/nrow(test_)
print(paste(MSE.lm, MSE.nn))
## [1] "31.2630222372615 16.4595537665717"

An example of DNN cont…

Applications of ML

  • \(\color{blue}{\text{Data mining:}}\) web click data, medical records, Google search.
  • \(\color{blue}{\text{Signal recognition:}}\) autonomous helicopters, handwriting recognition, voice recognition, machine translation, anomaly detection.
  • \(\color{blue}{\text{Self-customizing programs:}}\) Amazon, Netflix, Bixby (Samsung), Siri (iPhone).
  • \(\color{blue}{\text{On survival analysis:}}\)
    • Predicting patients’ survival
    • Classifying competing risks for an event
    • Personalized treatment recommender system

Survival data analysis

Survival data

  • Notations: \[ \begin{align*} i &: \text{index for subject } (i=1, \ldots, n)\\ T_i^* &: \text{time for event for subject } i \\ C_i &: \text{censoring time for subject } i \\ T_i &= \min (T_i^*, C_i), \text{observed time for subject } i \\ \delta_i &= I(T_i^* \le C_i), \text{censoring indicator for subject } i\\ \mathbf{x}_i &: \text{vector of covariates for subject } i \end{align*} \]
  • Observed survival outcome: \(\{ (T_i, \delta_i), i=1, \ldots, n \}\).
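  • A small hypothetical simulation (not from the lecture data) that makes the notation concrete:
set.seed(1)
n <- 8
t_star <- rexp(n, rate = 0.10)        # event times T_i^*
cens   <- rexp(n, rate = 0.05)        # censoring times C_i
t_obs  <- pmin(t_star, cens)          # observed times T_i = min(T_i^*, C_i)
delta  <- as.numeric(t_star <= cens)  # indicators delta_i = I(T_i^* <= C_i)
data.frame(T = round(t_obs, 2), delta = delta)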

An illustration of time-to-event data

Some interesting questions

  • Might analyzing patients 1, 4, 8, and 9 give us insight into which features contribute to longer survival?
  • For patients who have survived to 12 months, what is the probability that the event occurs after time \(t\)?
  • A doctor might want to know the chance of re-hospitalization after a patient is discharged.
  • Can we predict the sub-types of the event (when the sub-types are missing) by learning from the training data?

Some real-life examples of survival data

Applications in healthcare

Applications in healthcare cont…

  • Event of interest: Rehospitalization; Disease recurrence; Cancer survival.
  • Outcome: Likelihood of hospitalization within \(t\) days of discharge.
  • Figure source: Wang, Li, and Reddy (2019)

Applications in education

Applications in education cont…

  • Event of interest: Student dropout.
  • Outcome: Likelihood of a student dropping out within \(t\) days.
  • Figure source: Wang, Li, and Reddy (2019)

Applications in Crowdfunding

Applications in Crowdfunding cont…

  • Event of interest: Project success
  • Outcome: Likelihood of a project being successful within \(t\) days.
  • Figure source: Wang, Li, and Reddy (2019)

Traditional approaches to analyse survival data

  • Non-parametric: Kaplan-Meier, Nelson-Aalen, Life table
  • Semi-parametric: Cox proportional hazards (PH) model
  • Parametric: Accelerated failure time model
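  • For instance, a minimal Kaplan-Meier sketch on the veteran data used later in these slides (assumes the survival and randomForestSRC packages are installed):
library(survival)
data(veteran, package = "randomForestSRC")
km <- survfit(Surv(time, status) ~ 1, data = veteran)  # Kaplan-Meier estimate
summary(km, times = c(30, 90, 180))  # estimated survival at selected days
# plot(km) draws the estimated survival curve with confidence bands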

Cox PH model

  • The Cox PH model (Cox 1972) is \[ \begin{equation} h_i(t | \mathbf{x}_i) = h_0(t) \exp{(\mathbf{x}'_{i} \boldsymbol{\beta})} \end{equation} \] where \(\mathbf{x}_i = (x_{i1}, \ldots, x_{ip})\) denotes the vector of covariates and \(\boldsymbol{\beta}\) denotes the corresponding regression coefficients.
  • The Cox PH partial likelihood function is given by \[ \begin{equation*} pl(\boldsymbol{\beta}) = \prod_{i=1}^{D} \Big[ \dfrac{\exp(\mathbf{x}_i^T \boldsymbol{\beta})}{\sum_{j \in \mathcal{R}(t_i)} \exp(\mathbf{x}_j^T \boldsymbol{\beta})} \Big] \end{equation*} \] where \(t_1, t_2, \ldots, t_D\) denote the ordered event times, \(\mathbf{x}_i\) here denotes the covariates of the subject with an event at \(t_i\), and \(\mathcal{R}(t_i)\) denotes the risk set at time \(t_i\).

Cox PH model cont…

  • The corresponding log partial likelihood function is \[ \begin{equation} \ell(\boldsymbol{\beta}) = \sum_{i=1}^{D} \Big( \mathbf{x}_i^T \boldsymbol{\beta} - \log \sum_{j \in \mathcal{R}(t_i)} \exp(\mathbf{x}_j^T \boldsymbol{\beta}) \Big) \end{equation} \]
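  • As a sanity check, the log partial likelihood can be coded directly (a sketch assuming no tied event times; coxph applies the Efron tie correction by default, so the values may differ slightly):
log_pl <- function(beta, time, status, x) {
  eta <- as.vector(x %*% beta)
  sum(sapply(which(status == 1), function(i) {
    eta[i] - log(sum(exp(eta[time >= time[i]])))  # sum over the risk set R(t_i)
  }))
}
fit <- coxph(Surv(time, status) ~ karno + age, data = veteran)
log_pl(coef(fit), veteran$time, veteran$status,
       as.matrix(veteran[, c("karno", "age")]))
fit$loglik[2]  # maximized log partial likelihood reported by coxph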

Estimation

  • Maximizing the log partial likelihood function gives the estimates of the model parameters.
  • The observed information matrix is obtained by evaluating the negative second derivative with respect to the model parameters.
  • Standard errors of the parameter estimates are the square roots of the diagonal of the inverse of the information matrix, as illustrated below.
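  • Continuing the coxph fit from the previous sketch, the reported standard errors come from the inverse information matrix:
sqrt(diag(vcov(fit)))                     # from the inverse information matrix
summary(fit)$coefficients[, "se(coef)"]   # matches coxph's reported SEs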

Why ML algorithms?

  • Increasing data size
  • Increasing model size
  • Increasing
    • accuracy,
    • complexity, and
    • real-world impact
  • The need for supervised learning

ML for Survival Analysis

  • Survival tree: similar to a decision tree, built by recursive splitting of tree nodes.
    • Bagging Survival Trees
    • Random Survival Forest (RSF)
  • Let us demonstrate one example in the following slides.

An example of RSF and Cox PH using R

  • R Packages required: randomForestSRC, ipred, MASS, survival
data(veteran, package = "randomForestSRC")
names(veteran) # Displaying variable names
## [1] "trt"      "celltype" "time"     "status"   "karno"    "diagtime"
## [7] "age"      "prior"
  • The veteran dataset is from randomForestSRC R package.
  • For more details about the data: visit this link

An example of RSF and Cox PH using R cont…

mod3 <- coxph(Surv(time, status)~., data=veteran,x=TRUE,y=TRUE)
mod4 <- rfsrc(Surv(time, status) ~ ., data = veteran, ntree = 100)
cindex <- as.data.frame(c(concordance(mod3)$concordance, 
            1-mean(mod4$err.rate, na.rm=T)))
colnames(cindex) <- "C-index"
rownames(cindex) <- c("Cox PH", "RSF")
  • The C-index for the two models
##          C-index
## Cox PH 0.7053612
## RSF    0.7094312

Notes on Cox PH and RSF

  • The RSF:
    • is advantageous for classification, e.g., classifying a new patient whose ‘status’ is unknown.
    • However, RSF cannot predict time-to-event or perform regression analysis.
  • The Cox PH model:
    • is preferred for predicting time-to-event and performing regression analysis.
    • However, the proportionality assumption and the linearity of the log-risk function might not be appropriate for complex data structures.

ML for Survival Analysis cont…

  • DNN: uses deep hidden layers to extract the output from the features.
    • Bayesian DNN (Polson, Sokolov, and others 2017; Ranganath et al. 2016)
  • \(\color{red}{\text{Note:}}\)
    • could be useful for both classification and regression
    • robust to the violation of the proportionality assumption

Previous works

  • Feed-forward non-linear proportional hazards model (Faraggi and Simon 1995).
    • Extended the log-risk from a linear combination of the covariates to a non-linear relationship.
    • In particular, used the logistic function with some hyper-parameters.
  • Bayesian version of the feed-forward non-linear proportional hazards model (Faraggi et al. 1997).
    • This model placed a normal prior on the parameters and derived their posterior distribution.

Cox Non-proportional Neural Network Model

  • The \(\exp(\mathbf{x}'_{i} \boldsymbol{\beta})\) term in the Cox PH model is replaced by a more general function \(\color{red}{g_{\boldsymbol{\theta}}(\mathbf{x}_i)}\) to accommodate a non-linear relationship \[ \begin{equation} h_i(t | \mathbf{x}_i) = h_0(t) \exp{(\color{red}{g_{\boldsymbol{\theta}}(\mathbf{x}_i)})} \end{equation} \]
  • The partial likelihood function \[ \begin{equation*} pl(\boldsymbol{\theta}) = \prod_{i=1}^{D} \Big[ \dfrac{\exp(\color{red}{g_{\boldsymbol{\theta}}(\mathbf{x}_i)})}{\sum_{j \in \mathcal{R}(t_i)} \exp(\color{red}{g_{\boldsymbol{\theta}}(\mathbf{x}_j)})} \Big] \end{equation*} \]

Cox Non-proportional Neural Network Model cont…

  • The following loss function is optimized \[ \begin{equation} -\dfrac{1}{N_{\delta=1}} \sum_{i=1}^{D} \Big( g_{\boldsymbol{\theta}}(\mathbf{x}_i) - \log \sum_{j \in \mathcal{R}(t_i)} \exp(g_{\boldsymbol{\theta}}(\mathbf{x}_j)) \Big) + \lambda ||\boldsymbol{\theta}||^2_2 \label{loss} \end{equation} \] where \(N_{\delta=1}\) is the number of patients with an observed event and \(\lambda\) is the \(\ell_2\) regularization parameter.
  • Gradient descent optimization is used to minimize this loss; a small sketch follows.
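  • A minimal R sketch of this loss, assuming the network outputs \(g_{\boldsymbol{\theta}}(\mathbf{x}_i)\) are supplied as a vector g and the weights as theta (placeholders for illustration; a real implementation would backpropagate through the network):
# Sketch of the regularized negative log partial likelihood above
# (no tied event times assumed); g = network outputs, theta = weights
cox_nn_loss <- function(g, time, status, theta, lambda) {
  events <- which(status == 1)
  nll <- -sum(sapply(events, function(i) {
    g[i] - log(sum(exp(g[time >= time[i]])))  # sum over the risk set R(t_i)
  })) / length(events)
  nll + lambda * sum(theta^2)  # l2 penalty on the network weights
}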

Details on layer mechanism

Details on layer mechanism cont…

Computing Gradient: Backpropagation

Computing Gradient: Backpropagation cont…

  • How does a small change in one weight (e.g. \(w^{(2)}_1\)) affect the final loss \(J(\mathbf{w})\)? \[ \begin{equation*} \dfrac{\partial J}{\partial w^{(2)}_1} = \dfrac{\partial J}{\partial \hat{y}} \dfrac{\partial \hat{y}}{\partial w^{(2)}_1} \end{equation*} \]
  • How does a small change in one weight (e.g. \(w^{(1)}_1\)) affect the final loss \(J(\mathbf{w})\)? \[ \begin{equation*} \dfrac{\partial J}{\partial w^{(1)}_1} = \dfrac{\partial J}{\partial \hat{y}} \dfrac{\partial \hat{y}}{\partial z} \dfrac{\partial z}{\partial w^{(1)}_1} \end{equation*} \]
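  • A numeric sketch of these two chain-rule products for a toy network \(z = w^{(1)}_1 x\), \(\hat{y} = w^{(2)}_1 z\) with squared-error loss \(J = (\hat{y} - y)^2/2\) (linear activations assumed for simplicity):
x <- 2; y <- 1; w1 <- 0.5; w2 <- -0.3
z    <- w1 * x        # hidden unit
yhat <- w2 * z        # network output
dJ_dyhat <- yhat - y  # dJ/dyhat for J = (yhat - y)^2 / 2
c(dJ_dw2 = dJ_dyhat * z,       # dJ/dyhat * dyhat/dw2
  dJ_dw1 = dJ_dyhat * w2 * x)  # dJ/dyhat * dyhat/dz * dz/dw1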

Gradient descent optimization

  • Initialize weights randomly \(\sim N(0, \sigma^2)\)
  • Loop until convergence
    • Compute gradient, \(\dfrac{\partial J(\mathbf{w})}{\partial \mathbf{w}}\)
    • Update weights, \(\mathbf{w}^{(t+1)} \leftarrow \mathbf{w}^{(t)} - \eta \dfrac{\partial J(\mathbf{w})}{\partial \mathbf{w}}\), where \(\eta\) is called the learning rate.
  • Return weights.
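  • A minimal sketch of this loop for the toy loss \(J(w) = (w - 3)^2\) (illustrative only; a DNN would use backpropagated gradients):
set.seed(1)
w   <- rnorm(1, mean = 0, sd = 1)  # initialize weight randomly ~ N(0, 1)
eta <- 0.1                         # learning rate
for (t in 1:100) {
  grad <- 2 * (w - 3)              # gradient of J(w) = (w - 3)^2
  w <- w - eta * grad              # weight update
}
w  # converges to the minimizer w = 3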

Complex Loss Function

Neural network for survival

Deep neural network for survival

Performance of DeepSurv

  • Evaluation Metric: Concordance (C) Index
  • It is a rank-order statistic for predictions against true outcomes.
  • The index is calculated as the ratio of concordant pairs to the total number of comparable pairs.
  • Given a comparable instance pair \((i, j)\), where \(t_i\) and \(t_j\) are the actual observed times and \(S(t_i)\) and \(S(t_j)\) are the predicted survival probabilities,
    • the pair \((i, j)\) is concordant if \(t_i > t_j\) and \(S(t_i) > S(t_j)\);
    • the pair \((i, j)\) is discordant if \(t_i > t_j\) and \(S(t_i) < S(t_j)\).
  • The concordance probability \(=Pr(\hat{T}_i < \hat{T}_j | T_i < T_j)\) measures the concordance between the rankings of actual values and predicted values.
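  • A brute-force sketch of the C-index, using risk scores from the Cox fit mod3 above (pairs with tied times or tied risk scores are skipped here, so the value can differ slightly from survival::concordance()):
cindex_pairs <- function(time, status, risk) {
  conc <- comp <- 0
  n <- length(time)
  for (i in 1:(n - 1)) for (j in (i + 1):n) {
    if (time[i] == time[j] || risk[i] == risk[j]) next
    ii <- if (time[i] < time[j]) i else j  # subject with the earlier time
    jj <- if (time[i] < time[j]) j else i
    if (status[ii] == 1) {  # comparable only if the earlier time is an event
      comp <- comp + 1
      if (risk[ii] > risk[jj]) conc <- conc + 1  # higher risk, shorter survival
    }
  }
  conc / comp
}
cindex_pairs(veteran$time, veteran$status, predict(mod3, type = "risk"))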

Performance of DeepSurv cont…

More recent developments

  • SurvELM: An R package for high dimensional survival analysis with extreme learning machine (Wang and Zhou 2018).
    • comes with an interactive shiny app: link.
  • DeepHit: A deep learning approach to survival analysis with competing risks (Lee et al. 2018).
    • can be applied in presence of competing risks data
  • Cox-nnet: an artificial neural network method for prognosis prediction of high-throughput omics data (Ching, Zhu, and Garmire 2018).
  • RNN-SURV: A deep recurrent model for survival analysis (Giunchiglia, Nemchenko, and Schaar 2018)

Future direction

  • Testing the effect of a covariate on the response.
  • The backpropagation method is computationally challenging.
  • Non-convexity of the loss surface can cause convergence problems.
  • How to choose the learning rate: \(\color{blue}{\text{fixed}}\) or \(\color{blue}{\text{adaptive}}\)?
  • Which optimization method should be used: \(\color{blue}{\text{gradient descent}}\), \(\color{blue}{\text{stochastic gradient descent}}\), or others?
  • How to handle overfitting: \(\color{blue}{\text{dropout method}}\), \(\color{blue}{\text{early stopping}}\)?
  • The design of hidden units.
  • The design of the architecture: how many units it should have and how they should be connected.
  • The distribution of hyperparameters.
  • Most of the newly developed techniques are not consistent!

Acknowledgement

  • Professor Ming-Hui Chen, Statistics, UConn.
  • STAT 5645 (Fall 2019) class audience.

Some additional resources

Thanks

References

Breiman, Leo. 1996. “Bagging Predictors.” Machine Learning 24 (2): 123–40.

———. 2001. “Random Forests.” Machine Learning 45 (1): 5–32.

Breiman, Leo, and others. 1998. “Arcing Classifier (with Discussion and a Rejoinder by the Author).” The Annals of Statistics 26 (3): 801–49.

Ching, Travers, Xun Zhu, and Lana X Garmire. 2018. “Cox-Nnet: An Artificial Neural Network Method for Prognosis Prediction of High-Throughput Omics Data.” PLoS Computational Biology 14 (4): e1006076.

Cox, David R. 1972. “Regression Models and Life-Tables.” Journal of the Royal Statistical Society: Series B (Methodological) 34 (2): 187–202.

Faraggi, David, and Richard Simon. 1995. “A Neural Network Model for Survival Data.” Statistics in Medicine 14 (1): 73–82.

Faraggi, David, R Simon, E Yaskil, and A Kramar. 1997. “Bayesian Neural Network Models for Censored Data.” Biometrical Journal 39 (5): 519–32.

Giunchiglia, Eleonora, Anton Nemchenko, and Mihaela van der Schaar. 2018. “RNN-Surv: A Deep Recurrent Model for Survival Analysis.” In International Conference on Artificial Neural Networks, 23–32. Springer.

Goodfellow, Ian, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. MIT Press.

Lee, Changhee, William R Zame, Jinsung Yoon, and Mihaela van der Schaar. 2018. “DeepHit: A Deep Learning Approach to Survival Analysis with Competing Risks.” In Thirty-Second AAAI Conference on Artificial Intelligence.

Polson, Nicholas G, Vadim Sokolov, and others. 2017. “Deep Learning: A Bayesian Perspective.” Bayesian Analysis 12 (4): 1275–1304.

Ranganath, Rajesh, Adler Perotte, Noémie Elhadad, and David Blei. 2016. “Deep Survival Analysis.” arXiv Preprint arXiv:1608.02158.

Wang, Hong, and Lifeng Zhou. 2018. “SurvELM: An R Package for High Dimensional Survival Analysis with Extreme Learning Machine.” Knowledge-Based Systems 160: 28–33.

Wang, Ping, Yan Li, and Chandan K Reddy. 2019. “Machine Learning for Survival Analysis: A Survey.” ACM Computing Surveys (CSUR) 51 (6): 110.