Prediction models:
Machine Learning
vs Cox model

Prediction models with survival data: a comparison between Machine Learning methods and the Cox PH regression model

Background

This research project considers various predictive models for modeling osteosarcoma survival and disease progression. Various methods are examined, two of which are statistical approaches and extensions of the well-used Cox proportional hazard (PH) regression model, and two of which are data-based approaches, using machine learning techniques. Over the last decade interest in and publications on machine learning (ML) approaches in medical and specifically cancer research has grown, with analyses chiefly involving data with large numbers of covariates (large p), such as genomic data. A very limited amount of this has focused on survival outcomes specifically, and even less on clinical data with a small p. The limited research in this area coupled with the increasing popularity of ML methods in general has prompted a study into the potential of ML for analyzing clinical data with a small p.


Aims

Two applications – a dynamic prediction model and multi-state model - are performed with an aim towards clinical inference. The purpose is to obtain information not available from a simple Cox model, and evaluate this contribution of information. More concretely for the multi-state model and dynamic prediction model the respective objectives are as follows:

An in-depth comparison of the two ML methods – artificial neural networks and random survival forests –is performed in an application and simulation study with the purpose of identifying advantages and pitfalls and the global objective of gaining insight into which methods may be suitable for clinical data with a small p. Of particular interest is the trade-off between flexibility and interpretability and model stability. Concrete aims are:


Relevance for cancer research

Within this project, research will be carried out in order to comprehensively establish the potential of ML for survival analysis of cancer data. Methodology developed will be applied to clinical trial data from the EURAMOS-1 study. The methodology of this research could be applicable to any kind of cancerous diseases.


Project Outcomes


Team

Twitter
Email