Missing data imputation in longitudinal cohort studies - Application of PLANN-ARD in breast cancer survival

Ana S. Fernandes, Ian H. Jarman, Terence A. Etchells, José M. Fonseca, Elia Biganzoli, Chris Bajdik, Paulo J G Lisboa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Missing values are common in medical datasets and may be amenable to data imputation when modelling a given data set or validating on an external cohort. This paper discusses model averaging over samples of the imputed distribution and extends this approach to generic non-linear modelling with the Partial Logistic Artificial Neural Network (PLANN) regularised within the evidence-based framework with Automatic Relevance Determination (ARD), The study then applies the imputation to external validation over new patient cohorts, considering also the case of predictions made for individual patients. A prognostic index is defined for the non-linear model and validation results show that 4 statistically significant risk groups identified at the 95% level of confidence from the modelling data, from Christie Hospital (n=931), retain good separation during external validation with data from the British Columbia Cancer Agency (n=4, 083).

Original languageEnglish
Title of host publicationProceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008
Pages644-649
Number of pages6
DOIs
Publication statusPublished - 2008
Event7th International Conference on Machine Learning and Applications, ICMLA 2008 - San Diego, CA, United States
Duration: Dec 11 2008Dec 13 2008

Other

Other7th International Conference on Machine Learning and Applications, ICMLA 2008
CountryUnited States
CitySan Diego, CA
Period12/11/0812/13/08

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Software

Fingerprint Dive into the research topics of 'Missing data imputation in longitudinal cohort studies - Application of PLANN-ARD in breast cancer survival'. Together they form a unique fingerprint.

  • Cite this

    Fernandes, A. S., Jarman, I. H., Etchells, T. A., Fonseca, J. M., Biganzoli, E., Bajdik, C., & Lisboa, P. J. G. (2008). Missing data imputation in longitudinal cohort studies - Application of PLANN-ARD in breast cancer survival. In Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008 (pp. 644-649). [4725043] https://doi.org/10.1109/ICMLA.2008.106