OBJECTIVES: To evaluate the accuracy of a data-driven approach, such as machine learning classification, in predicting disability progression in MS.
METHODS: We analyzed structural brain images of 163 subjects diagnosed with MS acquired at two different sites. Participants were followed up for 2-6 years, with disability progression defined according to the expanded disability status scale (EDSS) increment at follow-up. T2-weighted lesion load (T2LL), thalamic and cerebellar gray matter (GM) volumes, fractional anisotropy of the normal appearing white matter were calculated at baseline and included in supervised machine learning classifiers. Age, sex, phenotype, EDSS at baseline, therapy and time to follow-up period were also included. Classes were labeled as stable or progressed disability. Participants were randomly chosen from both sites to build a sample including 50% patients showing disability progression and 50% patients being stable. One-thousand machine learning classifiers were applied to the resulting sample, and after testing for overfitting, classifier confusion matrix, relative metrics and feature importance were evaluated.
RESULTS: At follow-up, 36% of participants showed disability progression. The classifier with the highest resulting metrics had accuracy of 0.79, area under the true positive versus false positive rates curve of 0.81, sensitivity of 0.90 and specificity of 0.71. T2LL, thalamic volume, disability at baseline and administered therapy were identified as important features in predicting disability progression. Classifiers built on radiological features had higher accuracy than those built on clinical features.
CONCLUSIONS: Disability progression in MS may be predicted via machine learning classifiers, mostly evaluating neuroradiological features.