Temporal electronic phenotyping by mining careflows of breast cancer patients

Arianna Dagliati, L. Sacchi, A. Zambelli, V. Tibollo, L. Pavesi, John H. Holmes, R. Bellazzi

Research output: Contribution to journalArticlepeer-review


In this work we present a careflow mining approach designed to analyze heterogeneous longitudinal data and to identify phenotypes in a patient cohort. The main idea underlying our approach is to combine methods derived from sequential pattern mining and temporal data mining to derive frequent healthcare histories (careflows) in a population of patients. This approach was applied to an integrated data repository containing clinical and administrative data of more than 4000 breast cancer patients. We used the mined histories to identify sub-cohorts of patients grouped according to healthcare activities pathways, then we characterized these sub-cohorts with clinical data. In this way, we were able to perform temporal electronic phenotyping of electronic health records (EHR) data.

Original languageEnglish
Pages (from-to)136-147
Number of pages12
JournalJournal of Biomedical Informatics
Publication statusPublished - Feb 1 2017


  • Careflow mining
  • Electronic phenotyping
  • Heterogeneous data sets
  • Temporal data mining

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics


Dive into the research topics of 'Temporal electronic phenotyping by mining careflows of breast cancer patients'. Together they form a unique fingerprint.

Cite this