Data mining techniques for the identification of genes with expression levels related to breast cancer prognosis

Gabriele Giarratana, Marco Pizzera, Marco Masseroli, Enzo Medico, Pier Luca Lanzi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Providing clinical predictions for cancer patients by analyzing their genetic make-up is a difficult and very important issue. With the goal of identifying the genes most correlated with the prognosis of breast cancer, we used data mining techniques to study the gene expression values of breast cancer patients with known clinical outcome. The focus of our work was the creation of a classification model to be used in clinical practice to support therapy prescription. We randomly subdivided a gene expression dataset of 311 samples into a training set to learn the model and a test set to validate the model and assess its performance. We evaluated several learning algorithms in their unweighted and weighted forms, the latter of which we defined to take into account the different clinical importance of false positive and false negative classifications. Based on our results, the weighted algorithms, especially when used in their combined form, appear to perform better.
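The two ideas named in the abstract, a random training/test subdivision and an evaluation that penalizes false positives and false negatives differently, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the test fraction, the cost values, the random seed, the label encoding (1 for one outcome class, 0 for the other), and the helper names `split_indices` and `weighted_cost` are all assumptions made for illustration, since the abstract does not specify them.

```python
import random


def split_indices(n_samples, test_fraction=0.3, seed=0):
    """Randomly partition sample indices into training and test sets.

    test_fraction and seed are illustrative assumptions, not values
    taken from the paper.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]


def weighted_cost(y_true, y_pred, fp_cost=1.0, fn_cost=5.0):
    """Average misclassification cost with separate FP and FN penalties.

    Labels are assumed to be 0/1; the cost values are hypothetical.
    """
    total = 0.0
    for truth, pred in zip(y_true, y_pred):
        if pred == 1 and truth == 0:
            total += fp_cost  # false positive
        elif pred == 0 and truth == 1:
            total += fn_cost  # false negative
    return total / len(y_true)


# 311 is the dataset size stated in the abstract.
train_idx, test_idx = split_indices(311)
```

With equal costs, `weighted_cost` reduces to the ordinary error rate, so the unweighted case is recovered by setting `fp_cost=1.0` and `fn_cost=1.0`.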

Original language: English
Title of host publication: Proceedings of the 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009
Pages: 295-300
Number of pages: 6
Publication status: Published - 2009
Event: 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009 - Taichung, Taiwan, Province of China
Duration: Jun 22 2009 to Jun 24 2009

Other

Other: 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009
Country: Taiwan, Province of China
City: Taichung
Period: 6/22/09 to 6/24/09

Keywords

  • Breast cancer prognosis
  • Data mining
  • Gene expression

ASJC Scopus subject areas

  • Information Systems
  • Biomedical Engineering
  • Health Informatics
