A methodology to identify consensus classes from clustering algorithms applied to immunohistochemical data from breast cancer patients

Daniele Soria, Jonathan M. Garibaldi, Federico Ambrogi, Andrew R. Green, Des Powe, Emad Rakha, R. Douglas Macmillan, Roger W. Blamey, Graham Ball, Paulo J G Lisboa, Terence A. Etchells, Patrizia Boracchi, Elia Biganzoli, Ian O. Ellis

Research output: Contribution to journalArticle

Abstract

Single clustering methods have often been used to elucidate clusters in high dimensional medical data, even though reliance on a single algorithm is known to be problematic. In this paper, we present a methodology to determine a set of 'core classes' by using a range of techniques to reach consensus across several different clustering algorithms, and to ascertain the key characteristics of these classes. We apply the methodology to immunohistochemical data from breast cancer patients. In doing so, we identify six core classes, of which several may be novel sub-groups not previously emphasised in literature.

Original languageEnglish
Pages (from-to)318-330
Number of pages13
JournalComputers in Biology and Medicine
Volume40
Issue number3
DOIs
Publication statusPublished - Mar 2010

Fingerprint

Clustering algorithms
Cluster Analysis
Breast Neoplasms

Keywords

  • Breast cancer
  • Clustering methods
  • Consensus clustering
  • Molecular classification
  • Validity indices

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Cite this

A methodology to identify consensus classes from clustering algorithms applied to immunohistochemical data from breast cancer patients. / Soria, Daniele; Garibaldi, Jonathan M.; Ambrogi, Federico; Green, Andrew R.; Powe, Des; Rakha, Emad; Douglas Macmillan, R.; Blamey, Roger W.; Ball, Graham; Lisboa, Paulo J G; Etchells, Terence A.; Boracchi, Patrizia; Biganzoli, Elia; Ellis, Ian O.

In: Computers in Biology and Medicine, Vol. 40, No. 3, 03.2010, p. 318-330.

Research output: Contribution to journalArticle

Soria, D, Garibaldi, JM, Ambrogi, F, Green, AR, Powe, D, Rakha, E, Douglas Macmillan, R, Blamey, RW, Ball, G, Lisboa, PJG, Etchells, TA, Boracchi, P, Biganzoli, E & Ellis, IO 2010, 'A methodology to identify consensus classes from clustering algorithms applied to immunohistochemical data from breast cancer patients', Computers in Biology and Medicine, vol. 40, no. 3, pp. 318-330. https://doi.org/10.1016/j.compbiomed.2010.01.003
Soria, Daniele ; Garibaldi, Jonathan M. ; Ambrogi, Federico ; Green, Andrew R. ; Powe, Des ; Rakha, Emad ; Douglas Macmillan, R. ; Blamey, Roger W. ; Ball, Graham ; Lisboa, Paulo J G ; Etchells, Terence A. ; Boracchi, Patrizia ; Biganzoli, Elia ; Ellis, Ian O. / A methodology to identify consensus classes from clustering algorithms applied to immunohistochemical data from breast cancer patients. In: Computers in Biology and Medicine. 2010 ; Vol. 40, No. 3. pp. 318-330.
@article{441902975e10426c85425ef888d67f3d,
title = "A methodology to identify consensus classes from clustering algorithms applied to immunohistochemical data from breast cancer patients",
abstract = "Single clustering methods have often been used to elucidate clusters in high dimensional medical data, even though reliance on a single algorithm is known to be problematic. In this paper, we present a methodology to determine a set of 'core classes' by using a range of techniques to reach consensus across several different clustering algorithms, and to ascertain the key characteristics of these classes. We apply the methodology to immunohistochemical data from breast cancer patients. In doing so, we identify six core classes, of which several may be novel sub-groups not previously emphasised in literature.",
keywords = "Breast cancer, Clustering methods, Consensus clustering, Molecular classification, Validity indices",
author = "Daniele Soria and Garibaldi, {Jonathan M.} and Federico Ambrogi and Green, {Andrew R.} and Des Powe and Emad Rakha and {Douglas Macmillan}, R. and Blamey, {Roger W.} and Graham Ball and Lisboa, {Paulo J G} and Etchells, {Terence A.} and Patrizia Boracchi and Elia Biganzoli and Ellis, {Ian O.}",
year = "2010",
month = "3",
doi = "10.1016/j.compbiomed.2010.01.003",
language = "English",
volume = "40",
pages = "318--330",
journal = "Computers in Biology and Medicine",
issn = "0010-4825",
publisher = "Elsevier Limited",
number = "3",

}

TY - JOUR

T1 - A methodology to identify consensus classes from clustering algorithms applied to immunohistochemical data from breast cancer patients

AU - Soria, Daniele

AU - Garibaldi, Jonathan M.

AU - Ambrogi, Federico

AU - Green, Andrew R.

AU - Powe, Des

AU - Rakha, Emad

AU - Douglas Macmillan, R.

AU - Blamey, Roger W.

AU - Ball, Graham

AU - Lisboa, Paulo J G

AU - Etchells, Terence A.

AU - Boracchi, Patrizia

AU - Biganzoli, Elia

AU - Ellis, Ian O.

PY - 2010/3

Y1 - 2010/3

N2 - Single clustering methods have often been used to elucidate clusters in high dimensional medical data, even though reliance on a single algorithm is known to be problematic. In this paper, we present a methodology to determine a set of 'core classes' by using a range of techniques to reach consensus across several different clustering algorithms, and to ascertain the key characteristics of these classes. We apply the methodology to immunohistochemical data from breast cancer patients. In doing so, we identify six core classes, of which several may be novel sub-groups not previously emphasised in literature.

AB - Single clustering methods have often been used to elucidate clusters in high dimensional medical data, even though reliance on a single algorithm is known to be problematic. In this paper, we present a methodology to determine a set of 'core classes' by using a range of techniques to reach consensus across several different clustering algorithms, and to ascertain the key characteristics of these classes. We apply the methodology to immunohistochemical data from breast cancer patients. In doing so, we identify six core classes, of which several may be novel sub-groups not previously emphasised in literature.

KW - Breast cancer

KW - Clustering methods

KW - Consensus clustering

KW - Molecular classification

KW - Validity indices

UR - http://www.scopus.com/inward/record.url?scp=77549085806&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77549085806&partnerID=8YFLogxK

U2 - 10.1016/j.compbiomed.2010.01.003

DO - 10.1016/j.compbiomed.2010.01.003

M3 - Article

C2 - 20106472

AN - SCOPUS:77549085806

VL - 40

SP - 318

EP - 330

JO - Computers in Biology and Medicine

JF - Computers in Biology and Medicine

SN - 0010-4825

IS - 3

ER -