Information extraction from microarray data: A survey of data mining techniques

Alessandro Fiori, Alberto Grand, Giulia Bruno, Francesco Gavino Brundu, Domenico Schioppa, Andrea Bertotti

Research output: Contribution to journalArticle

Abstract

Nowadays, a huge amount of high throughput molecular data are available for analysis and provide novel and useful insights into complex biological systems, through the acquisition of a high-resolution picture of their molecular status in defined experimental conditions. In this context, microarrays are a powerful tool to analyze thousands of gene expression values with a single experiment. A number of approaches have been developed to detecting genes highly correlated to diseases, selecting genes that exhibit a similar behavior under specific conditions, building models to predict disease outcome based on genetic profiles, and inferring regulatory networks. This paper discusses popular and recent data mining techniques (i.e., Feature Selection, Clustering, Classification, and Association Rule Mining) applied to microarray data. The main characteristics of microarray data and preprocessing procedures are presented to understand the critical issues introduced by gene expression values analysis. Each technique is analyzed, and relevant examples of pertinent literature are reported. Moreover, real use cases exploiting analytic pipelines that use these methods are also introduced. Finally, future directions of data mining research on microarray data are envisioned.

Original languageEnglish
Pages (from-to)29-58
Number of pages30
JournalJournal of Database Management
Volume25
Issue number1
DOIs
Publication statusPublished - Jan 1 2014

Keywords

  • Association rules
  • Classification
  • Clustering
  • Data analysis
  • Data mining
  • Data normalization
  • Feature Selection
  • Microarray

ASJC Scopus subject areas

  • Information Systems
  • Hardware and Architecture
  • Software

Fingerprint Dive into the research topics of 'Information extraction from microarray data: A survey of data mining techniques'. Together they form a unique fingerprint.

  • Cite this