Large-scale open bioinformatics data resources

Elia Stupka

Research output: Contribution to journalArticlepeer-review


The data explosion in bioinformatics is relentless. More and more genomes are being sequenced and many new types of datasets are being generated in large-scale, projects. Integration and true open access to the data are still difficult issues, although they are gradually being addressed. Notably, certain fields have good standardization and interoperability, while others lag behind. This review summarizes the latest developments in genome and sequences databases, transcriptomics data (ESTs, ORESTES, MI-length cDNAs), proteomics data (protein databases, protein structures, family and domain classification) as well as loosely integrated fields, such as microarray experiments, mutation databases and databases of regulatory regions and elements. The review attempts to resist simply summarizing what data are available, and aims to provide a critical look at some of the integration and access issues associated with several of these resouces.

Original languageEnglish
Pages (from-to)265-274
Number of pages10
JournalCurrent Opinion in Molecular Therapeutics
Issue number3
Publication statusPublished - Jun 2002


  • Bioinformatics
  • Database
  • Genome
  • Integration
  • Microarray
  • Open source
  • Proteome
  • Transcriptome

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)
  • Pharmacology, Toxicology and Pharmaceutics(all)


Dive into the research topics of 'Large-scale open bioinformatics data resources'. Together they form a unique fingerprint.

Cite this