Genewebex

Gene annotation Web extraction, aggregation, and updating from Web-interfaced biomolecular databanks

Marco Masseroli, Andrea Stella, Myriam Alcalay, Francesco Pinciroli

Research output: Contribution to journalArticle

Abstract

Numerous genomic annotations are currently stored in different Web-accessible databanks that scientists need to mine with user-defined queries and in a batch mode to orderly integrate the diverse extracted data in suitable user-customizable working environments. Unfortunately, to date, most accessible databanks can be interrogated only for a single gene or protein at a time and generally the data retrieved are available in HTML page format only. We developed GeneWebEx to effectively mine data of interest in different HTML pages of Web-interfaced databanks, and organize extracted data for further analyses. GeneWebEx utilizes user-defined templates to identify data to extract, and aggregates and structures them in a database designed to allocate the various extractions from distinct biomolecular databanks. Moreover, a template-based module enables automatic updating of extracted data. Validations performed on GeneWebEx allowed us to efficiently gather relevant annotations from various sources, and comprehensively query them to highlight significant biological characteristics.

Original languageEnglish
Pages (from-to)511-526
Number of pages16
JournalInternational Journal of Software Engineering and Knowledge Engineering
Volume15
Issue number3
DOIs
Publication statusPublished - Jun 2005

Fingerprint

HTML
Agglomeration
Genes
World Wide Web
Proteins

Keywords

  • Biomolecular database
  • Data extraction
  • Genomic information
  • Microarray data interpretation
  • Web wrapper

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Artificial Intelligence
  • Computer Graphics and Computer-Aided Design
  • Software

Cite this

Genewebex : Gene annotation Web extraction, aggregation, and updating from Web-interfaced biomolecular databanks. / Masseroli, Marco; Stella, Andrea; Alcalay, Myriam; Pinciroli, Francesco.

In: International Journal of Software Engineering and Knowledge Engineering, Vol. 15, No. 3, 06.2005, p. 511-526.

Research output: Contribution to journalArticle

@article{33b9cbc4cbb848cab2111bf707b64b89,
title = "Genewebex: Gene annotation Web extraction, aggregation, and updating from Web-interfaced biomolecular databanks",
abstract = "Numerous genomic annotations are currently stored in different Web-accessible databanks that scientists need to mine with user-defined queries and in a batch mode to orderly integrate the diverse extracted data in suitable user-customizable working environments. Unfortunately, to date, most accessible databanks can be interrogated only for a single gene or protein at a time and generally the data retrieved are available in HTML page format only. We developed GeneWebEx to effectively mine data of interest in different HTML pages of Web-interfaced databanks, and organize extracted data for further analyses. GeneWebEx utilizes user-defined templates to identify data to extract, and aggregates and structures them in a database designed to allocate the various extractions from distinct biomolecular databanks. Moreover, a template-based module enables automatic updating of extracted data. Validations performed on GeneWebEx allowed us to efficiently gather relevant annotations from various sources, and comprehensively query them to highlight significant biological characteristics.",
keywords = "Biomolecular database, Data extraction, Genomic information, Microarray data interpretation, Web wrapper",
author = "Marco Masseroli and Andrea Stella and Myriam Alcalay and Francesco Pinciroli",
year = "2005",
month = "6",
doi = "10.1142/S0218194005002403",
language = "English",
volume = "15",
pages = "511--526",
journal = "International Journal of Software Engineering and Knowledge Engineering",
issn = "0218-1940",
publisher = "World Scientific Publishing Co. Pte Ltd",
number = "3",

}

TY - JOUR

T1 - Genewebex

T2 - Gene annotation Web extraction, aggregation, and updating from Web-interfaced biomolecular databanks

AU - Masseroli, Marco

AU - Stella, Andrea

AU - Alcalay, Myriam

AU - Pinciroli, Francesco

PY - 2005/6

Y1 - 2005/6

N2 - Numerous genomic annotations are currently stored in different Web-accessible databanks that scientists need to mine with user-defined queries and in a batch mode to orderly integrate the diverse extracted data in suitable user-customizable working environments. Unfortunately, to date, most accessible databanks can be interrogated only for a single gene or protein at a time and generally the data retrieved are available in HTML page format only. We developed GeneWebEx to effectively mine data of interest in different HTML pages of Web-interfaced databanks, and organize extracted data for further analyses. GeneWebEx utilizes user-defined templates to identify data to extract, and aggregates and structures them in a database designed to allocate the various extractions from distinct biomolecular databanks. Moreover, a template-based module enables automatic updating of extracted data. Validations performed on GeneWebEx allowed us to efficiently gather relevant annotations from various sources, and comprehensively query them to highlight significant biological characteristics.

AB - Numerous genomic annotations are currently stored in different Web-accessible databanks that scientists need to mine with user-defined queries and in a batch mode to orderly integrate the diverse extracted data in suitable user-customizable working environments. Unfortunately, to date, most accessible databanks can be interrogated only for a single gene or protein at a time and generally the data retrieved are available in HTML page format only. We developed GeneWebEx to effectively mine data of interest in different HTML pages of Web-interfaced databanks, and organize extracted data for further analyses. GeneWebEx utilizes user-defined templates to identify data to extract, and aggregates and structures them in a database designed to allocate the various extractions from distinct biomolecular databanks. Moreover, a template-based module enables automatic updating of extracted data. Validations performed on GeneWebEx allowed us to efficiently gather relevant annotations from various sources, and comprehensively query them to highlight significant biological characteristics.

KW - Biomolecular database

KW - Data extraction

KW - Genomic information

KW - Microarray data interpretation

KW - Web wrapper

UR - http://www.scopus.com/inward/record.url?scp=22344433881&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=22344433881&partnerID=8YFLogxK

U2 - 10.1142/S0218194005002403

DO - 10.1142/S0218194005002403

M3 - Article

VL - 15

SP - 511

EP - 526

JO - International Journal of Software Engineering and Knowledge Engineering

JF - International Journal of Software Engineering and Knowledge Engineering

SN - 0218-1940

IS - 3

ER -