Discovery of 342 putative new genes from the analysis of 5′-end-sequenced full-length-enriched cDNA human transcripts

E. Dalla, F. Mignone, R. Verardo, L. Marchionni, S. Marzinotto, D. Lazarević, J. F. Reid, R. Marzio, E. Klarić, D. Licastro, G. Marcuzzi, R. Gambetta, M. A. Pierotti, G. Pesole, C. Schneider

Research output: Contribution to journalArticle

Abstract

In this work we describe the process that, starting with the production of human full-length-enriched cDNA libraries using the CAP-Trapper method, led us to the discovery of 342 putative new human genes. Twenty-three thousand full-length-enriched clones, obtained from various cell lines and tissues in different developmental stages, were 5′-end sequenced, allowing the identification of a pool of 5300 unique cDNAs. By comparing these sequences to various human and vertebrate nucleotide databases we found that about 40% of our clones extended previously annotated 5′ ends, 662 clones were likely to represent splice variants of known genes, and finally 342 clones remained unknown, with no or poor functional annotation. cDNA-microarray gene expression analysis showed that 260 of 342 unknown clones are expressed in at least one cell line and/or tissue. Further analysis of their sequences and the corresponding genomic locations allowed us to conclude that most of them represent potential novel genes, with only a small fraction having protein-coding potential.

Original languageEnglish
Pages (from-to)739-751
Number of pages13
JournalGenomics
Volume85
Issue number6
DOIs
Publication statusPublished - Jun 2005

Keywords

  • cDNA microarrays
  • Full-length cDNA
  • Gene expression
  • Human transcriptome

ASJC Scopus subject areas

  • Genetics

Fingerprint Dive into the research topics of 'Discovery of 342 putative new genes from the analysis of 5′-end-sequenced full-length-enriched cDNA human transcripts'. Together they form a unique fingerprint.

  • Cite this

    Dalla, E., Mignone, F., Verardo, R., Marchionni, L., Marzinotto, S., Lazarević, D., Reid, J. F., Marzio, R., Klarić, E., Licastro, D., Marcuzzi, G., Gambetta, R., Pierotti, M. A., Pesole, G., & Schneider, C. (2005). Discovery of 342 putative new genes from the analysis of 5′-end-sequenced full-length-enriched cDNA human transcripts. Genomics, 85(6), 739-751. https://doi.org/10.1016/j.ygeno.2005.02.009