Fragment Prioritization on a Large Mutagenicity Dataset

Matteo Floris, Giuseppa Raitano, Ricardo Medda, Emilio Benfenati

Research output: Contribution to journalArticlepeer-review


The identification of structural alerts is one of the simplest tools used for the identification of potentially toxic chemical compounds. Structural alerts have served as an aid to quickly identify chemicals that should be either prioritized for testing or for elimination from further consideration and use. In the recent years, the availability of larger datasets, often growing in the context of collaborative efforts and competitions, created the raw material needed to identify new and more accurate structural alerts. This work applied a method to efficiently mine large toxicological dataset for structural alert showing a strong statistical association with mutagenicity. In details, we processed a large Ames mutagenicity dataset comprising 14,015 unique molecules obtained by joining different data sources. After correction for multiple testing, we were able to assign a probability value to each fragment. A total of 51 rules were identified, with p-value < 0.05. Using the same method, we also confirmed the statistical significance of several mutagenicity rules already present and largely recognized in the literature. In addition, we have extended the application of our method by predicting the mutagenicity of an external data set.

Original languageEnglish
JournalMolecular Informatics
Publication statusAccepted/In press - 2016


  • Computational toxicology
  • QSAR
  • Read Across

ASJC Scopus subject areas

  • Structural Biology
  • Molecular Medicine
  • Drug Discovery
  • Computer Science Applications
  • Organic Chemistry


Dive into the research topics of 'Fragment Prioritization on a Large Mutagenicity Dataset'. Together they form a unique fingerprint.

Cite this