QSAR model for the prediction of bio-concentration factor using aqueous solubility and descriptors considering various electronic effects

G. Piir, S. Sild, A. Roncaglioni, E. Benfenati, U. Maran

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

The in silico modelling of bio-concentration factor (BCF) is of considerable interest in environmental sciences, because it is an accepted indicator for the accumulation potential of chemicals in organisms. Numerous QSAR models have been developed for the BCF, and the majority utilize the octanol/water partition coefficient (logP) to account for the penetration characteristics of the chemicals. The present work used descriptors from a variety of software packages for the development of a multi-linear regression model to estimate BCF. The modelled data set of 473 diverse compounds covers a wide range of log BCF values. In the proposed QSAR model, most of the variation is described by the calculated solubility in water. Other contributing descriptors describe, for instance, hydrophobic surface area, hydrogen bonding and other electronic effects. The model was validated internally by using a variety of statistical approaches. Two external validations were also performed. For the former validation, a subset from the same data source was used. The 2nd external validation was based on an independent data set collected from different resources. All validations showed the consistency of the model. The applicability domain of the model was discussed and described and a thorough outlier analysis was performed.

Original languageEnglish
Pages (from-to)711-729
Number of pages19
JournalSAR and QSAR in Environmental Research
Volume21
Issue number7
DOIs
Publication statusPublished - Oct 2010

Fingerprint

Quantitative Structure-Activity Relationship
Solubility
Linear Models
Octanols
Water
Information Storage and Retrieval
Hydrogen Bonding
Ecology
Computer Simulation
Software
Linear regression
Software packages
Datasets
Hydrogen bonds

Keywords

  • Applicability domain
  • BCF
  • Bioconcentration
  • QSAR
  • Solubility

ASJC Scopus subject areas

  • Molecular Medicine
  • Bioengineering
  • Drug Discovery

Cite this

@article{a0c69c1e2e2247ab97aac99de7bf1eba,
title = "QSAR model for the prediction of bio-concentration factor using aqueous solubility and descriptors considering various electronic effects",
abstract = "The in silico modelling of bio-concentration factor (BCF) is of considerable interest in environmental sciences, because it is an accepted indicator for the accumulation potential of chemicals in organisms. Numerous QSAR models have been developed for the BCF, and the majority utilize the octanol/water partition coefficient (logP) to account for the penetration characteristics of the chemicals. The present work used descriptors from a variety of software packages for the development of a multi-linear regression model to estimate BCF. The modelled data set of 473 diverse compounds covers a wide range of log BCF values. In the proposed QSAR model, most of the variation is described by the calculated solubility in water. Other contributing descriptors describe, for instance, hydrophobic surface area, hydrogen bonding and other electronic effects. The model was validated internally by using a variety of statistical approaches. Two external validations were also performed. For the former validation, a subset from the same data source was used. The 2nd external validation was based on an independent data set collected from different resources. All validations showed the consistency of the model. The applicability domain of the model was discussed and described and a thorough outlier analysis was performed.",
keywords = "Applicability domain, BCF, Bioconcentration, QSAR, Solubility",
author = "G. Piir and S. Sild and A. Roncaglioni and E. Benfenati and U. Maran",
year = "2010",
month = "10",
doi = "10.1080/1062936X.2010.528596",
language = "English",
volume = "21",
pages = "711--729",
journal = "SAR and QSAR in Environmental Research",
issn = "1062-936X",
publisher = "Taylor and Francis Ltd.",
number = "7",

}

TY - JOUR

T1 - QSAR model for the prediction of bio-concentration factor using aqueous solubility and descriptors considering various electronic effects

AU - Piir, G.

AU - Sild, S.

AU - Roncaglioni, A.

AU - Benfenati, E.

AU - Maran, U.

PY - 2010/10

Y1 - 2010/10

N2 - The in silico modelling of bio-concentration factor (BCF) is of considerable interest in environmental sciences, because it is an accepted indicator for the accumulation potential of chemicals in organisms. Numerous QSAR models have been developed for the BCF, and the majority utilize the octanol/water partition coefficient (logP) to account for the penetration characteristics of the chemicals. The present work used descriptors from a variety of software packages for the development of a multi-linear regression model to estimate BCF. The modelled data set of 473 diverse compounds covers a wide range of log BCF values. In the proposed QSAR model, most of the variation is described by the calculated solubility in water. Other contributing descriptors describe, for instance, hydrophobic surface area, hydrogen bonding and other electronic effects. The model was validated internally by using a variety of statistical approaches. Two external validations were also performed. For the former validation, a subset from the same data source was used. The 2nd external validation was based on an independent data set collected from different resources. All validations showed the consistency of the model. The applicability domain of the model was discussed and described and a thorough outlier analysis was performed.

AB - The in silico modelling of bio-concentration factor (BCF) is of considerable interest in environmental sciences, because it is an accepted indicator for the accumulation potential of chemicals in organisms. Numerous QSAR models have been developed for the BCF, and the majority utilize the octanol/water partition coefficient (logP) to account for the penetration characteristics of the chemicals. The present work used descriptors from a variety of software packages for the development of a multi-linear regression model to estimate BCF. The modelled data set of 473 diverse compounds covers a wide range of log BCF values. In the proposed QSAR model, most of the variation is described by the calculated solubility in water. Other contributing descriptors describe, for instance, hydrophobic surface area, hydrogen bonding and other electronic effects. The model was validated internally by using a variety of statistical approaches. Two external validations were also performed. For the former validation, a subset from the same data source was used. The 2nd external validation was based on an independent data set collected from different resources. All validations showed the consistency of the model. The applicability domain of the model was discussed and described and a thorough outlier analysis was performed.

KW - Applicability domain

KW - BCF

KW - Bioconcentration

KW - QSAR

KW - Solubility

UR - http://www.scopus.com/inward/record.url?scp=78649771398&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78649771398&partnerID=8YFLogxK

U2 - 10.1080/1062936X.2010.528596

DO - 10.1080/1062936X.2010.528596

M3 - Article

C2 - 21120758

AN - SCOPUS:78649771398

VL - 21

SP - 711

EP - 729

JO - SAR and QSAR in Environmental Research

JF - SAR and QSAR in Environmental Research

SN - 1062-936X

IS - 7

ER -