The use of Quantitative Structure-Activity Relationships in assessing the potential negative effects of chemicals plays an important role in ecotoxicology. (LC50)96h in Pimephales promelas (Duluth database) is widely modeled as an aquatic toxicity end-point. The object of this study was to compare different molecular descriptors in the development of new statistically validated QSAR models to predict the aquatic toxicity of chemicals classified according to their MOA and in a unique general model. The applied multiple linear regression approach (ordinary least squares) is based on theoretical molecular descriptor variety (ID, 2D, and 3D, from DRAGON package, and some calculated logP). The best combination of modeling descriptors was selected by the Genetic Algorithm-Variable Subset Selection procedure. The robustness and the predictive performance of the proposed models was verified using both internal (cross-validation by LOO, bootstrap, Y-scrambling) and external statistical validations (by splitting the original data set into training and validation sets by Kohonen-artificial neural networks (K-ANN)). The model applicability domain (AD) was checked by the leverage approach to verify prediction reliability.

Statistically validated QSARs, based on theoretical descriptors, for modeling aquatic toxicity of organic chemicals in Pimephales promelas (fathead minnow)

PAPA, ESTER;GRAMATICA, PAOLA
2005-01-01

Abstract

The use of Quantitative Structure-Activity Relationships in assessing the potential negative effects of chemicals plays an important role in ecotoxicology. (LC50)96h in Pimephales promelas (Duluth database) is widely modeled as an aquatic toxicity end-point. The object of this study was to compare different molecular descriptors in the development of new statistically validated QSAR models to predict the aquatic toxicity of chemicals classified according to their MOA and in a unique general model. The applied multiple linear regression approach (ordinary least squares) is based on theoretical molecular descriptor variety (ID, 2D, and 3D, from DRAGON package, and some calculated logP). The best combination of modeling descriptors was selected by the Genetic Algorithm-Variable Subset Selection procedure. The robustness and the predictive performance of the proposed models was verified using both internal (cross-validation by LOO, bootstrap, Y-scrambling) and external statistical validations (by splitting the original data set into training and validation sets by Kohonen-artificial neural networks (K-ANN)). The model applicability domain (AD) was checked by the leverage approach to verify prediction reliability.
2005
Papa, Ester; Villa, F.; Gramatica, Paola
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11383/1495579
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? 6
  • Scopus 157
  • ???jsp.display-item.citation.isi??? 144
social impact