QSAR Modeling is not “Push a Button and Find a Correlation”: A Case Study of Toxicity of (Benzo‐)triazoles on Algae

Paola Gramatica(University of Insubria), Stefano Cassani(University of Insubria), Partha Pratim Roy(University of Insubria), Simona Kovarich(University of Insubria), Chun Wei Yap(National University of Singapore), Ester Papa(University of Insubria)
Molecular Informatics
November 19, 2012
Cited by 254Open Access
Full Text

Abstract

A case study of toxicity of (benzo)triazoles ((B)TAZs) to the algae Pseudokirchneriella subcapitata is used to discuss some problems and solutions in QSAR modeling, particularly in the environmental context. The relevance of data curation (not only of experimental data, but also of chemical structures and input formats for the calculation of molecular descriptors), the crucial points of QSAR model validation and the potential application for new chemicals (internal robustness, exclusion of chance correlation, external predictivity, applicability domain) are described, while developing MLR-OLS models based on molecular descriptors, calculated by various QSAR software tools (commercial DRAGON, free PaDEL-Descriptor and QSPR-THESAURUS). Additionally, the utility of consensus models is highlighted. This work summarizes a methodology for a rigorous statistical approach to obtain reliable QSAR predictions, also for a large number of (B)TAZs in the ECHA preregistration list of REACH (even if starting from limited experimental data availability), and has evidenced some ambiguities and discrepancies related to SMILES notations from different databases; furthermore it highlighted some general problems related to QSAR model generation and was useful in the implementation of the PaDEL-Descriptor software.


Related Papers

No related papers found

Powered by citation graph analysis