Are the Chemical Structures in Your QSAR Correct?
Quantitative structure–activity relationships (QSARs) are used to predict many different endpoints, utilize hundreds, and even thousands of different parameters (or descriptors), and are created using a variety of approaches. The one thing they all have in common is the assumption that the chemical...
Saved in:
Published in | QSAR & combinatorial science Vol. 27; no. 11-12; pp. 1337 - 1345 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Weinheim
WILEY-VCH Verlag
01.12.2008
WILEY‐VCH Verlag |
Subjects | |
Online Access | Get full text |
ISSN | 1611-020X 1611-0218 |
DOI | 10.1002/qsar.200810084 |
Cover
Summary: | Quantitative structure–activity relationships (QSARs) are used to predict many different endpoints, utilize hundreds, and even thousands of different parameters (or descriptors), and are created using a variety of approaches. The one thing they all have in common is the assumption that the chemical structures used are correct. This research investigates this assumption by examining six public and private databases that contain structural information for chemicals. Molecular fingerprinting techniques are used to determine the error rates for structures in each of the databases. It was observed that the databases had error rates ranging from 0.1 to 3.4%. A case study to predict the n‐octanol/water partition coefficient was also investigated to highlight the effects of these errors in the predictions of QSARs. In this case study, QSARs were developed using both (i) all correct structures and (ii) structures from a database with an error rate of 3.4%. This case study showed how slight errors in chemical structures, such as misplacing a Cl atom or swapping hydroxy and methoxy functional groups on a multiple ring structure, can result in significant differences in the accuracy of the prediction for those chemicals. |
---|---|
Bibliography: | istex:99DD63E9F3682C9EC843C2B24F74CB83AB019048 ark:/67375/WNG-X2P20172-4 ArticleID:QSAR200810084 |
ISSN: | 1611-020X 1611-0218 |
DOI: | 10.1002/qsar.200810084 |