Are the Chemical Structures in Your QSAR Correct?

Quantitative structure–activity relationships (QSARs) are used to predict many different endpoints, utilize hundreds, and even thousands of different parameters (or descriptors), and are created using a variety of approaches. The one thing they all have in common is the assumption that the chemical...

Full description

Saved in:
Bibliographic Details
Published inQSAR & combinatorial science Vol. 27; no. 11-12; pp. 1337 - 1345
Main Authors Young, Douglas, Martin, Todd, Venkatapathy, Raghuraman, Harten, Paul
Format Journal Article
LanguageEnglish
Published Weinheim WILEY-VCH Verlag 01.12.2008
WILEY‐VCH Verlag
Subjects
Online AccessGet full text
ISSN1611-020X
1611-0218
DOI10.1002/qsar.200810084

Cover

More Information
Summary:Quantitative structure–activity relationships (QSARs) are used to predict many different endpoints, utilize hundreds, and even thousands of different parameters (or descriptors), and are created using a variety of approaches. The one thing they all have in common is the assumption that the chemical structures used are correct. This research investigates this assumption by examining six public and private databases that contain structural information for chemicals. Molecular fingerprinting techniques are used to determine the error rates for structures in each of the databases. It was observed that the databases had error rates ranging from 0.1 to 3.4%. A case study to predict the n‐octanol/water partition coefficient was also investigated to highlight the effects of these errors in the predictions of QSARs. In this case study, QSARs were developed using both (i) all correct structures and (ii) structures from a database with an error rate of 3.4%. This case study showed how slight errors in chemical structures, such as misplacing a Cl atom or swapping hydroxy and methoxy functional groups on a multiple ring structure, can result in significant differences in the accuracy of the prediction for those chemicals.
Bibliography:istex:99DD63E9F3682C9EC843C2B24F74CB83AB019048
ark:/67375/WNG-X2P20172-4
ArticleID:QSAR200810084
ISSN:1611-020X
1611-0218
DOI:10.1002/qsar.200810084