The Hidden Component of Size in Two-Dimensional Fragment Descriptors:  Side Effects on Sampling in Bioactive Libraries

We have carried out a number of sampling experiments in libraries of bioactive compounds to illustrate how size biases introduced by two-dimensional (2D) fragment distance functions may provide misleading information about the diversity of compound subsets. The number of different biological targets...

Full description

Saved in:
Bibliographic Details
Published inJournal of medicinal chemistry Vol. 42; no. 15; pp. 2887 - 2900
Main Authors Dixon, Steven L, Koehler, Ryan T
Format Journal Article
LanguageEnglish
Published Washington, DC American Chemical Society 29.07.1999
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We have carried out a number of sampling experiments in libraries of bioactive compounds to illustrate how size biases introduced by two-dimensional (2D) fragment distance functions may provide misleading information about the diversity of compound subsets. The number of different biological targets covered by a given subset is used as a measure of bioactive diversity, and it is considered to be the relevant property with which 2D diversity should correlate. Since the nature of the size biases depends on the way in which 2D distance is computed, we investigated three different methods of calculating distance. Use of 1-Tanimoto as a dissimilarity measure leads to the spurious conclusion that collections of structurally small compounds are inherently more diverse than other collections which may cover a broader range of sizes and more biological targets. XOR or squared Euclidean distance, by contrast, shows a preference for subsets of structurally larger compounds, but this does not appear to have as many adverse consequences in terms of target coverage. A simple product of 1-Tanimoto and XOR tends to equalize the opposing size effects of the two component distance functions and leads to a relatively unbiased means of comparing structures. Results here suggest that careful consideration should be given to the way in which chemical structures are compared whenever 2D fragment descriptors are used.
Bibliography:istex:FCEC75835297989845525F569A49CAEF81BBA765
ark:/67375/TPS-X6N747G3-5
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0022-2623
1520-4804
DOI:10.1021/jm980708c