Hamiltonian diversity: effectively measuring molecular diversity by shortest Hamiltonian circuits

In recent years, significant advancements have been made in molecular generation algorithms aimed at facilitating drug development, and molecular diversity holds paramount importance within the realm of molecular generation. Nonetheless, the effective quantification of molecular diversity remains an...

Full description

Saved in:
Bibliographic Details
Published inJournal of cheminformatics Vol. 16; no. 1; pp. 94 - 11
Main Authors Hu, Xiuyuan, Liu, Guoqing, Yao, Quanming, Zhao, Yang, Zhang, Hao
Format Journal Article
LanguageEnglish
Published Cham Springer International Publishing 07.08.2024
BioMed Central Ltd
Springer Nature B.V
BMC
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In recent years, significant advancements have been made in molecular generation algorithms aimed at facilitating drug development, and molecular diversity holds paramount importance within the realm of molecular generation. Nonetheless, the effective quantification of molecular diversity remains an elusive challenge, as extant metrics exemplified by Richness and Internal Diversity fall short in concurrently encapsulating the two main aspects of such diversity: quantity and dissimilarity. To address this quandary, we propose Hamiltonian diversity, a novel molecular diversity metric predicated upon the shortest Hamiltonian circuit. This metric embodies both aspects of molecular diversity in principle, and we implement its calculation with high efficiency and accuracy. Furthermore, through empirical experiments we demonstrate the high consistency of Hamiltonian diversity with real-world chemical diversity, and substantiate its effects in promoting diversity of molecular generation algorithms. Our implementation of Hamiltonian diversity in Python is available at: https://github.com/HXYfighter/HamDiv . Scientific contribution We propose a more rational molecular diversity metric for the community of cheminformatics and drug development. This metric can be applied to evaluation of existing molecular generation methods and enhancing drug design algorithms.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1758-2946
1758-2946
DOI:10.1186/s13321-024-00883-4