FusionDTA: attention-based feature polymerizer and knowledge distillation for drug-target binding affinity prediction

Abstract The prediction of drug-target affinity (DTA) plays an increasingly important role in drug discovery. Nowadays, lots of prediction methods focus on feature encoding of drugs and proteins, but ignore the importance of feature aggregation. However, the increasingly complex encoder networks lea...

Full description

Saved in:

Bibliographic Details
Published in	Briefings in bioinformatics Vol. 23; no. 1
Main Authors	Yuan, Weining, Chen, Guanxing, Chen, Calvin Yu-Chian
Format	Journal Article
Language	English
Published	England Oxford University Press 17.01.2022 Oxford Publishing Limited (England)
Subjects	Affinity Binding Coders Distillation Drug Development Drug Discovery Humans Knowledge Mathematical models Parameters Predictions Proteins Proteins - chemistry Redundancy Therapeutic targets model compression muti-head linear attention knowledge distillation feature polymerizer drug–target affinity
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Abstract The prediction of drug-target affinity (DTA) plays an increasingly important role in drug discovery. Nowadays, lots of prediction methods focus on feature encoding of drugs and proteins, but ignore the importance of feature aggregation. However, the increasingly complex encoder networks lead to the loss of implicit information and excessive model size. To this end, we propose a deep-learning-based approach namely FusionDTA. For the loss of implicit information, a novel muti-head linear attention mechanism was utilized to replace the rough pooling method. This allows FusionDTA aggregates global information based on attention weights, instead of selecting the largest one as max-pooling does. To solve the redundancy issue of parameters, we applied knowledge distillation in FusionDTA by transfering learnable information from teacher model to student. Results show that FusionDTA performs better than existing models for the test domain on all evaluation metrics. We obtained concordance index (CI) index of 0.913 and 0.906 in Davis and KIBA dataset respectively, compared with 0.893 and 0.891 of previous state-of-art model. Under the cold-start constrain, our model proved to be more robust and more effective with unseen inputs than baseline methods. In addition, the knowledge distillation did save half of the parameters of the model, with only 0.006 reduction in CI index. Even FusionDTA with half the parameters could easily exceed the baseline on all metrics. In general, our model has superior performance and improves the effect of drug–target interaction (DTI) prediction. The visualization of DTI can effectively help predict the binding region of proteins during structure-based drug design.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	1467-5463 1477-4054 1477-4054
DOI:	10.1093/bib/bbab506