DeepEBV: a deep learning model to predict Epstein–Barr virus (EBV) integration sites

Abstract Motivation Epstein–Barr virus (EBV) is one of the most prevalent DNA oncogenic viruses. The integration of EBV into the host genome has been reported to play an important role in cancer development. The preference of EBV integration showed strong dependence on the local genomic environment,...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics (Oxford, England) Vol. 37; no. 20; pp. 3405 - 3411
Main Authors Liang, Jiuxing, Cui, Zifeng, Wu, Canbiao, Yu, Yao, Tian, Rui, Xie, Hongxian, Jin, Zhuang, Fan, Weiwen, Xie, Weiling, Huang, Zhaoyue, Xu, Wei, Zhu, Jingjing, You, Zeshan, Guo, Xiaofang, Qiu, Xiaofan, Ye, Jiahao, Lang, Bin, Li, Mengyuan, Tan, Songwei, Hu, Zheng
Format Journal Article
LanguageEnglish
Published England Oxford University Press 25.10.2021
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Motivation Epstein–Barr virus (EBV) is one of the most prevalent DNA oncogenic viruses. The integration of EBV into the host genome has been reported to play an important role in cancer development. The preference of EBV integration showed strong dependence on the local genomic environment, which enables the prediction of EBV integration sites. Results An attention-based deep learning model, DeepEBV, was developed to predict EBV integration sites by learning local genomic features automatically. First, DeepEBV was trained and tested using the data from the dsVIS database. The results showed that DeepEBV with EBV integration sequences plus Repeat peaks and 2-fold data augmentation performed the best on the training dataset. Furthermore, the performance of the model was validated in an independent dataset. In addition, the motifs of DNA-binding proteins could influence the selection preference of viral insertional mutagenesis. Furthermore, the results showed that DeepEBV can predict EBV integration hotspot genes accurately. In summary, DeepEBV is a robust, accurate and explainable deep learning model, providing novel insights into EBV integration preferences and mechanisms. Availabilityand implementation DeepEBV is available as open-source software and can be downloaded from https://github.com/JiuxingLiang/DeepEBV.git. Supplementary information Supplementary data are available at Bioinformatics online.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1367-4803
1367-4811
1367-4811
DOI:10.1093/bioinformatics/btab388