Deep learning-based automated high-accuracy location and identification of fresh vertebral compression fractures from spinal radiographs: a multicenter cohort study

Digital radiography (DR) is a common and widely available examination. However, spinal DR cannot detect bone marrow edema, therefore, determining vertebral compression fractures (VCFs), especially fresh VCFs, remains challenging for clinicians. We trained, validated, and externally tested the deep r...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in bioengineering and biotechnology Vol. 12; p. 1397003
Main Authors Zhang, Hao, Xu, Ruixiang, Guo, Xiang, Zhou, Dan, Xu, Tongshuai, Zhong, Xin, Kong, Meng, Zhang, Zhimin, Wang, Yan, Ma, Xuexiao
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 14.05.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Digital radiography (DR) is a common and widely available examination. However, spinal DR cannot detect bone marrow edema, therefore, determining vertebral compression fractures (VCFs), especially fresh VCFs, remains challenging for clinicians. We trained, validated, and externally tested the deep residual network (DRN) model that automated the detection and identification of fresh VCFs from spinal DR images. A total of 1,747 participants from five institutions were enrolled in this study and divided into the training cohort, validation cohort and external test cohorts (YHDH and BMUH cohorts). We evaluated the performance of DRN model based on the area under the receiver operating characteristic curve (AUC), feature attention maps, sensitivity, specificity, and accuracy. We compared it with five other deep learning models and validated and tested the model internally and externally and explored whether it remains highly accurate for an external test cohort. In addition, the influence of old VCFs on the performance of the DRN model was assessed. The AUC was 0.99, 0.89, and 0.88 in the validation, YHDH, and BMUH cohorts, respectively, for the DRN model for detecting and discriminating fresh VCFs. The accuracies were 81.45% and 72.90%, sensitivities were 84.75% and 91.43%, and specificities were 80.25% and 63.89% in the YHDH and BMUH cohorts, respectively. The DRN model generated correct activation on the fresh VCFs and accurate peak responses on the area of the target vertebral body parts and demonstrated better feature representation learning and classification performance. The AUC was 0.90 (95% confidence interval [CI] 0.84-0.95) and 0.84 (95% CI 0.72-0.93) in the non-old VCFs and old VCFs groups, respectively, in the YHDH cohort ( = 0.067). The AUC was 0.89 (95% CI 0.84-0.94) and 0.85 (95% CI 0.72-0.95) in the non-old VCFs and old VCFs groups, respectively, in the BMUH cohort ( = 0.051). In present study, we developed the DRN model for automated diagnosis and identification of fresh VCFs from spinal DR images. The DRN model can provide interpretable attention maps to support the excellent prediction results, which is the key that most clinicians care about when using the model to assist decision-making.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Edited by: Stephen Ferguson, ETH Zürich, Switzerland
These authors have contributed equally to this work
Renjie Chen, Beijing Jishuitan Hospital, China
Reviewed by: Jukka Hirvasniemi, Erasmus Medical Center, Netherlands
ISSN:2296-4185
2296-4185
DOI:10.3389/fbioe.2024.1397003