Image segmentation technology based on transformer in medical decision‐making system

Due to the improvement in computing power and the development of computer technology, deep learning has pene‐trated into various fields of the medical industry. Segmenting lesion areas in medical scans can help clinicians make accurate diagnoses. In particular, convolutional neural networks (CNNs) a...

Full description

Saved in:
Bibliographic Details
Published inIET image processing Vol. 17; no. 10; pp. 3040 - 3054
Main Authors He, Keke, Gou, Fangfang, Wu, Jia
Format Journal Article
LanguageEnglish
Published Wiley 01.08.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Due to the improvement in computing power and the development of computer technology, deep learning has pene‐trated into various fields of the medical industry. Segmenting lesion areas in medical scans can help clinicians make accurate diagnoses. In particular, convolutional neural networks (CNNs) are a dominant tool in computer vision tasks. They can accurately locate and classify lesion areas. However, due to their inherent inductive bias, CNNs may lack an understanding of long‐term dependencies in medical images, leading to less accurate grasping of details in the images. To address this problem, we explored a Transformer‐based solution and studied its feasibility in medical imaging tasks (OstT). First, we performed super‐resolution reconstruction on the original MRI image of osteosarcoma and improved the texture features of the tissue structure to reduce the error caused by the unclear tissue structure in the image during model training. Then, we propose a Transformer‐based method for medical image segmentation. A gated axial attention model is used, which augments existing architectures by introducing an additional control mechanism in the self‐attention module to improve segmentation accuracy. Experiments on real datasets show that our method outper‐forms existing models such as Unet. It can effectively assist doctors in imaging examinations. A transformer‐based solution is explored and its feasibility in medical imaging tasks is studied. A transformer‐based medical image segmentation system is proposed, OstT, which includes a gated axial attention model that enhances the existing architecture by introducing an additional control mechanism in the self‐attention module to improve segmentation accuracy.
ISSN:1751-9659
1751-9667
DOI:10.1049/ipr2.12854