期刊论文

【摘要】

Background: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. Methods: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of three-dimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. Results: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P<0.05, P<0.01, or P<0.001), with a higher Dice coefficient (85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). Conclusions: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net.

【授权许可】

【预览】

附件列表
Files	Size	Format	View
RO202303290000711ZK.pdf	767KB	PDF	download

Quantitative Imaging in Medicine and Surgery
UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
article
Kun Fang¹ Baochun He² Libo Liu² Haoyu Hu³ Chihua Fang³ Xuguang Huang¹ Fucang Jia²
[1]School for Information and Optoelectronic Science and Engineering , South China Normal University
[2]Research Center for Medical Artificial Intelligence, Shenzhen Institute of Advanced Technology , Chinese Academy of Sciences
[3]Department of Hepatobiliary Surgery ,(I) , Zhujiang Hospital of Southern Medical University
[4]Pazhou Lab
[5]Shenzhen Key Laboratory of Minimally Invasive Surgical Robotics and System, Shenzhen Institute of Advanced Technology , Chinese Academy of Sciences
关键词: Pancreas; image segmentation; transformer; deep learning; U-Net;
DOI : 10.21037/qims-22-544
学科分类：外科医学
来源: AME Publications
PDF


	文献评价指标
	下载次数：0次	浏览次数：0次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】