3D Medical image segmentation is a basic and key task in computer-aided diagnosis, and glioma has a high degree of non-uniformity and irregular shape in multimodal MRI images. Therefore, accurate and reliable segmenta...
详细信息
ISBN:
(数字)9781510650435
ISBN:
(纸本)9781510650435;9781510650428
3D Medical image segmentation is a basic and key task in computer-aided diagnosis, and glioma has a high degree of non-uniformity and irregular shape in multimodal MRI images. Therefore, accurate and reliable segmentation of brain tumors is still a challenging work in medical image analysis. U-Net has become the de facto standard in various medical image segmentation tasks and has achieved great success. However, due to the inherent locality of convolution operations, U-Net is generally limited in terms of explicitly modeling long-term dependencies. Although this problem is solved in transformer, it has extreme complexity in terms of calculation and space when processing high-resolution 3D feature maps. In this paper, we propose the trans-coder, which is embedded in the end of the U-Net encoder to improve the segmentation performance while reducing the amount of calculation. The trans-coder takes the feature map from U-Net as the input sequence, and extracts the relative position information of the feature map, so as to get more detailed information of the image, and input it into the decoder to obtain good segmentation performance. At the same time, a variational autoencoder is used for regularization to prevent over-fitting problems. our method achieves superior performances to certain competing methods on the Multimodal Brain Tumor Segmentation Challenge (Brats) dataset.
暂无评论