一种基于特征融合的息肉分割双解码模型
作者:
作者单位:

昆明理工大学信息工程与自动化学院,昆明 650500

作者简介:

通讯作者:

基金项目:

国家自然科学基金(61861023)。


A Double-Decoding Model for Polyp Segmentation Based on Feature Fusion
Author:
Affiliation:

Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    在结直肠癌的早期筛查中,通过对结肠镜图像进行自动化的息肉检测和分割可以提高诊断效率和准确性。由于肠道内部环境的复杂性以及图像质量的限制,自动化的息肉分割仍然是一个具有挑战性的问题。针对这一问题,提出了一种基于Transformer和空洞卷积特征融合的息肉分割双解码模型(Dual decoded polyp segmentation model fusing Transformer and dilated convolution, FTDC-Net)。该模型以ResNet50作为编码器,以便能够更好地提取图像深层次特征。使用 Transformer 编码模块,它的自注意力(Self-attention)机制能够捕捉输入之间的长距离依赖关系,模型中使用了不同的空洞卷积(Dilated-convolution)来扩大模型的感受野,让模型能捕捉到结肠镜图像更大范围内的信息。本文网络模型的解码部分使用双解码结构,包含一个自动编码器分支,自动编码器可以重构输入,另一个编码分支用于分割结果。模型中,自动编码器的输出被用于生成一个注意力图作为注意力机制,该图将被用于指导分割结果。在Kvasir-SEG和ETIS-LARIBPOLYPDB标准数据集上进行了实验验证,实验结果表明FTDC-Net能有效地分割出结肠息肉,相比目前主流息肉分割模型,在各项评价指标上均取得了较高的提升。

    Abstract:

    In the early screening of colorectal cancer, diagnostic efficiency and accuracy can be improved by automated polyp detection and segmentation of colonoscopy images. Due to the complexity of internal environment of intestines and the limitation of image quality, automated polyp segmentation is still a challenging problem. Aiming at this problem, this paper proposes a dual-decoding model for polyp segmentation using Transformer and null convolution to achieve feature fusion (FTDC-Net). ResNet50 is used as an encoder in order to be able to better extract deep image features. The Transformer coding module is used, which has a self-attention mechanism to capture long distance dependencies between the inputs, and different dilated-convolutions are used in the model to expand the sensory field of the model to allow the model to capture a larger range of information in the colonoscopy image. The decoding part of the network model in this paper uses a dual-decoding structure, including an autoencoder branch that reconstructs the inputs and a coding branch for segmenting the results. The output of the autoencoder is used in the model to generate an attention map as an attention mechanism. This map will be used to guide the segmentation results. Experimental validation is carried out on the Kvasir-SEG and ETIS-LARIBPOLYPDB standard datasets, and experimental results show that FTDC-Net can effectively segment colon polyps, and achieves a high level of improvement in all evaluation metrics compared to the current mainstream polyp segmentation models.

    参考文献
    相似文献
    引证文献
引用本文

吴港,全海燕.一种基于特征融合的息肉分割双解码模型[J].数据采集与处理,2024,(4):954-966

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2023-08-28
  • 最后修改日期:2024-01-27
  • 录用日期:
  • 在线发布日期: 2024-07-25