多级注意力特征优化的道路场景实时语义分割
DOI:
作者:
作者单位:

1.重庆理工大学;2.重庆理工大学电气与电子工程学院 重庆 400054

作者简介:

通讯作者:

基金项目:

国家自然科学基金资助项目(62371081);重庆市自然科学基金(cstc2021jcyj-msxmX0411和CSTB2022NSCQ-MSX0873)。


Real-time semantic segmentation of road scene based on multi-level attention feature optimization
Author:
Affiliation:

Chongqing University of Technology

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    针对复杂多变道路场景下目标重叠导致图像边缘难以分割、小目标特征提取困难等问题,提出一种多级注意力特征优化的道路场景实时语义分割方法。首先,设计深度残差注意力模块,考虑不同层级下特征权重的差异性,通过压缩注意力机制来优化图像局部特征,从而改善像素之间的边缘效应;然后,设计通道注意力和深度聚合金字塔池化模块进一步加强语义上下文信息的提取,从而改善小目标信息丢失问题;最后,设计注意力融合模块自上而下地融合不同尺度下的特征信息,实现全局特征信息下的有效交互,增强网络对重要特征的表达。在Cityscapes和CamVid道路场景数据集上进行实验测试,分别达到74.4%和67.7%的分割精度,138帧/s和148帧/s的推理速度。与近几年优秀方法相比,该方法改善了图像边缘信息丢失,优化了对图像中小目标的分割准确度。

    Abstract:

    Aiming at the problems of overlapping targets in complex and changeable road scenes, it is difficult to segment image edges, and it is difficult to extract small target features. A multi-level attention feature optimization method for real-time semantic segmentation of road scenes is proposed. Firstly, a lightweight residual attention module is designed, taking into account the difference in feature weights at different levels, and optimizing local features of the image through a compressed attention mechanism, thereby improving the edge effect between pixels;then, design the channel The attention and depth aggregation pyramid pooling module further strengthens the extraction of semantic context information, thereby improving the problem of small target information loss; finally, the attention fusion module is designed to fuse feature information at different scales from top to bottom to achieve global feature information. Effectively interact and enhance the network"s expression of important features. Experimental tests were carried out on the Cityscapes and CamVid road scene datasets, and the segmentation accuracy was 74.4% and 67.7%, respectively, and the inference speed was 138 frames/s and 148 frames/s. Compared with the excellent methods in recent years, this method improves the loss of image edge information and optimizes the segmentation accuracy of small objects in the image.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2023-10-19
  • 最后修改日期:2024-03-28
  • 录用日期:2024-03-29
  • 在线发布日期: