基于语义分割和融合残差U-Net的单视光学遥感影像三维重建方法

doi:10.16337/j.1004-9037.2024.02.008

首页 > 按月查看>2024年第2月 >348-360. DOI:10.16337/j.1004-9037.2024.02.008

基于语义分割和融合残差U-Net的单视光学遥感影像三维重建方法
DOI:
                        10.16337/j.1004-9037.2024.02.008
                    
作者:
                        
                        
                    
作者单位:1.浙江省测绘科学技术研究院，杭州310012;2.南京航空航天大学航天学院，南京211106;3.浙江艺佳地理信息技术有限公司，杭州311700;4.绍兴市上虞区自然资源监测中心，绍兴312365
作者简介:
通讯作者:
基金项目:浙江省基础公益研究计划（LTGS23D010003）。

Three-Dimensional Reconstruction Method for Single-View Optical Remote Sensing Images Based on Semantic Segmentation and Residual U-Net Fusion

Author:

Affiliation:

1.Zhejiang Institute of Surveying and Mapping Science and Technology, Hangzhou 310012, China;2.College of Aerospace Science, Nanjing University of Aeronautics & Astronautics, Nanjing 211106, China;3.Zhejiang Yijia Geographic Information Technology Co. Ltd.,Hangzhou 311700, China;4.Shaoxing Shangyu District Natural Resources Monitoring Center,Shaoxing 312365, China

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

摘要:

从单视遥感图像进行三维重建本身是一个解不唯一的非适定问题，往往需要大量的人工经验来补充缺失信息以构建完整三维模型。为了解决这一问题，提出了一种基于语义分割和融合残差U-Net的单视遥感影像三维重建方法。该方法包括语义分割和单视遥感影像高度估计两个阶段。语义分割阶段使用U-Net确定地物属性，在此基础上改进U-Net对遥感影像进行高度估计，并联合语义特征进行锚定高度回归以提高重建精度。针对改进U-Net，通过嵌入不同数量与通道的残差块，强化编码器的特征提取能力，并修改解码器输出层使其适应于高度回归任务，从而实现逐像素预测遥感影像的数字表面模型（Digital surface model， DSM）高度值。在公开的US3D数据集上得到了均方根误差（Root mean square error，RMSE）为2.751 m、平均绝对误差（Mean absolute error，MAE）为1.446 m的结果，重建结果均优于其余网络，证实该方法实现了基于单视遥感影像的三维估计，能够重建地物的分布结构。

Abstract:

Three-dimensional （3D） reconstruction from single-view remote sensing images is an unsolvable problem， which often requires a lot of manual experience to supplement the missing information to construct a complete 3D model. To solve this problem， a 3D reconstruction method of single-view remote sensing image based on semantic segmentation and fusion residual U-Net is proposed. The method includes two stages： Semantic segmentation and height estimation of single-view remote sensing images. In the semantic segmentation stage， U-Net is used to determine the property of ground objects. On this basis， U-Net is improved to estimate the height of remote sensing image. The anchoring height regression is combined with semantic features to improve the reconstruction accuracy. Specifically， in order to improve U-Net， the feature extraction capability of encoder is enhanced by embedding residual blocks with different numbers and channels， and the decoder output layer is modified to adapt to the height regression task， so as to achieve pixel-to-pixel prediction of digital surface model （DSM） height values of remote sensing images. The results of root mean square error （RMSE） of 2.751 m and mean absolute error （MAE） of 1.446 m are obtained on the published US3D data set， and the reconstructed results are superior to those of other networks， confirming that the method can realize 3D estimation based on single-view remote sensing images and can reconstruct the distribution structure of ground objects.

参考文献

相似文献

引证文献

引用本文

黄桦,朱宇昕,章历,陈志达,张乙志,王博.基于语义分割和融合残差U-Net的单视光学遥感影像三维重建方法[J].数据采集与处理,2024,(2):348-360

复制

文章指标

点击次数:
下载次数:

历史

收稿日期:2023-10-18
最后修改日期:2024-02-25
录用日期:
在线发布日期: 2024-04-10

引用本文

分享

文章指标

历史