武汉大学陈关州--中文主页-- S2Net: A Multitask Learning Network for Semantic Stereo of Satellite Image Pairs

陈关州

当前位置：中文主页 > 科学研究 > 论文成果

论文成果

S2Net: A Multitask Learning Network for Semantic Stereo of Satellite Image Pairs

发布时间：2024-10-27

点击次数：

DOI码：10.1109/tgrs.2023.3335997

发表刊物：IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

关键字：Convolutional neural network,multitask learning,semantic segmentation,stereo image pairs,stereo matching

摘要：Stereo matching and semantic segmentation are two significant tasks in remote sensing. Recently, deep learning approaches have been applied to these tasks separately. However, the lack of semantic supervision makes the training of stereo matching model susceptible to data disturbance, resulting in inferior generalization ability; foreground objects are sometimes confused with background pixels in RGB images, limiting the classification accuracy. By exploring the relationship between these two tasks, semantic stereo solves these problems simultaneously with multitask learning. Previous methods took semantic stereo as two parallel processing tasks, so they did not take full advantages of the additional information from both tasks and only obtained slight improvement. In this work, we designed a multitask learning framework semantic stereo network (S(2)Net). The proposed network generates cost volumes with feature maps supervised by semantic information to estimate disparity maps and fuses RGB-D feature maps to predict classification maps, therefore gathering multitask learning information. To enhance the performance of trained model, we also considered the continuity of disparity values and the duality of stereo image pairs in data augmentation. When applied in datasets without training, S(2)Net obtained 2.937% D1-Error in the WHU dataset, lower than 4.297% of the previous best method, depicting the generalization ability improvement from semantic supervision. In terms of semantic segmentation, the introduction of disparity maps increases the mean intersection over union (mIoU) from 61.375% to 69.096% in the US3D datasets. The experiments on the KITTI semantics benchmark show that our proposed method obtains 60.76% mIoU, achieving state-of-the-art among multitask learning methods.

合写作者：Tong Wang,Xianwei Li,Haobo Yang,Wenlin Zhou,Chanjuan He,Qing Wang

论文类型：期刊论文

通讯作者：Xiaodong Zhang,Guanzhou Chen

文献类型：J

卷号：62

页面范围：1-13

ISSN号：0196-2892

是否译文：否

发表时间：2023-11-28

收录刊物：SCI

上一条：On-Orbit Geometric Calibration and Accuracy Validation of the Jilin1-KF01B Wide-Field Camera

下一条：A Deep Transfer Learning Framework Using Teacher-Student Structure for Land Cover Classification of Remote-Sensing Imagery

陈关州

首页

科学研究

获奖信息

招生信息

学生信息

相关链接

论文成果

S2Net: A Multitask Learning Network for Semantic Stereo of Satellite Image Pairs

陈关州 var _tsites_com_view_mode_type_=8;

首页

科学研究

获奖信息

招生信息

学生信息

相关链接

论文成果

S2Net: A Multitask Learning Network for Semantic Stereo of Satellite Image Pairs

陈关州