Wuhan University Chen Guanzhou--Home-- LMFNet: Lightweight Multimodal Fusion Network for high-resolution remote sensing image segmentation

Chen Guanzhou

Main positions:副研究员
Gender:Male
Status:Employed
School/Department:测绘遥感信息工程全国重点实验室

Discipline: Photogrammetry and Remote Sensing

MOBILE Version

Click: times

Open Time:..

The Last Update Time:..

Current position: Home >> Scientific Research >> Paper Publications

LMFNet: Lightweight Multimodal Fusion Network for high-resolution remote sensing image segmentation

Hits : Praise

Impact Factor:7.5

Journal:Pattern Recognition

Abstract:Despite the rapid evolution of semantic segmentation for land cover classification in high-resolution remote sensing imagery, integrating multiple data modalities such as Digital Surface Model (DSM), RGB, and Near-infrared (NIR) remains a challenge. Current methods often process only two types of data, missing out on the rich information that additional modalities can provide. Addressing this gap, we propose a novel Lightweight Multimodal data Fusion Network (LMFNet) to accomplish the tasks of fusion and semantic segmentation of multimodal remote sensing images. LMFNet uniquely accommodates various data types simultaneously, including RGB, NirRG, and DSM, through a weight-sharing, multi-branch vision transformer that minimizes parameter count while ensuring robust feature extraction. Our proposed multimodal fusion module integrates a Multimodal Feature Fusion Reconstruction Layer and Multimodal Feature Self-Attention Fusion Layer, which can reconstruct and fuse multimodal features. Our method achieves a mean Intersection over Union (mIoU) of 85.09% on the US3D dataset, marking a significant improvement over existing methods. We also studied the scalability of our method, directly extending the input modality to the SAR and hyperspectral fields. Our experimental results on the C2Seg dataset show that our method has generalization applicability to data of various modalities.

Co-author:Chenxi Liu,Jiaqi Wang,Xiaoliang Tan,Wenlin Zhou,Chanjuan He

Indexed by:Journal paper

Correspondence Author:Xiaodong Zhang

Document Type:J

Volume:164

Translation or Not:no

Date of Publication:2025-03-15

Included Journals:SCI

Links to published journals:https://www.sciencedirect.com/science/article/abs/pii/S0031320325002390

Pre One:Spatio-Temporal Fusion of Landsat and MODIS Data for Monitoring of High-Intensity Fire Traces in Karst Landscapes: A Case Study in China

Next One:Disentangling the hourly dynamics of mixed urban function: A multimodal fusion perspective using dynamic graphs