LG-Umer: UNet-like Network Integrate Local-Global Feature with Novel Attention for Road Extraction from Remote Sensing Images
Article
| Article Title | LG-Umer: UNet-like Network Integrate Local-Global Feature with Novel Attention for Road Extraction from Remote Sensing Images |
|---|---|
| ERA Journal ID | 200605 |
| Article Category | Article |
| Authors | Niu, Penghui, Cai, Taotao, Zhang, Yajuan, Zhang, Ping, Xu, Wenjia, Gu, Junhua and Han, Jungong |
| Journal Title | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| Journal Citation | 18, pp. 21755-21768 |
| Number of Pages | 14 |
| Year | 2025 |
| Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
| Place of Publication | United States |
| ISSN | 1939-1404 |
| 2151-1535 | |
| Digital Object Identifier (DOI) | https://doi.org/10.1109/JSTARS.2025.3573735 |
| Web Address (URL) | https://ieeexplore.ieee.org/abstract/document/11015561 |
| Abstract | Road extraction from remote sensing images is a key research area in smart city development. While deep learning techniques have demonstrated remarkable effectiveness in this domain, existing approaches exhibit limitations: convolutional neural network (CNN)-based methods struggle to capture global contextual information for long-range road networks, vision transformer (ViT)-based methods fail to adequately extract multiscale local features, and hybrid CNN–ViT architectures overlook the synergistic guidance between local and global features. To address these challenges, we propose LG-Umer, a UNet-like network that integrates Local-Global features with a novel attention mechanism, combining the complementary strengths of CNNs and ViTs within an encoder–decoder framework. Specifically, the encoder employs a multiscale strip deformationalmodule, which utilizes deformable convolutions to adaptively extract topological structures and variable-shaped local road features. In the decoder, a multistage gate unit module is introduced, incorporating a novel attention mechanism to model long-range dependencies by leveraging local features as attention operators for global feature refinement. Extensive experiments on three public benchmarks demonstrate the superiority ofLG-Umer. It achieves IoUscores of 70.4%, 71.2%, and 68.7% on the Massachusetts Road, DeepGlobe Road, and CHN6-CUG datasets, respectively, surpassing recent state-of-theart methods by 1.2%, 0.9%, and 1.1%. These results validate the effectiveness of our approach in balancing local detail preservation and global contextual modeling for road extraction tasks. |
| Keywords | Building extraction; deep learning (DL); global attention; multiscale direction context-aware |
| Contains Sensitive Content | Does not contain sensitive content |
| ANZSRC Field of Research 2020 | 460206. Knowledge representation and reasoning |
| Byline Affiliations | Hebei University of Technology, China |
| School of Science, Engineering and Digital Technologies | |
| Hebei Prospecting Institute of Hydrogeology and Engineering Geological, China | |
| University of Sheffield, United Kingdom |
https://research.usq.edu.au/item/1013wq/lg-umer-unet-like-network-integrate-local-global-feature-with-novel-attention-for-road-extraction-from-remote-sensing-images
Download files
Published Version
| LG-Umer_UNet-Like_Network_Integrate_LocalGlobal_Feature_With_Novel_Attention_for_Road_Extraction_From_Remote_Sensing_Images.pdf | ||
| License: CC BY 4.0 | ||
| File access level: Anyone | ||
14
total views1
total downloads14
views this month1
downloads this month