31 May 2024 FDAENet: frequency domain attention encoder-decoder network for road extraction of remote sensing images
Hai Huan, Bo Zhang
Author Affiliations +
Abstract

Road information is a crucial type of geographic information. The extraction of road information from remote sensing images has been widely applied in various fields such as mapping, transportation, and navigation. However, due to the obstruction of buildings, trees, and shadows, or the spectral similarity between roads and buildings, road extraction remains a challenging research topic. Most current methods focus only on the spatial domain, neglecting the information contained in the image frequency domain. Therefore, this work proposes a remote sensing image road extraction model, frequency domain attention encoder-decoder network (FDAENet). This model mainly consists of three parts. First, the encoder is composed of frequency domain transformer modules (FDTMs). The gnConv in the FDTM includes depthwise separable convolution and phase and magnitude (PM) filters, where the PM filter contains a global filter and phase and amplitude filters located in two parallel layers, used to extract feature information of road remote sensing images from the frequency domain. Then, a multi-scale context extraction module is proposed, which introduces appropriate road context information to enhance inference capability. Finally, a stripe convolution module is introduced to capture long-distance context information from four different directions. Experiments on public road datasets show that FDAENet performs excellently in terms of F1, intersection over union, average path length similarity, and other indicators. Visualization results show that FDAENet performs better in extracting complex roads and can effectively extract roads from high-resolution remote sensing images.

© 2024 Society of Photo-Optical Instrumentation Engineers (SPIE)
Hai Huan and Bo Zhang "FDAENet: frequency domain attention encoder-decoder network for road extraction of remote sensing images," Journal of Applied Remote Sensing 18(2), 024510 (31 May 2024). https://doi.org/10.1117/1.JRS.18.024510
Received: 16 January 2024; Accepted: 13 May 2024; Published: 31 May 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Roads

Convolution

Remote sensing

Tunable filters

Transformers

Feature extraction

Image filtering

RELATED CONTENT

Robust tracking for visual complex environments
Proceedings of SPIE (June 21 2024)
Attention aggregation learning for fast stereo matching
Proceedings of SPIE (July 19 2024)
Road extraction from SAR images based on particle filtering
Proceedings of SPIE (November 15 2007)

Back to Top