Semantic segmentation of remote sensing images in urban scenes suffers from blurred multi-scale target boundaries, insufficient use of global context, and classification errors caused by high inter-class variance and low intra-class variance. Therefore, we propose a semantic segmentation network with edge and class guidance (ECGNet). First, ECGNet introduces multi-scale edge prior knowledge to address the problem of blurred target boundaries. Second, ECGNet applies synergistic class augmented attention to introduce class prior knowledge while retaining rich spatial dimensional localization information to alleviate the problem of classification errors caused by low intra-class variance and high inter-class variance. Finally, the multi-scale large receptive field attention in ECGNet simulates a large convolutional kernel to capture multi-scale global context information. Experiments conducted on the ISPRS Vaihingen and ISPRS Potsdam datasets show that the proposed method is competitive. |
ACCESS THE FULL ARTICLE
No SPIE Account? Create one
Image segmentation
Semantics
Prior knowledge
Remote sensing
Convolution
Visualization
Transformers