27 September 2024 Multi-label remote sensing scene classification using two-level double channel spatial attention residual blocks
Sagar Chavda, Mahesh Goyani
Author Affiliations +
Abstract

In the field of remote sensing, there are numerous challenges associated with multi-label scene classification, such as the firm correlation among labels, the presence of small-scale noisy objects or areas, cluttered backgrounds, diverse image categories in datasets, differences within classes and similarities among classes, and unbalanced class weights. Moreover, the limited representation ability of convolutional neural networks (CNNs) makes multi-label scene classification a complex endeavor. We address the CNN model's limited representation ability and aim to enhance its performance through the proposed framework. Current research has demonstrated that channel spatial attention modules can boost the representation power of CNNs. However, these modules are not yet widely utilized in the multi-label scene domain. Therefore, we have incorporated state-of-the-art channel–spatial attention modules into MobileNet_v1 to improve its representation ability for multi-label remote sensing scene classification. The proposed method employs two-level feature extraction with double channel–spatial attention residual blocks. We tested it on the UC, AID, and DFC15 multi-label datasets and evaluated its performance using various metrics. The results show that our method improves the representation power of MobileNet_v1 by 3% to 7% in terms of F1- and F2-scores, using channel spatial attention and residual connections on the UC and AID datasets. Our approach also surpasses MobileNet_v3, achieving a 3% to 8% increase in F1 and a 3% to 6% increase in F2-scores on the UC and AID datasets. On the DFC15 dataset, a significant improvement is made over the MobileNet family. Compared with the identified state-of-the-art methods, our approach achieves superior/competitive outcomes while employing fewer training parameters/floating point operations. Therefore, the proposed approach is a promising solution for multi-label remote sensing scene classification and can help overcome the challenges posed by this task in the remote sensing domain.

© 2024 Society of Photo-Optical Instrumentation Engineers (SPIE)
Sagar Chavda and Mahesh Goyani "Multi-label remote sensing scene classification using two-level double channel spatial attention residual blocks," Journal of Applied Remote Sensing 18(3), 036511 (27 September 2024). https://doi.org/10.1117/1.JRS.18.036511
Received: 5 August 2023; Accepted: 2 September 2024; Published: 27 September 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Remote sensing

Scene classification

Convolution

Data modeling

Education and training

Performance modeling

Visualization

Back to Top