20 August 2024 Laddering vision foundation model for remote sensing image change detection
Yingying Liu, Gang Zhou
Author Affiliations +
Abstract

This paper proposes a novel laddering vision foundation model for change detection (CD) of remote sensing images. Current approaches have limitations in simultaneously extracting universal features and task-specific characteristics, and they cannot effectively integrate these features for detection tasks. The proposed model exploits both general features and task-specific characteristics for CD of remote sensing images. Specifically, task-agnostic characteristics are extracted from a pre-trained visual foundation model, which contains knowledge information of images. Then, the hierarchical transformer-based CD backbone is exploited to learn both long-range and local spatial information from remote sensing images. Furthermore, task-specific and universal features are integrated within the hierarchical network architecture, which can integrate heterogeneous feature maps and embedding tokens, addressing the differences in structure and content of different types of features. Three benchmark datasets are employed for comparative experiments, and experimental results certify the effectiveness and progressiveness in terms of CD of the investigated approach.

© 2024 Society of Photo-Optical Instrumentation Engineers (SPIE)
Yingying Liu and Gang Zhou "Laddering vision foundation model for remote sensing image change detection," Journal of Applied Remote Sensing 18(3), 036503 (20 August 2024). https://doi.org/10.1117/1.JRS.18.036503
Received: 7 May 2024; Accepted: 30 July 2024; Published: 20 August 2024
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visual process modeling

Feature extraction

Remote sensing

Machine learning

Semantics

Image segmentation

Network architectures

Back to Top