Paper
21 June 2019 Multimodal data fusion for object recognition
Author Affiliations +
Abstract
Multi-spectral imagery provides wide possibilities for improving quality of object detection and recognition due to better visibility of different scene features in different spectral ranges. To use the advantage of multi-spectral data the relation between different types of data is required. This relation is provided by capturing data using calibrated, aligned and synchronized sensors. Also geo-spatial data in form of geo-referenced digital terrain models can be used for establishing geometric and semantic relations between different types of data. The presented study considers the problem of object recognition based on two data sources: visible and thermal imagery. The main aim of the performed study was to evaluate the performance of different convolutional neural network models for multimodal object recognition. For this purpose a special dataset was collected. The dataset contains synchronized visible and thermal images acquired by several sensor based on unmanned aerial vehicle. The dataset contains synchronized color and thermal images of urban and suburb scenes gathered in different seasons, different times of day and various weather conditions. For convolutional neural network training the dataset was augmented by model images created using object 3D models textured by real visible and thermal images. Several convolutional neural network architectures were trained and evaluated on the created dataset using different splits to estimate the influence of training data on object recognition performance.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Vladimir Knyaz "Multimodal data fusion for object recognition", Proc. SPIE 11059, Multimodal Sensing: Technologies and Applications, 110590P (21 June 2019); https://doi.org/10.1117/12.2526067
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications and 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
3D modeling

RGB color model

Cameras

Object recognition

3D image processing

Thermal modeling

Data fusion

RELATED CONTENT

Mosaicking thermal images of buildings
Proceedings of SPIE (May 23 2013)
Inference for data fusion
Proceedings of SPIE (December 16 1992)
Exploring approaches to layered image registration
Proceedings of SPIE (April 15 2008)

Back to Top