Paper
22 May 2020 Fusion of color and depth information for human actions recognition
V. Voronin, M. Zhdanova, E. Semenishchev, A. Zelensky, O. Tokareva
Author Affiliations +
Abstract
The solution to the problem of recognizing human actions on video sequences is one of the key areas on the path to the development and implementation of computer vision systems in various spheres of life. Such areas as video surveillance systems, monitoring, contactless control interfaces, video processing as a preliminary stage of processing, etc. Most of the approaches published in the literature can be divided into two groups: approaches based on constructing a global descriptor or a description of local points, unrelated to the human skeleton; and approaches based on the construction of the feature points (joint) of the human skeleton. In most cases, only the second group of methods use depth sensors to obtain clear information about the human skeleton. While additional sources of information (such as depth sensors, thermal sensors) allow you to get more informative features, and thus increase the reliability and stability of recognition. In the article, we present the algorithm, combining information from visible cameras and depth sensors based on the PLIP model (parameterized model of logarithmic image processing) close to the perception of the human visual system, and the development of a global descriptor characterizing the action taking place in the frame. The proposed algorithm takes advantage of the fusion of various modalities that provide the construction of a more informative descriptor. Experimental results showed the effectiveness of the proposed algorithm.
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
V. Voronin, M. Zhdanova, E. Semenishchev, A. Zelensky, and O. Tokareva "Fusion of color and depth information for human actions recognition", Proc. SPIE 11423, Signal Processing, Sensor/Information Fusion, and Target Recognition XXIX, 114231C (22 May 2020); https://doi.org/10.1117/12.2560130
Lens.org Logo
CITATIONS
Cited by 4 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Image fusion

Fourier transforms

RGB color model

3D image processing

3D modeling

Image processing

Back to Top