28 April 2018 Robust human action recognition based on depth motion maps and improved convolutional neural network
Linqin Cai, Xiaolin Liu, Fuli Chen, Min Xiang
Author Affiliations +
Abstract
Human action recognition has been widely used in various fields of computer vision, pattern recognition, and human–computer interaction and has attracted substantial attention. Combining deep learning and depth information, this paper proposed a method of human action recognition based on improved convolutional neural networks (CNN). First, we use the depth motion maps to extract the depth sequence features and obtain three projected maps corresponding to front, side, and the top views. On this basis, an improved CNN is constructed to realize the recognition of human action, which uses three-dimensional (3-D) input and two-dimensional process identification to speed up the computation and reduce the complexity of recognition process. We evaluate our approach on two public 3-D action datasets: MSR Action3D dataset and UT-Kinect dataset, and our private CTP Action3D dataset built using Kinect to collect data. The experimental results show that the proposed methods of human action recognition achieve higher average recognition rate of 91.3% on MSR Action3D dataset, 97.98% on UT-Kinect dataset, and the average recognition rate is 93.8% on our CTP Action3D dataset. Furthermore, the trained model on one depth video sequence dataset can be easily generalized to different datasets without changing network parameters.
© 2018 SPIE and IS&T 1017-9909/2018/$25.00 © 2018 SPIE and IS&T
Linqin Cai, Xiaolin Liu, Fuli Chen, and Min Xiang "Robust human action recognition based on depth motion maps and improved convolutional neural network," Journal of Electronic Imaging 27(5), 051218 (28 April 2018). https://doi.org/10.1117/1.JEI.27.5.051218
Received: 8 January 2018; Accepted: 10 April 2018; Published: 28 April 2018
Lens.org Logo
CITATIONS
Cited by 11 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

3D modeling

Convolutional neural networks

Feature extraction

Data modeling

Sensors

3D image processing

RELATED CONTENT


Back to Top