Translator Disclaimer
Paper
31 January 2020 Multi-channels CNN temporal features for depth-based action recognition
Author Affiliations +
Proceedings Volume 11433, Twelfth International Conference on Machine Vision (ICMV 2019); 114330U (2020) https://doi.org/10.1117/12.2559432
Event: Twelfth International Conference on Machine Vision, 2019, Amsterdam, Netherlands
Abstract
In this paper, we investigate temporal features that are extracted by a multi-channel convolutional neural network in depth map-based human action recognition. At the beginning, for the non-zero pixels representing the person shape in each depth map we calculate handcrafted features. On multivariate time-series of such handcrafted features we train a multi-class, multi-channel CNN to model temporal features as well as we extract statistical features of time-series. The concatenated features are stored in a common feature vector. Afterwards, for each class we train a separate one-against-all convolutional neural network to extract class-specific features of depth maps. For each class-specific, multivariate time-series we calculate statistical features of time-series. Finally, each class-specific feature vector is concatenated with the common feature vector resulting in an action feature vector. For each action represented by action feature vectors we train a multi-class classifier with one-hot encoding of output labels. The recognition of the action is done by a voting-based ensemble operating on such one-hot encodings. We demonstrate experimentally that on UTD-MHAD dataset the proposed algorithm outperforms state-of-the-art depth-based algorithms and attains promising results on MSR-Action3D dataset.
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jacek Trelinski and Bogdan Kwolek "Multi-channels CNN temporal features for depth-based action recognition", Proc. SPIE 11433, Twelfth International Conference on Machine Vision (ICMV 2019), 114330U (31 January 2020); https://doi.org/10.1117/12.2559432
PROCEEDINGS
8 PAGES


SHARE
Advertisement
Advertisement
RELATED CONTENT

Region proposal-based semantic matcher
Proceedings of SPIE (June 21 2019)
Single shot relation detector for pedestrian detection
Proceedings of SPIE (August 14 2019)
Tiny RetinaNet a one stage detector for real time...
Proceedings of SPIE (January 03 2020)

Back to Top