In order to avoid the influence of external factors on the subsequent recognition of RGB video and improve the accuracy of human motion recognition, an algorithm of human action recognition based on Two-Stream Ind Recurrent Neural Network is proposed. In terms of extracting features, the temporal network extracts the information on the 3D coordinate of different joints at each time and classifies it by a softmax layer. The spatial network converts the spatial positional relationship of the joints at each moment into a skeleton sequence and inputs it into the softmax layer to classify. Finally, the results of the classification of the temporal network and the spatial network are weighted and summed to obtain the final classification result. Experiments verify the validity of the model on the largest 3D skeleton action recognition dataset NTU RGB + D and SBU interactive dataset.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.