Pedestrian detection in video surveillance using fully convolutional YOLO neural network
26 June 2017
More than 80% of video surveillance systems are used for monitoring people. Older human detection algorithms, based on background and foreground modelling, could not handle even a group of people, let alone a crowd. Recent robust and highly effective pedestrian detection algorithms mark a new milestone for video surveillance systems. Built on modern deep learning approaches, these algorithms produce highly discriminative features that support robust inference in real visual scenes. They can distinguish individual persons in a group, cope with significant occlusion of human bodies by foreground objects, and detect people in various poses. In our work we use a new approach that combines the detection and classification tasks into a single task using convolutional neural networks. As a starting point we choose the YOLO CNN, whose authors propose a very efficient way of combining these tasks by training a single neural network. This approach has shown results competitive with state-of-the-art models such as Fast R-CNN while significantly outperforming them in speed, which allows us to apply it in real-time video surveillance and other video monitoring systems. Despite these advantages, it suffers from some known drawbacks related to its fully connected layers, which prevent the CNN from being applied to images of different resolutions. They also limit its ability to distinguish small human figures standing close together in groups, which is crucial for our tasks, since we work with rather low-quality images that often include dense groups of small figures. In this work we gradually change the network architecture to overcome the above problems, train it on a complex pedestrian dataset, and finally obtain a CNN that detects small pedestrians in real scenes.
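The resolution constraint mentioned above can be illustrated with a minimal sketch. It is not the authors' network: the five-layer stride-2 backbone, the channel count, and all function names are hypothetical and chosen only for illustration. The sketch shows that a convolutional stack yields an output grid that scales with the input size, whereas a fully connected layer is tied to the one flattened feature length it was trained on.

```python
def conv_output_size(size, kernel, stride, padding):
    """Standard output-size formula for one convolution along one axis."""
    return (size + 2 * padding - kernel) // stride + 1


# Hypothetical backbone (an assumption, not the paper's architecture):
# five 3x3 convolutions with stride 2 and padding 1, i.e. 32x downsampling.
LAYERS = [(3, 2, 1)] * 5


def feature_map_size(height, width, layers=LAYERS):
    """Spatial size of the final feature map for a given input resolution."""
    for kernel, stride, padding in layers:
        height = conv_output_size(height, kernel, stride, padding)
        width = conv_output_size(width, kernel, stride, padding)
    return height, width


def fc_input_features(height, width, channels):
    """Length of the flattened vector a fully connected layer would receive."""
    grid_h, grid_w = feature_map_size(height, width)
    return grid_h * grid_w * channels


def fully_connected_accepts(height, width, channels, trained_features):
    """A fully connected layer only fits the feature length it was trained on."""
    return fc_input_features(height, width, channels) == trained_features


if __name__ == "__main__":
    channels = 1024  # hypothetical channel count of the last conv layer
    trained = fc_input_features(448, 448, channels)
    for h, w in [(448, 448), (416, 416), (480, 640)]:
        print((h, w), feature_map_size(h, w),
              fully_connected_accepts(h, w, channels, trained))
```

Under these assumptions, a fully convolutional head simply produces a larger or smaller prediction grid when the input resolution changes, while the fully connected variant rejects every resolution except the training one.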
© (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
V. V. Molchanov, B. V. Vishnyakov, Y. V. Vizilter, O. V. Vishnyakova, and V. A. Knyaz, "Pedestrian detection in video surveillance using fully convolutional YOLO neural network," Proc. SPIE 10334, Automated Visual Inspection and Machine Vision II, 103340Q (26 June 2017).

