You have requested a machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Neither SPIE nor the owners and publishers of the content make, and they explicitly disclaim, any express or implied representations or warranties of any kind, including, without limitation, representations and warranties as to the functionality of the translation feature or the accuracy or completeness of the translations.
Translations are not retained in our system. Your use of this feature and the translations is subject to all use restrictions contained in the Terms and Conditions of Use of the SPIE website.
16 October 2019Depth information calculation method for unstructured objects based on deep neural network
Depth information perception of unstructured scene images is an important problem for applications using computer vision. This paper proposes a method based on deep learning combined with self-attention mechanism to reason the depth information of unstructured indoor targets, which effectively solves the problem of blurred image detail and insufficient layering in depth information reasoning in unstructured scenes. First, the deep learning-based encoder-decoder model is trained to learn the depth information of indoor scenes on large 3D datasets. The trained model has good results for general structured indoor scenes. Secondly, the soft self-attention mechanism is used to obtain the disparity information between the upper and lower sequences of the input image, by which the depth map obtained in the first step is corrected to enhance the accuracy of depth. Finally, in order to get clear objects with obvious boundaries in the depth response map, the nearest neighbor regression is used to correct the contour of the objects. The experimental results show that the proposed method has very good depth information reasoning ability for indoor unstructured scenes. Through depth information reasoning, the obtained objects have obvious texture structure, strong geometric features, clear contour edges and delicate layers, and also the misleading of deep information reasoning in reflective and highlight areas is eliminated.
The alert did not successfully save. Please try again later.
Wei Hu, Jinjun Rao, Zhe Xu, Jinbo Chen, Tao Wang, Mei Liu, Jingtao Lei, "Depth information calculation method for unstructured objects based on deep neural network," Proc. SPIE 11205, Seventh International Conference on Optical and Photonic Engineering (icOPEN 2019), 1120525 (16 October 2019); https://doi.org/10.1117/12.2542220