Segmentation of illuminated areas of light using fully-convolutional neural networks and computer vision algorithms for augmented reality systems

Maxim Sorokin; Andrey Zhdanov; Dmitry Zhdanov; Igor S. Potemin; Nikolay Bogdanov

doi:10.1117/12.2526150

21 June 2019 Segmentation of illuminated areas of light using fully-convolutional neural networks and computer vision algorithms for augmented reality systems

Maxim Sorokin, Andrey Zhdanov, Dmitry Zhdanov, Igor S. Potemin, Nikolay Bogdanov

Author Affiliations +

Proceedings Volume 11062, Digital Optical Technologies 2019; 110621N (2019) https://doi.org/10.1117/12.2526150
Event: SPIE Digital Optical Technologies, 2019, Munich, Germany

Abstract

The relevance of this topic is due to the rapid development of virtual and augmented reality systems. The problem lies in the formation of natural conditions for lighting objects of the virtual world in real space. To solve a light sources determination problem and recovering its optical parameters were proposed the fully-convolutional neural network, which allows catching the 'behavior of light' features. The output of FCNN is a segmented image with light levels and its strength. Naturally, the fully-convolutional neural network is well suited for image segmentation, so as an encoder was taken the architecture of VGG-16 with layers that pools and convolves an input image to 1x1 pixel and wisely classifies it to one of a class which characterizes its strength. As image dataset was synthesized by Integra developed realistic scene rendering software 'Lumicept', which has on its boat powerful tools for modeling and passing the behavior of light, so there is no doubt of wrong behavior or visualization of light rays and its secondary lighting, that guarantees proper optical parameters and its classification. Lumicept renders the image and its multi-color mask, where each color corresponds to it's optical strengthens. More 'cold' colors mean less intensive illumination when 'hot' colors correspond for the light sources, in digit equivalent that values ranging from 0 to 500 nits (or candela per square meter), where 0 is not lit at all area and 500 is a value of a light brightness of a typical room lamp. These images were used to feed CNN to dense layer, where the network learn features to recognize and as output upsamples to a segmentation image. To say more closely about an upsample layer, this is a kind of a function that brings a low-resolution image to a high-one by duplicating each pixel twice, this is called the nearest neighbor approach. Now FCNN decision can be used in tasks of definition of lighted areas of an accommodation, restoring brightness parameters, taking features of shadows, analyzing its secondary illumination and classifies it to one of a brightness level, which nowadays is one of a major task in augmented reality systems to place a synthesized object to our environment to match the specified optical parameters and lighting of a room, also speaking about determination of a light, the CNN encoder can determine the type of illumination, by this, is meant ceiling ones or wall light sources. Neural network training was conducted on 221 train images and 29 validation images with learning rate 1E-2 and 200 epochs, after training the loss was 0,2. As a test was used an ‘intersection over union’ method, that compares the ground truth area of an input image and output image, comparing its pixels and giving the result of accuracy. The mean IoU is 0.7, almost rightly classifying the first class with a value of 90 percents of accordance and the last class with a probability of 30 percents. Lately, the FCNN will be trained on more images and will be trained to determine light sources location.

Citation Download Citation

Maxim Sorokin, Andrey Zhdanov, Dmitry Zhdanov, Igor S. Potemin, and Nikolay Bogdanov "Segmentation of illuminated areas of light using fully-convolutional neural networks and computer vision algorithms for augmented reality systems", Proc. SPIE 11062, Digital Optical Technologies 2019, 110621N (21 June 2019); https://doi.org/10.1117/12.2526150

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
6 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Augmented reality

Image segmentation

Neural networks

Virtual reality

Light sources and illumination

Light sources

Algorithm development

Show All Keywords

Keywords/Phrases

Search In:

Publication Years