27 September 2016 A real-time 3D end-to-end augmented reality system (and its representation transformations)
Author Affiliations +
Abstract
The new generation of HMDs coming to the market is expected to enable many new applications that allow free viewpoint experiences with captured video objects. Current applications usually rely on 3D content that is manually created or captured in an offline manner. In contrast, this paper focuses on augmented reality applications that use live captured 3D objects while maintaining free viewpoint interaction. We present a system that allows live dynamic 3D objects (e.g. a person who is talking) to be captured in real-time. Real-time performance is achieved by traversing a number of representation formats and exploiting their specific benefits. For instance, depth images are maintained for fast neighborhood retrieval and occlusion determination, while implicit surfaces are used to facilitate multi-source aggregation for both geometry and texture. The result is a 3D reconstruction system that outputs multi-textured triangle meshes at real-time rates. An end-to-end system is presented that captures and reconstructs live 3D data and allows for this data to be used on a networked (AR) device. For allocating the different functional blocks onto the available physical devices, a number of alternatives are proposed considering the available computational power and bandwidth for each of the components. As we will show, the representation format can play an important role in this functional allocation and allows for a flexible system that can support a highly heterogeneous infrastructure.
© (2016) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Donny Tytgat, Donny Tytgat, Maarten Aerts, Maarten Aerts, Jeroen De Busser, Jeroen De Busser, Sammy Lievens, Sammy Lievens, Patrice Rondao Alface, Patrice Rondao Alface, Jean-Francois Macq, Jean-Francois Macq, } "A real-time 3D end-to-end augmented reality system (and its representation transformations)", Proc. SPIE 9971, Applications of Digital Image Processing XXXIX, 99710U (27 September 2016); doi: 10.1117/12.2237804; https://doi.org/10.1117/12.2237804
PROCEEDINGS
12 PAGES


SHARE
RELATED CONTENT

Three-dimensional face model creation from video
Proceedings of SPIE (April 13 2001)
Traffic camera markup language (TCML)
Proceedings of SPIE (February 15 2012)
Predictive Coding of Depth Images Across Multiple Views
Proceedings of SPIE (March 05 2007)

Back to Top