We propose a method to for robot planning using deep learning to integrate object detection and natural language understanding. This is different from other techniques, such the RCTA World Model,1 which explicitly defines the interfaces between each module. These boundaries simplify the design and testing of complex robotic systems, but also introduce constraints that may reduce overall system performance. For example, perception tasks generate large amounts of data, but much of it is discarded to simplify interpretation by higher level tasks, e.g., a 3D object becomes a point in space, or a distribution over classifications is reduced to its mode. Further, errors in the robot’s overal task are not back-propagated to lower level tasks, and therefore, these tasks never adapt themselves to improve robot performance. We intend to address this by using a deep learning framework to replace these interfaces with learned interfaces that select what data is shared between modules and allow for error back-propagation that could adapt each module to the robot’s task. We will do this in a simplified system that accepts aerial orthographic images and simple commands and generates paths to achieve the command. Paths are learned from expert example via inverse optimal control. In time, we hope to evolve this simplified architecture towards something more complex and practical.
Arne J. Suppé and Martial Hebert, "Using deep learning to bridge the gap between perception and intelligence," Proc. SPIE 10195, Unmanned Systems Technology XIX, 1019503 (Presented at SPIE Defense + Security: April 11, 2017; Published: 5 May 2017); https://doi.org/10.1117/12.2262252.
Conference Presentations are recordings of oral presentations given at SPIE conferences and published as part of the conference proceedings. They include the speaker's narration along with a video recording of the presentation slides and animations. Many conference presentations also include full-text papers. Search and browse our growing collection of more than 14,000 conference presentations, including many plenary and keynote presentations.