Paper
11 July 2016 Development of coffee maker service robot using speech and face recognition systems using POMDP
Author Affiliations +
Proceedings Volume 10011, First International Workshop on Pattern Recognition; 1001110 (2016) https://doi.org/10.1117/12.2243589
Event: First International Workshop on Pattern Recognition, 2016, Tokyo, Japan
Abstract
There are many development of intelligent service robot in order to interact with user naturally. This purpose can be done by embedding speech and face recognition ability on specific tasks to the robot. In this research, we would like to propose Intelligent Coffee Maker Robot which the speech recognition is based on Indonesian language and powered by statistical dialogue systems. This kind of robot can be used in the office, supermarket or restaurant. In our scenario, robot will recognize user’s face and then accept commands from the user to do an action, specifically in making a coffee. Based on our previous work, the accuracy for speech recognition is about 86% and face recognition is about 93% in laboratory experiments. The main problem in here is to know the intention of user about how sweetness of the coffee. The intelligent coffee maker robot should conclude the user intention through conversation under unreliable automatic speech in noisy environment. In this paper, this spoken dialog problem is treated as a partially observable Markov decision process (POMDP). We describe how this formulation establish a promising framework by empirical results. The dialog simulations are presented which demonstrate significant quantitative outcome.
© (2016) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Widodo Budiharto, Meiliana, and Alexander Agung Santoso Gunawan "Development of coffee maker service robot using speech and face recognition systems using POMDP", Proc. SPIE 10011, First International Workshop on Pattern Recognition, 1001110 (11 July 2016); https://doi.org/10.1117/12.2243589
Lens.org Logo
CITATIONS
Cited by 6 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Facial recognition systems

Speech recognition

Detection and tracking algorithms

Image processing

Cameras

Principal component analysis

Relays

Back to Top