22 April 2010 Online gaming for learning optimal team strategies in real time
Author Affiliations +
Abstract
This paper first presents an overall view for dynamical decision-making in teams, both cooperative and competitive. Strategies for team decision problems, including optimal control, zero-sum 2-player games (H-infinity control) and so on are normally solved for off-line by solving associated matrix equations such as the Riccati equation. However, using that approach, players cannot change their objectives online in real time without calling for a completely new off-line solution for the new strategies. Therefore, in this paper we give a method for learning optimal team strategies online in real time as team dynamical play unfolds. In the linear quadratic regulator case, for instance, the method learns the Riccati equation solution online without ever solving the Riccati equation. This allows for truly dynamical team decisions where objective functions can change in real time and the system dynamics can be time-varying.
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Gregory Hudas, F. L. Lewis, K. G. Vamvoudakis, "Online gaming for learning optimal team strategies in real time", Proc. SPIE 7692, Unmanned Systems Technology XII, 76920W (22 April 2010); doi: 10.1117/12.850231; https://doi.org/10.1117/12.850231
PROCEEDINGS
13 PAGES


SHARE
Back to Top