Abstract
This paper is concerned with two-person zero-sum games for continuous-time Markov chains, with possibly unbounded payoff and transition rate functions, under the discounted payoff criterion. We give conditions under which the existence of the value of the game and a pair of optimal stationary strategies is ensured by using the optimality (or Shapley) equation. We prove the convergence of the value iteration scheme to the game's value and to a pair of optimal stationary strategies. Moreover, when the transition rates are bounded we further show that the convergence of value iteration is exponential. Our results are illustrated with a controlled queueing system with unbounded transition and reward rates.
Citation
Xianping Guo. Onésimo Hernández-Lerma. "Zero-sum continuous-time Markov games with unbounded transition and discounted payoff rates." Bernoulli 11 (6) 1009 - 1029, dec 2005. https://doi.org/10.3150/bj/1137421638
Information