Open Access
October 2011 Discounted continuous-time constrained Markov decision processes in Polish spaces
Xianping Guo, Xinyuan Song
Ann. Appl. Probab. 21(5): 2016-2049 (October 2011). DOI: 10.1214/10-AAP749

Abstract

This paper is devoted to studying constrained continuous-time Markov decision processes (MDPs) in the class of randomized policies depending on state histories. The transition rates may be unbounded, the reward and costs are admitted to be unbounded from above and from below, and the state and action spaces are Polish spaces. The optimality criterion to be maximized is the expected discounted rewards, and the constraints can be imposed on the expected discounted costs. First, we give conditions for the nonexplosion of underlying processes and the finiteness of the expected discounted rewards/costs. Second, using a technique of occupation measures, we prove that the constrained optimality of continuous-time MDPs can be transformed to an equivalent (optimality) problem over a class of probability measures. Based on the equivalent problem and a so-called w̄-weak convergence of probability measures developed in this paper, we show the existence of a constrained optimal policy. Third, by providing a linear programming formulation of the equivalent problem, we show the solvability of constrained optimal policies. Finally, we use two computable examples to illustrate our main results.

Citation

Download Citation

Xianping Guo. Xinyuan Song. "Discounted continuous-time constrained Markov decision processes in Polish spaces." Ann. Appl. Probab. 21 (5) 2016 - 2049, October 2011. https://doi.org/10.1214/10-AAP749

Information

Published: October 2011
First available in Project Euclid: 25 October 2011

zbMATH: 1258.90104
MathSciNet: MR2884058
Digital Object Identifier: 10.1214/10-AAP749

Subjects:
Primary: 60J27 , 90C40

Keywords: constrained optimal policy , continuous-time Markov decision process , linear programming formulation , occupation measure , unbounded transition rates

Rights: Copyright © 2011 Institute of Mathematical Statistics

Vol.21 • No. 5 • October 2011
Back to Top