Open Access
2024 Continuity of cost in Borkar control topology and implications on discrete space and time approximations for controlled diffusions under several criteria
Somnath Pradhan, Serdar Yüksel
Author Affiliations +
Electron. J. Probab. 29: 1-32 (2024). DOI: 10.1214/24-EJP1093

Abstract

We first show that the discounted cost, cost up to an exit time, and ergodic cost involving controlled non-degenerate diffusions are continuous on the space of stationary control policies when the policies are given a topology introduced by Borkar [V. S. Borkar, A topology for Markov controls, Applied Mathematics and Optimization 20 (1989), 55–62]. The same applies for finite horizon problems when the control policies are Markov and the topology is revised to include time also as a parameter. We then establish that finite action/piecewise constant stationary policies are dense in the space of stationary Markov policies under this topology and the same holds for continuous policies. Using the above mentioned continuity and denseness results we establish that finite action/piecewise constant policies approximate optimal stationary policies with arbitrary precision. This gives rise to the applicability of many numerical methods such as policy iteration and stochastic learning methods for discounted cost, cost up to an exit time, and ergodic cost optimal control problems in continuous-time. For the finite-horizon setup, we establish additionally near optimality of time-discretized policies by an analogous argument. We thus present a unified and concise approach for approximations directly applicable under several commonly adopted cost criteria.

Acknowledgments

We wish to thank the anonymous referees for the careful reading of the manuscript and proposed improvements. This research was partially supported by the Natural Sciences and Engineering Research Council of Canada (NSERC).

Citation

Download Citation

Somnath Pradhan. Serdar Yüksel. "Continuity of cost in Borkar control topology and implications on discrete space and time approximations for controlled diffusions under several criteria." Electron. J. Probab. 29 1 - 32, 2024. https://doi.org/10.1214/24-EJP1093

Information

Received: 10 November 2022; Accepted: 30 January 2024; Published: 2024
First available in Project Euclid: 22 February 2024

Digital Object Identifier: 10.1214/24-EJP1093

Subjects:
Primary: 60J60 , 93E20
Secondary: 35Q93

Keywords: controlled diffusions , finite actions , Hamilton-Jacobi-Bellman equation , near optimality , piecewise constant policy

Vol.29 • 2024
Back to Top