Kurt Helmes, Richard H. Stockbridge
This paper examines the numerical implementation of a linear programming (LP) formulation of stochastic control problems involving singular stochastic processes. The decision maker has the ability to influence a diffusion process through the selection of its drift rate (a control that acts absolutely continuously in time) and may also decide to instantaneously move the process to some other level (a singular control). The first goal of the paper is to show that linear programming provides a viable approach to solving singular control problems. A second goal is the determination of the absolutely continuous control from the LP results and is intimately tied to the particular numerical implementation. The original stochastic control problem is equivalent to an infinite-dimensional linear program in which the variables are measures on appropriate bounded regions. The implementation method replaces the LP formulation involving measures by one involving the moments of the measures. This moment approach does not directly provide the optimal control in feedback form of the current state. The second goal of this paper is to show that the feedback form of the optimal control can be obtained using sensitivity analysis.
References
[1] Beneš, V. E., Shepp, L. A. and Witsenhausen, H. S. (1980). Some solvable stochastic control problems. Stochastics 4 39–83.
Mathematical Reviews (MathSciNet):
MR587428
[2] Bhatt, A. G. and Borkar, V. S. (1996). Occupation measures for controlled Markov processes: Characterization and optimality. Annals of Probability 24 1531–1562.
[3] Cho, M. J. and Stockbridge, R. H. (2002). Linear programming formulation for optimal stopping problems. SIAM Journal of Control and Optimization 40 1965–1982.
[4] Decker, T. (2006). Die Charakterisierung des verallgemeinerten DalePolytops und ihre Verwendung in linearen Programmen zur Lösung von Austrittszeit-, Stopp- und anderen Optimierungsproblemen. Dissertation, Humboldt Universitaet, Berlin.
[5] Hausdorff, F. (1923). Momentprobleme für ein endliches Intervall. Mathematische Zeitschrift 16 220–248.
[6] Helmes, K. and Röhl, S. (2008). A geometrical characterization of multidimensional Hausdorff polytopes with applications to exit time problems. Math. Oper. Res. 33 315–326.
[7] Helmes, K., Röhl, S. and Stockbridge, R. H. (2001). Computing moments of the exit time distribution for Markov processes by linear programming. Operations Research 49 516–530.
[8] Helmes, K. and Stockbridge, R. H. (2000). Numerical comparison of controls and verification of optimality for stochastic control problems. Journal of Optimization Theory and Applications 106 107–127.
[9] Helmes, K. and Stockbridge, R. H. (2001). Numerical evaluation of resolvents and Laplace transforms of Markov processes. Mathematical Methods of Operations Research 53 309–331.
[10] Helmes, K. and Stockbridge, R. H. (2003). Extension of Dale’s moment conditions with application to the Wright–Fisher model. Stochastic Models 19 255–267.
[11] Helmes, K. and Stockbridge, R. H. (2007). Linear programming approach to the optimal stopping of stochastic processes. Stochastics 79 309–335.
[12] Kaczmarek, P. (2006). Numerical analysis of a long-term average control problem. M.S. Thesis, University of Wisconsin–Milwaukee.
[13] Kaczmarek, P., Kent, S. T., Rus, G. A., Stockbridge, R. H. and Wade, B. A. (2007). Numerical solution of a long-term average control problem for singular stochastic processes. Math. Oper. Res. 66 451–473.
[14] Kurtz, T. G. (1991). A control formulation for constrained Markov processes. Mathematics of Random Media. Lectures in Applied Mathematics 27.
[15] Kurtz, T. G. and Stockbridge, R. H. (1998). Existence of Markov controls and characterization of optimal Markov controls. SIAM Journal of Control and Optimization 36 609–653.
[16] Kurtz, T. G. and Stockbridge, R. H. (2001). Stationary solutions and forward equations for controlled and singular martingale problems. Electronic Journal of Probability 6 paper 14, 1–52.
[17] Manne, A. S. (1960). Linear programming and sequential decisions. Management Science 6 259–267.
Mathematical Reviews (MathSciNet):
MR129022
[18] Mendiondo, M. S. and Stockbridge, R. H. (1998). Approximation of infinite-dimensional linear programming problems which arise in stochastic control. SIAM Journal of Control and Optimization 36 1448–1472.
[19] Röhl. S. (2001). Ein linearer Programmierungsansatz zur Lösung von Stopp- und Steuerungsproblemen. Dissertation, Humboldt Universitaet, Berlin.
[20] Stockbridge, R. H. (1990). Time-average control of martingale problems: A linear programming formulation. Annals of Probability 18 206–217.
[21] Zhang, Q. (2001). Stock trading: An optimal selling rule. SIAM Journal of Control and Optimization 40 64–87.