Institute of Mathematical Statistics Collections

Determining the Optimal Control of Singular Stochastic Processes Using Linear Programming

Kurt Helmes, Richard H. Stockbridge

Abstract

This paper examines the numerical implementation of a linear programming (LP) formulation of stochastic control problems involving singular stochastic processes. The decision maker has the ability to influence a diffusion process through the selection of its drift rate (a control that acts absolutely continuously in time) and may also decide to instantaneously move the process to some other level (a singular control). The first goal of the paper is to show that linear programming provides a viable approach to solving singular control problems. A second goal is the determination of the absolutely continuous control from the LP results and is intimately tied to the particular numerical implementation. The original stochastic control problem is equivalent to an infinite-dimensional linear program in which the variables are measures on appropriate bounded regions. The implementation method replaces the LP formulation involving measures by one involving the moments of the measures. This moment approach does not directly provide the optimal control in feedback form of the current state. The second goal of this paper is to show that the feedback form of the optimal control can be obtained using sensitivity analysis.

First Page: Show Hide
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.imsc/1233152940 Digital Object Identifier: doi:10.1214/074921708000000354

References

[1] Beneš, V. E., Shepp, L. A. and Witsenhausen, H. S. (1980). Some solvable stochastic control problems. Stochastics 4 39–83.
Mathematical Reviews (MathSciNet): MR587428
[2] Bhatt, A. G. and Borkar, V. S. (1996). Occupation measures for controlled Markov processes: Characterization and optimality. Annals of Probability 24 1531–1562.
Mathematical Reviews (MathSciNet): MR1411505
Zentralblatt MATH: 0863.93086
Digital Object Identifier: doi:10.1214/aop/1065725192
Project Euclid: euclid.aop/1065725192
[3] Cho, M. J. and Stockbridge, R. H. (2002). Linear programming formulation for optimal stopping problems. SIAM Journal of Control and Optimization 40 1965–1982.
Mathematical Reviews (MathSciNet): MR1897205
Zentralblatt MATH: 1010.60037
Digital Object Identifier: doi:10.1137/S0363012900377663
[4] Decker, T. (2006). Die Charakterisierung des verallgemeinerten DalePolytops und ihre Verwendung in linearen Programmen zur Lösung von Austrittszeit-, Stopp- und anderen Optimierungsproblemen. Dissertation, Humboldt Universitaet, Berlin.
[5] Hausdorff, F. (1923). Momentprobleme für ein endliches Intervall. Mathematische Zeitschrift 16 220–248.
[6] Helmes, K. and Röhl, S. (2008). A geometrical characterization of multidimensional Hausdorff polytopes with applications to exit time problems. Math. Oper. Res. 33 315–326.
Mathematical Reviews (MathSciNet): MR2415995
Digital Object Identifier: doi:10.1287/moor.1070.0293
[7] Helmes, K., Röhl, S. and Stockbridge, R. H. (2001). Computing moments of the exit time distribution for Markov processes by linear programming. Operations Research 49 516–530.
Mathematical Reviews (MathSciNet): MR1852358
Zentralblatt MATH: 1163.60321
Digital Object Identifier: doi:10.1287/opre.49.4.516.11221
[8] Helmes, K. and Stockbridge, R. H. (2000). Numerical comparison of controls and verification of optimality for stochastic control problems. Journal of Optimization Theory and Applications 106 107–127.
Mathematical Reviews (MathSciNet): MR1780110
Zentralblatt MATH: 0965.93099
Digital Object Identifier: doi:10.1023/A:1004659107996
[9] Helmes, K. and Stockbridge, R. H. (2001). Numerical evaluation of resolvents and Laplace transforms of Markov processes. Mathematical Methods of Operations Research 53 309–331.
Mathematical Reviews (MathSciNet): MR1842712
Zentralblatt MATH: 1020.60064
Digital Object Identifier: doi:10.1007/s001860100121
[10] Helmes, K. and Stockbridge, R. H. (2003). Extension of Dale’s moment conditions with application to the Wright–Fisher model. Stochastic Models 19 255–267.
Mathematical Reviews (MathSciNet): MR1979144
Zentralblatt MATH: 1043.62005
Digital Object Identifier: doi:10.1081/STM-120020389
[11] Helmes, K. and Stockbridge, R. H. (2007). Linear programming approach to the optimal stopping of stochastic processes. Stochastics 79 309–335.
Mathematical Reviews (MathSciNet): MR2308079
Digital Object Identifier: doi:10.1080/17442500600976814
[12] Kaczmarek, P. (2006). Numerical analysis of a long-term average control problem. M.S. Thesis, University of Wisconsin–Milwaukee.
[13] Kaczmarek, P., Kent, S. T., Rus, G. A., Stockbridge, R. H. and Wade, B. A. (2007). Numerical solution of a long-term average control problem for singular stochastic processes. Math. Oper. Res. 66 451–473.
Mathematical Reviews (MathSciNet): MR2357660
Zentralblatt MATH: 1125.93071
Digital Object Identifier: doi:10.1007/s00186-007-0166-9
[14] Kurtz, T. G. (1991). A control formulation for constrained Markov processes. Mathematics of Random Media. Lectures in Applied Mathematics 27.
Mathematical Reviews (MathSciNet): MR1117242
Zentralblatt MATH: 0728.60078
[15] Kurtz, T. G. and Stockbridge, R. H. (1998). Existence of Markov controls and characterization of optimal Markov controls. SIAM Journal of Control and Optimization 36 609–653.
Mathematical Reviews (MathSciNet): MR1616514
Zentralblatt MATH: 0935.93064
Digital Object Identifier: doi:10.1137/S0363012995295516
[16] Kurtz, T. G. and Stockbridge, R. H. (2001). Stationary solutions and forward equations for controlled and singular martingale problems. Electronic Journal of Probability 6 paper 14, 1–52.
Mathematical Reviews (MathSciNet): MR1873294
Zentralblatt MATH: 0984.60086
[17] Manne, A. S. (1960). Linear programming and sequential decisions. Management Science 6 259–267.
Mathematical Reviews (MathSciNet): MR129022
Digital Object Identifier: doi:10.1287/mnsc.6.3.259
[18] Mendiondo, M. S. and Stockbridge, R. H. (1998). Approximation of infinite-dimensional linear programming problems which arise in stochastic control. SIAM Journal of Control and Optimization 36 1448–1472.
Mathematical Reviews (MathSciNet): MR1627589
Zentralblatt MATH: 0922.93051
Digital Object Identifier: doi:10.1137/S0363012996313367
[19] Röhl. S. (2001). Ein linearer Programmierungsansatz zur Lösung von Stopp- und Steuerungsproblemen. Dissertation, Humboldt Universitaet, Berlin.
[20] Stockbridge, R. H. (1990). Time-average control of martingale problems: A linear programming formulation. Annals of Probability 18 206–217.
Mathematical Reviews (MathSciNet): MR1043944
Zentralblatt MATH: 0699.49019
Digital Object Identifier: doi:10.1214/aop/1176990945
Project Euclid: euclid.aop/1176990945
[21] Zhang, Q. (2001). Stock trading: An optimal selling rule. SIAM Journal of Control and Optimization 40 64–87.
Mathematical Reviews (MathSciNet): MR1855306
Zentralblatt MATH: 0990.91014
Digital Object Identifier: doi:10.1137/S0363012999356325

2012 © Institute of Mathematical Statistics

Institute of Mathematical Statistics Collections

Institute of Mathematical Statistics Collections