Abstract
Consider a process $X(\cdot) = {X(t), 0 \leq t < \infty}$ which takes values in the interval $I = (0, 1)$, satisfies a stochastic differential equation $$dX(t) = \beta(t)dt + \sigma(t)dW(t), X(0) = x \epsilon I$$ and, when it reaches an endpoint of the interval I, it is absorbed there. Suppose that the parameters $\beta$ and $\sigma$ are selected by a controller at each instant $t \epsilon [0, \infty)$ from a set depending on the current position. Assume also that the controller selects a stopping time $\tau$ for the process and seeks to maximize $\mathbf{E}u(X(\tau))$, where $u: [0, 1] \to \Re$ is a continuous "reward" function. If $\lambda := \inf{x \epsilon I: u(x) = \max u}$ and $\rho := \sup{x \epsilon I: u(x) = \max u}$, then, to the left of $\lambda$, it is best to maximize the mean-variance ratio $(\beta/\sigma^2)$ or to stop, and to the right of $\rho$, it is best to minimize the ratio $(\beta/\sigma^2)$ or to stop. Between $\lambda$ and $\rho$, it is optimal to follow any policy that will bring the process $X(\cdot)$ to a point of maximum for the function $u(\cdot)$ with probability 1, and then stop.
Citation
Ioannis Karatzas. William D. Sudderth. "Control and stopping of a diffusion process on an interval." Ann. Appl. Probab. 9 (1) 188 - 196, February 1999. https://doi.org/10.1214/aoap/1029962601
Information