### The Optimal Reward Operator in Special Classes of Dynamic Programming Problems

David A. Freedman
Source: Ann. Probab. Volume 2, Number 5 (1974), 942-949.

#### Abstract

Consider a dynamic programming problem with separable metric state space $S$, constraint set $A$, and reward function $r(x, P, y)$ for $(x, P)\in A$ and $y\in S$. Let $Tf$ be the optimal reward in one move, for the reward function $r(x, P, y) + f(y)$. Three results are proved. First, suppose $S$ is compact, $A$ closed, and $r$ upper semi-continuous; then $T^n0$ is upper semi-continuous, and there is an optimal Borel strategy for the $n$-move game. Second, suppose $S$ is compact, $A$ is an $F_\sigma$, and $\{r > a\}$ is an $F_\sigma$ for all $a$; then $\{T^n0 > a\}$ is an $F_\sigma$ for all $a$, and there is an $\varepsilon$-optimal Borel strategy for the $n$-move game. Third, suppose $A$ is open and $r$ is lower semi-continuous; then $T^n0$ is lower semi-continuous, and there is an $\varepsilon$-optimal Borel measurable strategy for the $n$-move game.

First Page:
Primary Subjects: 49C99
Secondary Subjects: 60K99, 90C99, 28A05
Full-text: Open access