Conditions for the Equivalence of Optimality Criteria in Dynamic Programming

James Flynn

doi:10.1214/aos/1176343590

September, 1976 Conditions for the Equivalence of Optimality Criteria in Dynamic Programming

James Flynn

Ann. Statist. 4(5): 936-953 (September, 1976). DOI: 10.1214/aos/1176343590

Abstract

This paper examines the relationships between optimality criteria which are commonly used for undiscounted, discrete-time, countable state Markovian decision models. One approach, due to Blackwell, is to maximize the expected discounted total return as the discount factor approaches 1. Another, due to Veinott, is to maximize the Cesaro means of the finite horizon expected returns as the horizon tends to infinity. Derman's is to maximize the long-run average gain. Denardo, Miller and Lippman showed that Blackwell's and Veinott's approaches are equivalent for finite state and action spaces. As shown here, that equivalence breaks down when the state space is countable. Also, policies optimal according to Blackwell's or Veinott's approach need not be optimal according to Derman's. On the positive side, fairly weak conditions are given under which Blackwell's and Veinott's criteria imply Derman's, and somewhat stronger conditions under which Blackwell's and Veinott's criteria are equivalent.

Citation

Download Citation

James Flynn. "Conditions for the Equivalence of Optimality Criteria in Dynamic Programming." Ann. Statist. 4 (5) 936 - 953, September, 1976. https://doi.org/10.1214/aos/1176343590

Information

Published: September, 1976

First available in Project Euclid: 12 April 2007

zbMATH: 0351.93038

MathSciNet: MR429138

Digital Object Identifier: 10.1214/aos/1176343590

Subjects:

Primary: 49C15

Secondary: 60J10 , 60J20 , 62L99 , 90C40 , 93C55

Keywords: average gain , average overtaking criteria , discounting , dynamic programming , Markovian decision process , optimality criteria , small interest rates

Access the abstract

JOURNAL ARTICLE
18 PAGES

DOWNLOAD PDF + SAVE TO MY LIBRARY