Abstract
In this paper we consider the discrete time finite state Markov decision problem with Veinott's criterion of maximizing the Cesaro mean of the vector of expected returns received in a finite horizon as the horizon tends to infinity. A necessary and sufficient condition for optimality is obtained, and at the same time we verify Veinott's conjecture that there are optimal stationary policies.
Citation
E. V. Denardo. B. L. Miller. "An Optimality Condition for Discrete Dynamic Programming with no Discounting." Ann. Math. Statist. 39 (4) 1220 - 1227, August, 1968. https://doi.org/10.1214/aoms/1177698247
Information