[1] Howard R. A.: 
Dynamic Programming and Markov Processes. M.I.T. and Wiley Press, New York 1960. 
MR 0118514 | Zbl 0091.16001
[2] Denardo E. V., Miller B. L.:
An Optimality Condition for Discrete Dynamic Programming with No Discounting. Annals Mathem. Statistics 39 (1968), 4, 1220-1227. 
MR 0232593 | Zbl 0167.18402
[3] Lippman S. A.:
On the Set of Optimal Policies in Discrete Dynamic Programming. Journal Mathem. Analysis Applic. 24 (1968), 2, 440-445. 
MR 0231615 | Zbl 0194.20602
[4] Mandl P.:
Controlled Markov Chains (in Czech). Kybernetika 6 (1969), Supplement, 1-74.
MR 0434456
[6] Veinott A. F.:
On Finding Optimal Policies in Discrete Dynamic Programming with No Discounting. Annals Mathem. Statistics 37 (1966), 5, 1284-1294. 
MR 0208992 | Zbl 0149.16301