메뉴 건너뛰기




Volumn 21, Issue 1, 1996, Pages 158-181

Overtaking and almost-sure optimality for infinite horizon Markov decision processes

Author keywords

Almost sure optimality; Controlled Markov chains; Infinite horizon optimal control; Infinite horizon optimality equation; Overtaking optimality

Indexed keywords

DECISION THEORY; DISCRETE TIME CONTROL SYSTEMS; EIGENVALUES AND EIGENFUNCTIONS; MARKOV PROCESSES; OPTIMAL CONTROL SYSTEMS; STATE SPACE METHODS;

EID: 0030087007     PISSN: 0364765X     EISSN: None     Source Type: Journal    
DOI: 10.1287/moor.21.1.158     Document Type: Article
Times cited : (12)

References (39)
  • 1
    • 0002525380 scopus 로고
    • On asymptotic optimality in probability and almost surely in dynamic control
    • Asriev, A. V., V. I. Rotar' (1990). On asymptotic optimality in probability and almost surely in dynamic control Stochastics Stochastics Rep. 33 1-16.
    • (1990) Stochastics Stochastics Rep. , vol.33 , pp. 1-16
    • Asriev, A.V.1    Rotar', V.I.2
  • 2
    • 0008949159 scopus 로고
    • Optimal decision procedures for finite Markov chains, Parts I-III
    • Bather, J. (1973). Optimal decision procedures for finite Markov chains, Parts I-III, Adv. Appl. Probab. 5 328-339, 521-553.
    • (1973) Adv. Appl. Probab. , vol.5 , pp. 328-339
    • Bather, J.1
  • 4
    • 38249021963 scopus 로고
    • Controlled semi-Markov models under long-run average rewards
    • Bhattacharya, R. N., M. Majumdar (1989). Controlled semi-Markov models under long-run average rewards, J. Statist. Plann. Inference 22 223-242.
    • (1989) J. Statist. Plann. Inference , vol.22 , pp. 223-242
    • Bhattacharya, R.N.1    Majumdar, M.2
  • 5
    • 0000615044 scopus 로고
    • Discrete dynamic programming
    • Blackwell, D. (1962). Discrete dynamic programming, Ann. Math. Statist. 33 719-726.
    • (1962) Ann. Math. Statist. , vol.33 , pp. 719-726
    • Blackwell, D.1
  • 7
    • 0017019748 scopus 로고
    • On existence of overtaking optimal trajectories over an infinite time horizon
    • Brock, W. A., A. Haurie (1976). On existence of overtaking optimal trajectories over an infinite time horizon, Math. Oper. Res. 1 337-346.
    • (1976) Math. Oper. Res. , vol.1 , pp. 337-346
    • Brock, W.A.1    Haurie, A.2
  • 8
    • 0022722259 scopus 로고
    • On the existence of catching up optimal solutions for Lagrange problems defined on unbounded intervals
    • Carlson, D. A. (1986). On the existence of catching up optimal solutions for Lagrange problems defined on unbounded intervals, J. Optim. Theory Appl. 49 1-19.
    • (1986) J. Optim. Theory Appl. , vol.49 , pp. 1-19
    • Carlson, D.A.1
  • 9
    • 0000104548 scopus 로고
    • Contraction mappings in the theory underlying dynamic programming
    • Denardo, E. V. (1967). Contraction mappings in the theory underlying dynamic programming, SIAM Rev. 9 165-177.
    • (1967) SIAM Rev. , vol.9 , pp. 165-177
    • Denardo, E.V.1
  • 10
    • 0000083216 scopus 로고
    • An optimality condition for discrete dynamical programming with no discounting
    • _, B. L. Miller (1968). An optimality condition for discrete dynamical programming with no discounting, Ann. Math. Statist. 39 1220-1227.
    • (1968) Ann. Math. Statist. , vol.39 , pp. 1220-1227
    • Miller, B.L.1
  • 11
    • 0141489723 scopus 로고
    • Overtaking optimality for Markov Decisions Chains
    • _, U. G. Rothblum (1979). Overtaking optimality for Markov Decisions Chains, Math. Oper. Res. 4 144-152.
    • (1979) Math. Oper. Res. , vol.4 , pp. 144-152
    • Rothblum, U.G.1
  • 12
    • 0003954462 scopus 로고
    • John Wiley and Sons, Inc., New York, London
    • Doob, J. L. (1967). Stochastic Processes, John Wiley and Sons, Inc., New York, London.
    • (1967) Stochastic Processes
    • Doob, J.L.1
  • 14
    • 0141824324 scopus 로고
    • On controlled finite state Markov processes with compact control sets
    • Fainberg, E. A. (1975). On controlled finite state Markov processes with compact control sets (English translation), Theory Probab. Appl. 20 856-862.
    • (1975) Theory Probab. Appl. , vol.20 , pp. 856-862
    • Fainberg, E.A.1
  • 15
    • 84983602577 scopus 로고
    • On optimality criteria for dynamic programs with long finite horizons
    • Flynn, J. (1980). On optimality criteria for dynamic programs with long finite horizons, J. Math. Anal. Appl. 76 202-208.
    • (1980) J. Math. Anal. Appl. , vol.76 , pp. 202-208
    • Flynn, J.1
  • 16
    • 5744243679 scopus 로고
    • Optimal steady states, excessive functions and deterministic dynamic programming
    • _ (1989). Optimal steady states, excessive functions and deterministic dynamic programming, J. Math. Anal. Appl. 144 586-594.
    • (1989) J. Math. Anal. Appl. , vol.144 , pp. 586-594
  • 17
    • 84959817843 scopus 로고
    • On optimal development in a multisector economy
    • Gale, D. (1967). On optimal development in a multisector economy, Rev. Econom. Stud. 34 1-19.
    • (1967) Rev. Econom. Stud. , vol.34 , pp. 1-19
    • Gale, D.1
  • 18
    • 0346379071 scopus 로고
    • On controlled discrete-time Markov Decision Processes
    • Gubenko, L. G., E. S. Štatland (1975). On controlled discrete-time Markov Decision Processes (English translation), Theory Probab. Math. Statist. 7 47-61.
    • (1975) Theory Probab. Math. Statist. , vol.7 , pp. 47-61
    • Gubenko, L.G.1    Štatland, E.S.2
  • 20
    • 3943073327 scopus 로고
    • Optimal control on an infinite horizon: The turnpike approach
    • Haurie, A. (1976). Optimal control on an infinite horizon: The turnpike approach, J. Math. Econom. 3 81-102.
    • (1976) J. Math. Econom. , vol.3 , pp. 81-102
    • Haurie, A.1
  • 21
    • 0026220873 scopus 로고
    • Average optimality in dynamic programming on Borel spaces-unbounded costs and controls
    • Hernández-Lerma, O. (1991). Average optimality in dynamic programming on Borel spaces-unbounded costs and controls, Systems Control Lett. 17 232-242.
    • (1991) Systems Control Lett. , vol.17 , pp. 232-242
    • Hernández-Lerma, O.1
  • 22
    • 0009258232 scopus 로고
    • Average cost Markov Decision Processes: Optimality conditions
    • _, J. C. Hennet, J. B. Lasserre (1991). Average cost Markov Decision Processes: Optimality conditions, J. Math. Anal. Appl. 158 396-406.
    • (1991) J. Math. Anal. Appl. , vol.158 , pp. 396-406
    • Hennet, J.C.1    Lasserre, J.B.2
  • 23
    • 0003434766 scopus 로고
    • Foundations of Non-Stationary Dynamic Programming with Discrete-Time Parameter
    • Springer-Verlag, New York
    • Hinderer, K. (1970). Foundations of Non-Stationary Dynamic Programming with Discrete-Time Parameter, Lecture Notes in Operations Research 33, Springer-Verlag, New York.
    • (1970) Lecture Notes in Operations Research 33
    • Hinderer, K.1
  • 24
    • 0008627283 scopus 로고
    • Dynamic programming and Markov potential theory
    • Amsterdam
    • Hordijk, A. (1974). Dynamic programming and Markov potential theory, Math. Centre Tracts 51, Amsterdam.
    • (1974) Math. Centre Tracts , vol.51
    • Hordijk, A.1
  • 26
    • 0003391813 scopus 로고
    • Academic Press, New York
    • Kuratowski, K. (1968). Topology. Vol. II, Academic Press, New York.
    • (1968) Topology , vol.2
    • Kuratowski, K.1
  • 28
    • 0023455653 scopus 로고
    • Infinite horizon optimization for finite state Markov chain
    • Leizarowitz, A. (1987). Infinite horizon optimization for finite state Markov chain, SIAM J. Control Optim. 25 1601-1618.
    • (1987) SIAM J. Control Optim. , vol.25 , pp. 1601-1618
    • Leizarowitz, A.1
  • 29
    • 0039122642 scopus 로고
    • On infinite products of stochastic matrices
    • _ (1992). On infinite products of stochastic matrices, Linear Algebra Appl. 168 189-219.
    • (1992) Linear Algebra Appl. , vol.168 , pp. 189-219
  • 30
    • 0040345999 scopus 로고
    • Maximal average-reward policies for semi-Markov decision processes with arbitrary state and action space
    • Lippman, S. A. (1971). Maximal average-reward policies for semi-Markov decision processes with arbitrary state and action space, Ann. Math. Statist. 42 1717-1726.
    • (1971) Ann. Math. Statist. , vol.42 , pp. 1717-1726
    • Lippman, S.A.1
  • 31
    • 77957101448 scopus 로고
    • Markov Decision Processes
    • Chapter 8 Eds. D. P. Heyman & M. J. Sobel, Elsevier Science Publications B. V. (North-Holland)
    • Puterman, M. L. (1990). Markov Decision Processes, Chapter 8 Handbooks in OR & MS vol. 2, Eds. D. P. Heyman & M. J. Sobel, Elsevier Science Publications B. V. (North-Holland).
    • (1990) Handbooks in OR & MS , vol.2
    • Puterman, M.L.1
  • 32
    • 5744238162 scopus 로고
    • Arbitrary state Markovian Decision Processes
    • Ross, S. M. (1968). Arbitrary state Markovian Decision Processes, Ann. Math. Statist. 29 2118-2122.
    • (1968) Ann. Math. Statist. , vol.29 , pp. 2118-2122
    • Ross, S.M.1
  • 34
    • 0001295069 scopus 로고
    • Conditions for optimality in dynamic programming and for the limit of n-stage optimal policy to be optimal
    • Schäl, M. (1975). Conditions for optimality in dynamic programming and for the limit of n-stage optimal policy to be optimal, Z. Wahrch. verw. Gebiete 32 179-196.
    • (1975) Z. Wahrch. Verw. Gebiete , vol.32 , pp. 179-196
    • Schäl, M.1
  • 36
    • 0011109733 scopus 로고
    • The asymptotic behaviour of undiscounted value iteration in Markov Decision Programs
    • Schweitzer, P. J., A. Federgrüen (1977). The asymptotic behaviour of undiscounted value iteration in Markov Decision Programs, Math. Oper. Res. 2 360-381.
    • (1977) Math. Oper. Res. , vol.2 , pp. 360-381
    • Schweitzer, P.J.1    Federgrüen, A.2
  • 37
    • 0000576225 scopus 로고
    • Negative dynamic programming
    • Strauch, R. E. (1966). Negative dynamic programming, Ann. Math. Statist. 37 871-890.
    • (1966) Ann. Math. Statist. , vol.37 , pp. 871-890
    • Strauch, R.E.1
  • 38
    • 0001473026 scopus 로고
    • On finding optimal policies in discrete dynamic programming with no discounting
    • Veinott, Jr., A. F. (1966). On finding optimal policies in discrete dynamic programming with no discounting, Ann. Math. Statist. 37 1284-294.
    • (1966) Ann. Math. Statist. , vol.37 , pp. 1284-1294
    • Veinott Jr., A.F.1
  • 39
    • 0002224201 scopus 로고
    • Existence of optimal programs of accumulation for an infinite time horizon
    • von Weizsäcker, C. C. (1968). Existence of optimal programs of accumulation for an infinite time horizon, Rev. Econom. Stud. 32 85-164.
    • (1968) Rev. Econom. Stud. , vol.32 , pp. 85-164
    • Von Weizsäcker, C.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.