메뉴 건너뛰기




Volumn 29, Issue 2, 2004, Pages 339-352

On average reward semi-Markov decision processes with a general multichain structure

Author keywords

Average reward criterion; Data transformation method; Multichain structure; Optimal policy; Semi Markov decision processes

Indexed keywords

DECISION THEORY; MATHEMATICAL MODELS; OPTIMIZATION; PROBABILITY; STATE SPACE METHODS;

EID: 4043081188     PISSN: 0364765X     EISSN: None     Source Type: Journal    
DOI: 10.1287/moor.1030.0077     Document Type: Article
Times cited : (30)

References (32)
  • 2
    • 0008860655 scopus 로고
    • Time-average optimal constrained semi-Markov decision processes
    • Beutler, F. J., K. W. Ross. 1986. Time-average optimal constrained semi-Markov decision processes. Adv. Appl. Probab. 18 341-359.
    • (1986) Adv. Appl. Probab. , vol.18 , pp. 341-359
    • Beutler, F.J.1    Ross., K.W.2
  • 3
    • 0023420571 scopus 로고
    • Uniformization for semi-Markov decision processes under stationary policies
    • Beutler, F. J., K. W. Ross. 1987. Uniformization for semi-Markov decision processes under stationary policies. J. Appl. Probab. 24 644-656.
    • (1987) J. Appl. Probab. , vol.24 , pp. 644-656
    • Beutler, F.J.1    Ross., K.W.2
  • 4
    • 0032643313 scopus 로고    scopus 로고
    • Solving semi-Markov decision problems using average reward reinforcement learning
    • Das, T. K., A. Gosavi, S. Mahadevan, N. Marchalleck. 1999. Solving semi-Markov decision problems using average reward reinforcement learning. Management Sci. 45(4) 560-574.
    • (1999) Management Sci. , vol.45 , Issue.4 , pp. 560-574
    • Das, T.K.1    Gosavi, A.2    Mahadevan, S.3    Marchalleck., N.4
  • 5
    • 0344352147 scopus 로고
    • A dynamic programming algorithm for embedded Markov chains when the planning horizon is at infinity
    • DeCani, J. S. 1964. A dynamic programming algorithm for embedded Markov chains when the planning horizon is at infinity. Management Sci. 10 716-733.
    • (1964) Management Sci. , vol.10 , pp. 716-733
    • DeCani, J.S.1
  • 6
    • 0038956073 scopus 로고
    • Multichain Markov renewal programs
    • Denardo, E. V., B. L. Fox. 1968. Multichain Markov renewal programs. SIAM J. Appl. Math. 16(3) 468-487.
    • (1968) SIAM J. Appl. Math. , vol.16 , Issue.3 , pp. 468-487
    • Denardo, E.V.1    Fox., B.L.2
  • 7
    • 0021518483 scopus 로고
    • On the existence of average optimal policies in semi-regenerative decision models
    • Deppe, H. 1984. On the existence of average optimal policies in semi-regenerative decision models. Math. Oper. Res. 9(4) 558-575.
    • (1984) Math. Oper. Res. , vol.9 , Issue.4 , pp. 558-575
    • Deppe, H.1
  • 10
    • 0001181029 scopus 로고
    • The optimality equation in average cost denumerable state semi-Markov decision problems recurrency conditions algorithms
    • Federgruen, A., H. C. Tijms. 1978. The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms. J. Appl. Probab. 15 356-373.
    • (1978) J. Appl. Probab. , vol.15 , pp. 356-373
    • Federgruen, A.1    Tijms., H.C.2
  • 11
    • 0141601273 scopus 로고
    • A fixed point approach to undiscounted Markov renewal programs
    • Federgruen, A., P. J. Schweitzer. 1984. A fixed point approach to undiscounted Markov renewal programs. SIAM J. Algebra Discrete Math. 5(4 ) 539-550.
    • (1984) SIAM J. Algebra Discrete Math. , vol.5 , Issue.4 , pp. 539-550
    • Federgruen, A.1    Schweitzer., P.J.2
  • 12
    • 4043183495 scopus 로고
    • Denumerable state semi-Markov decision processes with unbounded costs average cost criterion
    • Federgruen, A., A. Hordijk, H. C. Tijms. 1979. Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion. Stochastic Proc. Appl. 9 223-235.
    • (1979) Stochastic Proc. Appl. , vol.9 , pp. 223-235
    • Federgruen, A.1    Hordijk, A.2    Tijms., H.C.3
  • 13
    • 0020749516 scopus 로고
    • Denumerable undiscounted semi-Markov decision processes with unbounded rewards
    • Federgruen, A., P. J. Schweitzer, H. C. Tijms. 1983. Denumerable undiscounted semi-Markov decision processes with unbounded rewards. Math. Oper. Res. 8(2) 298-313.
    • (1983) Math. Oper. Res. , vol.8 , Issue.2 , pp. 298-313
    • Federgruen, A.1    Schweitzer, P.J.2    Tijms., H.C.3
  • 14
    • 0348090398 scopus 로고
    • Markov renewal programming by linear fractional programming
    • Fox, B. 1966. Markov renewal programming by linear fractional programming. SIAM J. Appl. Math. 14(6) 1418-1432.
    • (1966) SIAM J. Appl. Math. , vol.14 , Issue.6 , pp. 1418-1432
    • Fox, B.1
  • 15
    • 4043071436 scopus 로고
    • Existence of stationary optimal policies for some Markov renewal programs
    • Fox, B. 1967. Existence of stationary optimal policies for some Markov renewal programs. SIAM Rev. 9(3) 573-576.
    • (1967) SIAM Rev. , vol.9 , Issue.3 , pp. 573-576
    • Fox, B.1
  • 16
    • 0032201689 scopus 로고    scopus 로고
    • Approximate solution of Markov renewal programs with finite time horizon
    • Hinderer, K., K-H. Waldmann. 1998. Approximate solution of Markov renewal programs with finite time horizon. SIAM J. Control Optim. 37(2) 502-520.
    • (1998) SIAM J. Control Optim. , vol.37 , Issue.2 , pp. 502-520
    • Hinderer, K.1    Waldmann., K.-H.2
  • 18
    • 4043080051 scopus 로고    scopus 로고
    • An introduction to Markov decision processes
    • Chinese Xidian University Xian China
    • Hu, Q., J. Liu. 2000. An introduction to Markov decision processes (Chinese). Xidian University, Xian, China.
    • (2000)
    • Hu, Q.1    Liu., J.2
  • 19
    • 0000067432 scopus 로고
    • Markov-renewal programming
    • 1: Formulation, finite return models
    • Jewell, W. S. 1963a. Markov-renewal programming. I: Formulation, finite return models. Oper Res. 11 938-948.
    • (1963) Oper Res. , vol.11 , pp. 938-948
    • Jewell, W.S.1
  • 20
    • 0000067432 scopus 로고
    • Markov-renewal programming II 2: Infinite return models, example
    • Jewell, W. S. 1963b. Markov-renewal programming. II: Infinite return models, example. Oper. Res. 11 949-971.
    • (1963) Oper. Res. , pp. 949-971
    • Jewell, W.S.1
  • 21
    • 0002775664 scopus 로고
    • Linear programming finite Markov control problems
    • Centre Amsterdam The Netherlands
    • Kallenberg, L. C. M. 1983. Linear programming and finite Markov control problems. Math. Centre, Amsterdam, The Netherlands.
    • (1983) Math
    • Kallenberg, L.C.M.1
  • 22
    • 0040345999 scopus 로고
    • Maximal average-reward policies for semi-Markov decision processes with arbitrary state and action space
    • Lippman, S. A. 1971. Maximal average-reward policies for semi-Markov decision processes with arbitrary state and action space. Ann. Math. Statist. 42(5) 1717-1726.
    • (1971) Ann. Math. Statist. , vol.42 , Issue.5 , pp. 1717-1726
    • Lippman, S.A.1
  • 23
    • 4043129713 scopus 로고
    • The functional equations of undiscounted denumerable state Markov renewal programming
    • ed. Semi-Markov Model. Plenum New York
    • Mann, E. 1986. The functional equations of undiscounted denumerable state Markov renewal programming. J. Janssen, ed. Semi-Markov Model. Plenum, New York, 79-96.
    • (1986) J. Janssen , pp. 79-96
    • Mann, E.1
  • 24
    • 0141712981 scopus 로고
    • Improved conditions for convergence in undiscounted Markov renewal programming
    • Platzman, L. 1977. Improved conditions for convergence in undiscounted Markov renewal programming. Oper. Res. 25 529-533.
    • (1977) Oper. Res. , vol.25 , pp. 529-533
    • Platzman, L.1
  • 27
    • 0141601275 scopus 로고
    • On the second optimality equation for semi-Markov decision models
    • Schal, M. 1992. On the second optimality equation for semi-Markov decision models. Math. Oper. Res. 17(2) 470-486.
    • (1992) Math. Oper. Res. , vol.17 , Issue.2 , pp. 470-486
    • Schal, M.1
  • 28
    • 0015080430 scopus 로고
    • Iterative solution of the functional equations of undiscounted Markov renewal programming
    • Schweitzer, P. J. 1971. Iterative solution of the functional equations of undiscounted Markov renewal programming. J. Math. Anal. Appl. 34 495-501.
    • (1971) J. Math. Anal. Appl. , vol.34 , pp. 495-501
    • Schweitzer, P.J.1
  • 29
    • 0021507292 scopus 로고
    • A value-iteration scheme for undiscounted multichain Markov renewal programs
    • Schweitzer, P. J. 1984. A value-iteration scheme for undiscounted multichain Markov renewal programs. Zeitschrift Oper. Res. 28 143-152.
    • (1984) Zeitschrift Oper. Res. , vol.28 , pp. 143-152
    • Schweitzer, P.J.1
  • 30
    • 4043126892 scopus 로고
    • Iterative bounds on the relative value vector in undiscounted Markov renewal programming
    • Schweitzer, P. J. 1985. Iterative bounds on the relative value vector in undiscounted Markov renewal programming. Zeitschrift Oper. Res. 29 269-284.
    • (1985) Zeitschrift Oper. Res. , vol.29 , pp. 269-284
    • Schweitzer, P.J.1
  • 31
    • 0018031514 scopus 로고
    • The functional equations of undiscounted Markov renewal programming
    • Schweitzer, P. J., A. Federgruen. 1978. The functional equations of undiscounted Markov renewal programming. Math. Oper. Res. 3(4) 308-321.
    • (1978) Math. Oper. Res. , vol.3 , Issue.4 , pp. 308-321
    • Schweitzer, P.J.1    Federgruen., A.2
  • 32
    • 84971109251 scopus 로고
    • Average cost semi-Markov decision processes the control of queueing systems
    • Sennott, L. I. 1989. Average cost semi-Markov decision processes and the control of queueing systems. Probab. Engrg. Inform. Sci. 3 247-272.
    • (1989) Probab. Engrg. Inform. Sci. , vol.3 , pp. 247-272
    • Sennott, L.I.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.