SCOPUS 정보 검색 플랫폼

Volumn 29, Issue 2, 2004, Pages 339-352

On average reward semi-Markov decision processes with a general multichain structure

a INSTITUTE OF APPLIED MATHEMATICS (China)

Author keywords

Average reward criterion; Data transformation method; Multichain structure; Optimal policy; Semi Markov decision processes

Indexed keywords

DECISION THEORY; MATHEMATICAL MODELS; OPTIMIZATION; PROBABILITY; STATE SPACE METHODS;

AVERAGE REWARD CRITERION; MULTICHAIN STRUCTURE; OPTIMAL POLICY; SEMI-MARKOV DECISION PROCESSES;

MARKOV PROCESSES;

EID: 4043081188 PISSN: 0364765X EISSN: None Source Type: Journal
DOI: 10.1287/moor.1030.0077 Document Type: Article

Times cited : (30)

References (32)

1
- 0027557742
- Discrete-time controlled Markov processes with average cost criterion: A survey
- Arapostathis, A., V. S. Borkar, E. Fernandez-Gaucherand, M. K. Ghosh, S. I. Marcus. 1993. Discrete-time controlled Markov processes with average cost criterion: A survey. SIAM J. Contin. Optim. 31(2) 282-344.
- (1993) SIAM J. Contin. Optim. , vol.31 , Issue.2 , pp. 282-344
- Arapostathis, A.¹ Borkar, V.S.² Fernandez-Gaucherand, E.³ Ghosh, M.K.⁴ Marcus., S.I.⁵

2
- 0008860655
- Time-average optimal constrained semi-Markov decision processes
- Beutler, F. J., K. W. Ross. 1986. Time-average optimal constrained semi-Markov decision processes. Adv. Appl. Probab. 18 341-359.
- (1986) Adv. Appl. Probab. , vol.18 , pp. 341-359
- Beutler, F.J.¹ Ross., K.W.²

3
- 0023420571
- Uniformization for semi-Markov decision processes under stationary policies
- Beutler, F. J., K. W. Ross. 1987. Uniformization for semi-Markov decision processes under stationary policies. J. Appl. Probab. 24 644-656.
- (1987) J. Appl. Probab. , vol.24 , pp. 644-656
- Beutler, F.J.¹ Ross., K.W.²

4
- 0032643313
- Solving semi-Markov decision problems using average reward reinforcement learning
- Das, T. K., A. Gosavi, S. Mahadevan, N. Marchalleck. 1999. Solving semi-Markov decision problems using average reward reinforcement learning. Management Sci. 45(4) 560-574.
- (1999) Management Sci. , vol.45 , Issue.4 , pp. 560-574
- Das, T.K.¹ Gosavi, A.² Mahadevan, S.³ Marchalleck., N.⁴

5
- 0344352147
- A dynamic programming algorithm for embedded Markov chains when the planning horizon is at infinity
- DeCani, J. S. 1964. A dynamic programming algorithm for embedded Markov chains when the planning horizon is at infinity. Management Sci. 10 716-733.
- (1964) Management Sci. , vol.10 , pp. 716-733
- DeCani, J.S.¹

6
- 0038956073
- Multichain Markov renewal programs
- Denardo, E. V., B. L. Fox. 1968. Multichain Markov renewal programs. SIAM J. Appl. Math. 16(3) 468-487.
- (1968) SIAM J. Appl. Math. , vol.16 , Issue.3 , pp. 468-487
- Denardo, E.V.¹ Fox., B.L.²

7
- 0021518483
- On the existence of average optimal policies in semi-regenerative decision models
- Deppe, H. 1984. On the existence of average optimal policies in semi-regenerative decision models. Math. Oper. Res. 9(4) 558-575.
- (1984) Math. Oper. Res. , vol.9 , Issue.4 , pp. 558-575
- Deppe, H.¹

8
- 0003421685
- Academic, New York
- Derman, C. 1970. Finite state Markovian Decision Processes. Academic, New York.
- (1970) Finite State Markovian Decision Processes
- Derman, C.¹

9
- 0003634432
- Springer-Verlag New York
- Dynkin, E. B., A. A. Yushkevich. 1979. Controlled Markov Processes. Springer-Verlag, New York.
- (1979) Controlled Markov Processes
- Dynkin, E.B.¹ Yushkevich., A.A.²

10
- 0001181029
- The optimality equation in average cost denumerable state semi-Markov decision problems recurrency conditions algorithms
- Federgruen, A., H. C. Tijms. 1978. The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms. J. Appl. Probab. 15 356-373.
- (1978) J. Appl. Probab. , vol.15 , pp. 356-373
- Federgruen, A.¹ Tijms., H.C.²

11
- 0141601273
- A fixed point approach to undiscounted Markov renewal programs
- Federgruen, A., P. J. Schweitzer. 1984. A fixed point approach to undiscounted Markov renewal programs. SIAM J. Algebra Discrete Math. 5(4 ) 539-550.
- (1984) SIAM J. Algebra Discrete Math. , vol.5 , Issue.4 , pp. 539-550
- Federgruen, A.¹ Schweitzer., P.J.²

12
- 4043183495
- Denumerable state semi-Markov decision processes with unbounded costs average cost criterion
- Federgruen, A., A. Hordijk, H. C. Tijms. 1979. Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion. Stochastic Proc. Appl. 9 223-235.
- (1979) Stochastic Proc. Appl. , vol.9 , pp. 223-235
- Federgruen, A.¹ Hordijk, A.² Tijms., H.C.³

13
- 0020749516
- Denumerable undiscounted semi-Markov decision processes with unbounded rewards
- Federgruen, A., P. J. Schweitzer, H. C. Tijms. 1983. Denumerable undiscounted semi-Markov decision processes with unbounded rewards. Math. Oper. Res. 8(2) 298-313.
- (1983) Math. Oper. Res. , vol.8 , Issue.2 , pp. 298-313
- Federgruen, A.¹ Schweitzer, P.J.² Tijms., H.C.³

14
- 0348090398
- Markov renewal programming by linear fractional programming
- Fox, B. 1966. Markov renewal programming by linear fractional programming. SIAM J. Appl. Math. 14(6) 1418-1432.
- (1966) SIAM J. Appl. Math. , vol.14 , Issue.6 , pp. 1418-1432
- Fox, B.¹

15
- 4043071436
- Existence of stationary optimal policies for some Markov renewal programs
- Fox, B. 1967. Existence of stationary optimal policies for some Markov renewal programs. SIAM Rev. 9(3) 573-576.
- (1967) SIAM Rev. , vol.9 , Issue.3 , pp. 573-576
- Fox, B.¹

16
- 0032201689
- Approximate solution of Markov renewal programs with finite time horizon
- Hinderer, K., K-H. Waldmann. 1998. Approximate solution of Markov renewal programs with finite time horizon. SIAM J. Control Optim. 37(2) 502-520.
- (1998) SIAM J. Control Optim. , vol.37 , Issue.2 , pp. 502-520
- Hinderer, K.¹ Waldmann., K.-H.²

17
- 4344615237
- Semi-Markovian decision processes
- Ottawa Canada
- Howard, R. A. 1963. Semi-Markovian decision processes. Proc. Internat. Statist. Inst. Ottawa, Canada.
- (1963) Proc. Internat. Statist. Inst.
- Howard, R.A.¹

18
- 4043080051
- An introduction to Markov decision processes
- Chinese Xidian University Xian China
- Hu, Q., J. Liu. 2000. An introduction to Markov decision processes (Chinese). Xidian University, Xian, China.
- (2000)
- Hu, Q.¹ Liu., J.²

19
- 0000067432
- Markov-renewal programming
- 1: Formulation, finite return models
- Jewell, W. S. 1963a. Markov-renewal programming. I: Formulation, finite return models. Oper Res. 11 938-948.
- (1963) Oper Res. , vol.11 , pp. 938-948
- Jewell, W.S.¹

20
- 0000067432
- Markov-renewal programming II 2: Infinite return models, example
- Jewell, W. S. 1963b. Markov-renewal programming. II: Infinite return models, example. Oper. Res. 11 949-971.
- (1963) Oper. Res. , pp. 949-971
- Jewell, W.S.¹

21
- 0002775664
- Linear programming finite Markov control problems
- Centre Amsterdam The Netherlands
- Kallenberg, L. C. M. 1983. Linear programming and finite Markov control problems. Math. Centre, Amsterdam, The Netherlands.
- (1983) Math
- Kallenberg, L.C.M.¹

22
- 0040345999
- Maximal average-reward policies for semi-Markov decision processes with arbitrary state and action space
- Lippman, S. A. 1971. Maximal average-reward policies for semi-Markov decision processes with arbitrary state and action space. Ann. Math. Statist. 42(5) 1717-1726.
- (1971) Ann. Math. Statist. , vol.42 , Issue.5 , pp. 1717-1726
- Lippman, S.A.¹

23
- 4043129713
- The functional equations of undiscounted denumerable state Markov renewal programming
- ed. Semi-Markov Model. Plenum New York
- Mann, E. 1986. The functional equations of undiscounted denumerable state Markov renewal programming. J. Janssen, ed. Semi-Markov Model. Plenum, New York, 79-96.
- (1986) J. Janssen , pp. 79-96
- Mann, E.¹

24
- 0141712981
- Improved conditions for convergence in undiscounted Markov renewal programming
- Platzman, L. 1977. Improved conditions for convergence in undiscounted Markov renewal programming. Oper. Res. 25 529-533.
- (1977) Oper. Res. , vol.25 , pp. 529-533
- Platzman, L.¹

25
- 0003998452
- John Wiley and Sons, New York
- Puterman, M. L. 1994. Markov Decision Processes: Discrete Stochastic Dynam. Programming. John Wiley and Sons, New York.
- (1994) Markov Decision Processes: Discrete Stochastic Dynam. Programming
- Puterman, M.L.¹

26
- 0003644137
- Holden Day San Francisco
- Ross, S. M. 1970. Applied Probability Models with Optimization Applications. Holden Day, San Francisco.
- (1970) Applied Probability Models With Optimization Applications
- Ross, S.M.¹

27
- 0141601275
- On the second optimality equation for semi-Markov decision models
- Schal, M. 1992. On the second optimality equation for semi-Markov decision models. Math. Oper. Res. 17(2) 470-486.
- (1992) Math. Oper. Res. , vol.17 , Issue.2 , pp. 470-486
- Schal, M.¹

28
- 0015080430
- Iterative solution of the functional equations of undiscounted Markov renewal programming
- Schweitzer, P. J. 1971. Iterative solution of the functional equations of undiscounted Markov renewal programming. J. Math. Anal. Appl. 34 495-501.
- (1971) J. Math. Anal. Appl. , vol.34 , pp. 495-501
- Schweitzer, P.J.¹

29
- 0021507292
- A value-iteration scheme for undiscounted multichain Markov renewal programs
- Schweitzer, P. J. 1984. A value-iteration scheme for undiscounted multichain Markov renewal programs. Zeitschrift Oper. Res. 28 143-152.
- (1984) Zeitschrift Oper. Res. , vol.28 , pp. 143-152
- Schweitzer, P.J.¹

30
- 4043126892
- Iterative bounds on the relative value vector in undiscounted Markov renewal programming
- Schweitzer, P. J. 1985. Iterative bounds on the relative value vector in undiscounted Markov renewal programming. Zeitschrift Oper. Res. 29 269-284.
- (1985) Zeitschrift Oper. Res. , vol.29 , pp. 269-284
- Schweitzer, P.J.¹

31
- 0018031514
- The functional equations of undiscounted Markov renewal programming
- Schweitzer, P. J., A. Federgruen. 1978. The functional equations of undiscounted Markov renewal programming. Math. Oper. Res. 3(4) 308-321.
- (1978) Math. Oper. Res. , vol.3 , Issue.4 , pp. 308-321
- Schweitzer, P.J.¹ Federgruen., A.²

32
- 84971109251
- Average cost semi-Markov decision processes the control of queueing systems
- Sennott, L. I. 1989. Average cost semi-Markov decision processes and the control of queueing systems. Probab. Engrg. Inform. Sci. 3 247-272.
- (1989) Probab. Engrg. Inform. Sci. , vol.3 , pp. 247-272
- Sennott, L.I.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.