SCOPUS 정보 검색 플랫폼

Volumn 45, Issue 7, 2009, Pages 1628-1638

Continuous-time Markov decision processes with nth-bias optimality criteria

b HONG KONG UNIVERSITY OF SCIENCE AND TECHNOLOGY (Hong Kong)

Author keywords

Continuous time systems; Markov decision processes; Multichain model; nth bias optimality criteria; Performance analysis; Policy iteration algorithms; Sensitivity analysis

Indexed keywords

MARKOV DECISION PROCESSES; MULTICHAIN MODEL; NTH-BIAS OPTIMALITY CRITERIA; PERFORMANCE ANALYSIS; POLICY ITERATION ALGORITHMS;

ALGORITHMS; ITERATIVE METHODS; MARKOV PROCESSES; NONLINEAR CONTROL SYSTEMS; OPTIMIZATION; SENSITIVITY ANALYSIS;

CONTINUOUS TIME SYSTEMS;

EID: 67349166673 PISSN: 00051098 EISSN: None Source Type: Journal
DOI: 10.1016/j.automatica.2009.03.009 Document Type: Article

Times cited : (7)

References (26)

1
- 0003706777
- Springer-Verlag, New York
- Anderson W.J. Continuous-time Markov chains (1991), Springer-Verlag, New York
- (1991) Continuous-time Markov chains
- Anderson, W.J.¹

2
- 0027557742
- Discrete-time controlled Markov processes with average cost criterion: A survey
- Arapostathis A., Borkar V.S., Fernandez-Gaucherand E., Ghosh M.K., and Markus S.I. Discrete-time controlled Markov processes with average cost criterion: A survey. SIAM Journal on Control and Optimization 31 2 (1993) 282-344
- (1993) SIAM Journal on Control and Optimization , vol.31 , Issue.2 , pp. 282-344
- Arapostathis, A.¹ Borkar, V.S.² Fernandez-Gaucherand, E.³ Ghosh, M.K.⁴ Markus, S.I.⁵

3
- 0037289322
- From perturbation analysis to Markov decision processes and reinforcement learning
- Cao X.-R. From perturbation analysis to Markov decision processes and reinforcement learning. Discrete Event Dynamic Systems: Theory and Applications 13 1-2 (2003) 9-39
- (2003) Discrete Event Dynamic Systems: Theory and Applications , vol.13 , Issue.1-2 , pp. 9-39
- Cao, X.-R.¹

4
- 84889784415
- Springer, New York
- Cao X.-R. Stochastic learning and optimization - a sensitivity-based approach (2007), Springer, New York
- (2007) Stochastic learning and optimization - a sensitivity-based approach
- Cao, X.-R.¹

5
- 3843150404
- A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: Multichain cases
- Cao X.-R., and Guo X.P. A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: Multichain cases. Automatica 40 9 (2004) 1749-1759
- (2004) Automatica , vol.40 , Issue.9 , pp. 1749-1759
- Cao, X.-R.¹ Guo, X.P.²

6
- 44849084717
- The nth-order bias optimality for multichain Markov decision processes
- Cao X.-R., and Zhang J.Y. The nth-order bias optimality for multichain Markov decision processes. IEEE Transactions on Automatic Control 53 2 (2008) 496-508
- (2008) IEEE Transactions on Automatic Control , vol.53 , Issue.2 , pp. 496-508
- Cao, X.-R.¹ Zhang, J.Y.²

7
- 84966250746
- On the integro-differential equations of purely discontinuous Markoff processes
- Feller W. On the integro-differential equations of purely discontinuous Markoff processes. Transactions of the American Mathematical Society 48 (1940) 488-515
- (1940) Transactions of the American Mathematical Society , vol.48 , pp. 488-515
- Feller, W.¹

8
- 33244489385
- Optimal control of ergodic continuous-time Markov chains with average sample-path rewards
- Guo X.P., and Cao X.-R. Optimal control of ergodic continuous-time Markov chains with average sample-path rewards. SIAM Journal on Control and Optimization 44 1 (2005) 29-48
- (2005) SIAM Journal on Control and Optimization , vol.44 , Issue.1 , pp. 29-48
- Guo, X.P.¹ Cao, X.-R.²

9
- 0037291699
- Drift and monotonicity conditions for continuous-time controlled Markov chains an average criterion
- Guo X.P., and Hernández-Lerma O. Drift and monotonicity conditions for continuous-time controlled Markov chains an average criterion. IEEE Transactions on Automatic Control 48 2 (2003) 236-245
- (2003) IEEE Transactions on Automatic Control , vol.48 , Issue.2 , pp. 236-245
- Guo, X.P.¹ Hernández-Lerma, O.²

10
- 48549099294
- A survey of recent results on continuous-time Markov decision processes
- Guo X.P., Hernández-Lerma O., and Prieto-Rumeau T. A survey of recent results on continuous-time Markov decision processes. Top 14 2 (2006) 177-261
- (2006) Top , vol.14 , Issue.2 , pp. 177-261
- Guo, X.P.¹ Hernández-Lerma, O.² Prieto-Rumeau, T.³

11
- 0035707615
- A note on optimality conditions for continuous-time Markov decision processes with average cost criterion
- Guo X.P., and Liu K. A note on optimality conditions for continuous-time Markov decision processes with average cost criterion. IEEE Transactions on Automatic Control 46 12 (2001) 1984-1989
- (2001) IEEE Transactions on Automatic Control , vol.46 , Issue.12 , pp. 1984-1989
- Guo, X.P.¹ Liu, K.²

12
- 33746866011
- Average optimality for continuous-time Markov decision processes in Polish spaces
- Guo X.P., and Rieder U. Average optimality for continuous-time Markov decision processes in Polish spaces. The Annals of Applied Probability 16 2 (2006) 730-756
- (2006) The Annals of Applied Probability , vol.16 , Issue.2 , pp. 730-756
- Guo, X.P.¹ Rieder, U.²

13
- 67349090704
- Bias optimality for multichain continuous-time Markov decision processes
- Preprint
- Guo, X. P., Song, X. Y., & Zhang, J. Y. (2009). Bias optimality for multichain continuous-time Markov decision processes. Preprint
- (2009)
- Guo, X.P.¹ Song, X.Y.² Zhang, J.Y.³

14
- 0032400452
- Bias optimality in controlled queueing systems
- Haviv M., and Puterman M.L. Bias optimality in controlled queueing systems. Journal of Applied Probability 35 1 (1998) 136-150
- (1998) Journal of Applied Probability , vol.35 , Issue.1 , pp. 136-150
- Haviv, M.¹ Puterman, M.L.²

15
- 0003644124
- Wiley, New York
- Howard R.A. Dynamic programming and Markov processes (1960), Wiley, New York
- (1960) Dynamic programming and Markov processes
- Howard, R.A.¹

16
- 0015300472
- Nondiscounted continuous-time Markov decision processes with countable state and action spaces
- Kakumanu P. Nondiscounted continuous-time Markov decision processes with countable state and action spaces. SIAM Journal on Control 10 (1972) 210-220
- (1972) SIAM Journal on Control , vol.10 , pp. 210-220
- Kakumanu, P.¹

17
- 0004284913
- CRC Press
- Kitaev M.Y., and Rykov V.V. Controlled queueing systems (1995), CRC Press
- (1995) Controlled queueing systems
- Kitaev, M.Y.¹ Rykov, V.V.²

18
- 33646733560
- Bias optimality
- Feinberg E.A., and Shwartz A. (Eds), Kluwer, Boston
- Lewis M.E., and Puterman M.L. Bias optimality. In: Feinberg E.A., and Shwartz A. (Eds). Handbook of Markov decision processes (2002), Kluwer, Boston 89-111
- (2002) Handbook of Markov decision processes , pp. 89-111
- Lewis, M.E.¹ Puterman, M.L.²

19
- 0001361614
- Finite state continuous time Markov decision processes with an infinite planning horizon
- Miller B.L. Finite state continuous time Markov decision processes with an infinite planning horizon. Journal Of Mathematical Analysis and Applications 22 (1968) 552-569
- (1968) Journal Of Mathematical Analysis and Applications , vol.22 , pp. 552-569
- Miller, B.L.¹

20
- 4344665765
- Springer
- Oksendal B., and Sulem A. Applied stochastic control of jump diffusions (2007), Springer
- (2007) Applied stochastic control of jump diffusions
- Oksendal, B.¹ Sulem, A.²

21
- 13944271922
- The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains
- Prieto-Rumeau T., and Hernández-Lerma O. The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains. Mathematical Methods of Operations Research 61 1 (2005) 123-145
- (2005) Mathematical Methods of Operations Research , vol.61 , Issue.1 , pp. 123-145
- Prieto-Rumeau, T.¹ Hernández-Lerma, O.²

22
- 33748788330
- Bias optimality for continuous-time controlled Markov chains
- Prieto-Rumeau T., and Hernández-Lerma O. Bias optimality for continuous-time controlled Markov chains. SIAM Journal on Control and Optimization 45 1 (2006) 51-73
- (2006) SIAM Journal on Control and Optimization , vol.45 , Issue.1 , pp. 51-73
- Prieto-Rumeau, T.¹ Hernández-Lerma, O.²

23
- 85102627959
- Wiley, New York
- Puterman M.L. Markov decision processes: Discrete stochastic dynamic programming (1994), Wiley, New York
- (1994) Markov decision processes: Discrete stochastic dynamic programming
- Puterman, M.L.¹

24
- 85060257643
- Wiley, New York
- Sennott L.I. Stochastic dynamic programming and the control of queueing systems (1999), Wiley, New York
- (1999) Stochastic dynamic programming and the control of queueing systems
- Sennott, L.I.¹

25
- 13944283496
- A Laurent series for the resolvent of a strongly continuous stochastic semi-group
- Taylor H.M. A Laurent series for the resolvent of a strongly continuous stochastic semi-group. Mathematical Programming Studies 6 (1976) 258-263
- (1976) Mathematical Programming Studies , vol.6 , pp. 258-263
- Taylor, H.M.¹

26
- 0000590831
- Discrete dynamic programming with sensitive discount optimality criteria
- Veinott A.F. Discrete dynamic programming with sensitive discount optimality criteria. The Annals of Mathematical Statistics 40 5 (1969) 1635-1660
- (1969) The Annals of Mathematical Statistics , vol.40 , Issue.5 , pp. 1635-1660
- Veinott, A.F.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.