메뉴 건너뛰기




Volumn 45, Issue 7, 2009, Pages 1628-1638

Continuous-time Markov decision processes with nth-bias optimality criteria

Author keywords

Continuous time systems; Markov decision processes; Multichain model; nth bias optimality criteria; Performance analysis; Policy iteration algorithms; Sensitivity analysis

Indexed keywords

MARKOV DECISION PROCESSES; MULTICHAIN MODEL; NTH-BIAS OPTIMALITY CRITERIA; PERFORMANCE ANALYSIS; POLICY ITERATION ALGORITHMS;

EID: 67349166673     PISSN: 00051098     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.automatica.2009.03.009     Document Type: Article
Times cited : (7)

References (26)
  • 3
    • 0037289322 scopus 로고    scopus 로고
    • From perturbation analysis to Markov decision processes and reinforcement learning
    • Cao X.-R. From perturbation analysis to Markov decision processes and reinforcement learning. Discrete Event Dynamic Systems: Theory and Applications 13 1-2 (2003) 9-39
    • (2003) Discrete Event Dynamic Systems: Theory and Applications , vol.13 , Issue.1-2 , pp. 9-39
    • Cao, X.-R.1
  • 5
    • 3843150404 scopus 로고    scopus 로고
    • A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: Multichain cases
    • Cao X.-R., and Guo X.P. A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: Multichain cases. Automatica 40 9 (2004) 1749-1759
    • (2004) Automatica , vol.40 , Issue.9 , pp. 1749-1759
    • Cao, X.-R.1    Guo, X.P.2
  • 6
    • 44849084717 scopus 로고    scopus 로고
    • The nth-order bias optimality for multichain Markov decision processes
    • Cao X.-R., and Zhang J.Y. The nth-order bias optimality for multichain Markov decision processes. IEEE Transactions on Automatic Control 53 2 (2008) 496-508
    • (2008) IEEE Transactions on Automatic Control , vol.53 , Issue.2 , pp. 496-508
    • Cao, X.-R.1    Zhang, J.Y.2
  • 7
    • 84966250746 scopus 로고
    • On the integro-differential equations of purely discontinuous Markoff processes
    • Feller W. On the integro-differential equations of purely discontinuous Markoff processes. Transactions of the American Mathematical Society 48 (1940) 488-515
    • (1940) Transactions of the American Mathematical Society , vol.48 , pp. 488-515
    • Feller, W.1
  • 8
    • 33244489385 scopus 로고    scopus 로고
    • Optimal control of ergodic continuous-time Markov chains with average sample-path rewards
    • Guo X.P., and Cao X.-R. Optimal control of ergodic continuous-time Markov chains with average sample-path rewards. SIAM Journal on Control and Optimization 44 1 (2005) 29-48
    • (2005) SIAM Journal on Control and Optimization , vol.44 , Issue.1 , pp. 29-48
    • Guo, X.P.1    Cao, X.-R.2
  • 9
    • 0037291699 scopus 로고    scopus 로고
    • Drift and monotonicity conditions for continuous-time controlled Markov chains an average criterion
    • Guo X.P., and Hernández-Lerma O. Drift and monotonicity conditions for continuous-time controlled Markov chains an average criterion. IEEE Transactions on Automatic Control 48 2 (2003) 236-245
    • (2003) IEEE Transactions on Automatic Control , vol.48 , Issue.2 , pp. 236-245
    • Guo, X.P.1    Hernández-Lerma, O.2
  • 10
    • 48549099294 scopus 로고    scopus 로고
    • A survey of recent results on continuous-time Markov decision processes
    • Guo X.P., Hernández-Lerma O., and Prieto-Rumeau T. A survey of recent results on continuous-time Markov decision processes. Top 14 2 (2006) 177-261
    • (2006) Top , vol.14 , Issue.2 , pp. 177-261
    • Guo, X.P.1    Hernández-Lerma, O.2    Prieto-Rumeau, T.3
  • 11
    • 0035707615 scopus 로고    scopus 로고
    • A note on optimality conditions for continuous-time Markov decision processes with average cost criterion
    • Guo X.P., and Liu K. A note on optimality conditions for continuous-time Markov decision processes with average cost criterion. IEEE Transactions on Automatic Control 46 12 (2001) 1984-1989
    • (2001) IEEE Transactions on Automatic Control , vol.46 , Issue.12 , pp. 1984-1989
    • Guo, X.P.1    Liu, K.2
  • 12
    • 33746866011 scopus 로고    scopus 로고
    • Average optimality for continuous-time Markov decision processes in Polish spaces
    • Guo X.P., and Rieder U. Average optimality for continuous-time Markov decision processes in Polish spaces. The Annals of Applied Probability 16 2 (2006) 730-756
    • (2006) The Annals of Applied Probability , vol.16 , Issue.2 , pp. 730-756
    • Guo, X.P.1    Rieder, U.2
  • 13
    • 67349090704 scopus 로고    scopus 로고
    • Bias optimality for multichain continuous-time Markov decision processes
    • Preprint
    • Guo, X. P., Song, X. Y., & Zhang, J. Y. (2009). Bias optimality for multichain continuous-time Markov decision processes. Preprint
    • (2009)
    • Guo, X.P.1    Song, X.Y.2    Zhang, J.Y.3
  • 14
    • 0032400452 scopus 로고    scopus 로고
    • Bias optimality in controlled queueing systems
    • Haviv M., and Puterman M.L. Bias optimality in controlled queueing systems. Journal of Applied Probability 35 1 (1998) 136-150
    • (1998) Journal of Applied Probability , vol.35 , Issue.1 , pp. 136-150
    • Haviv, M.1    Puterman, M.L.2
  • 16
    • 0015300472 scopus 로고
    • Nondiscounted continuous-time Markov decision processes with countable state and action spaces
    • Kakumanu P. Nondiscounted continuous-time Markov decision processes with countable state and action spaces. SIAM Journal on Control 10 (1972) 210-220
    • (1972) SIAM Journal on Control , vol.10 , pp. 210-220
    • Kakumanu, P.1
  • 18
    • 33646733560 scopus 로고    scopus 로고
    • Bias optimality
    • Feinberg E.A., and Shwartz A. (Eds), Kluwer, Boston
    • Lewis M.E., and Puterman M.L. Bias optimality. In: Feinberg E.A., and Shwartz A. (Eds). Handbook of Markov decision processes (2002), Kluwer, Boston 89-111
    • (2002) Handbook of Markov decision processes , pp. 89-111
    • Lewis, M.E.1    Puterman, M.L.2
  • 19
    • 0001361614 scopus 로고
    • Finite state continuous time Markov decision processes with an infinite planning horizon
    • Miller B.L. Finite state continuous time Markov decision processes with an infinite planning horizon. Journal Of Mathematical Analysis and Applications 22 (1968) 552-569
    • (1968) Journal Of Mathematical Analysis and Applications , vol.22 , pp. 552-569
    • Miller, B.L.1
  • 21
    • 13944271922 scopus 로고    scopus 로고
    • The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains
    • Prieto-Rumeau T., and Hernández-Lerma O. The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains. Mathematical Methods of Operations Research 61 1 (2005) 123-145
    • (2005) Mathematical Methods of Operations Research , vol.61 , Issue.1 , pp. 123-145
    • Prieto-Rumeau, T.1    Hernández-Lerma, O.2
  • 25
    • 13944283496 scopus 로고
    • A Laurent series for the resolvent of a strongly continuous stochastic semi-group
    • Taylor H.M. A Laurent series for the resolvent of a strongly continuous stochastic semi-group. Mathematical Programming Studies 6 (1976) 258-263
    • (1976) Mathematical Programming Studies , vol.6 , pp. 258-263
    • Taylor, H.M.1
  • 26
    • 0000590831 scopus 로고
    • Discrete dynamic programming with sensitive discount optimality criteria
    • Veinott A.F. Discrete dynamic programming with sensitive discount optimality criteria. The Annals of Mathematical Statistics 40 5 (1969) 1635-1660
    • (1969) The Annals of Mathematical Statistics , vol.40 , Issue.5 , pp. 1635-1660
    • Veinott, A.F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.