SCOPUS 정보 검색 플랫폼

Volumn 48, Issue 5, 2003, Pages 758-769

Semi-Markov decision problems and performance sensitivity analysis

a HONG KONG UNIVERSITY OF SCIENCE AND TECHNOLOGY (Hong Kong)

Author keywords

Discounted Poisson equations; Discrete event dynamic systems (DEDS); Lyapunov equations; Markov decision processes (MDPs); Perturbation analysis (PA); Perturbation realization; Poisson equations; Policy iteration; Potentials; Reinforcement learning (RL)

Indexed keywords

ALGORITHMS; DECISION MAKING; ITERATIVE METHODS; LYAPUNOV METHODS; MATHEMATICAL MODELS; PERTURBATION TECHNIQUES; POISSON EQUATION; SENSITIVITY ANALYSIS;

REINFORCEMENT LEARNING (RL);

MARKOV PROCESSES;

EID: 0038631988 PISSN: 00189286 EISSN: None Source Type: Journal
DOI: 10.1109/TAC.2003.811252 Document Type: Article

Times cited : (90)

References (23)

1
- 0003565783
- Belmont, MA: Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control. Belmont, MA: Athena Scientific, 1995, vol. II.
- (1995) Dynamic Programming and Optimal Control , vol.2
- Bertsekas, D.P.¹

2
- 0016036113
- Nonnegative matrices in the mathematical sciences
- A. Herman and R. J. Plemmons, "Nonnegative matrices in the mathematical sciences," SIAM J. Numer. Anal. vol. 11, pp. 145-154, 1974.
- (1974) SIAM J. Numer. Anal. , vol.11 , pp. 145-154
- Herman, A.¹ Plemmons, R.J.²

3
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and T. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, T.N.²

4
- 0003983929
- New York: Springer-Verlag
- X.-R. Cao, Realization Probabilities: The Dynamics of Queueing Systems. New York: Springer-Verlag, 1994.
- (1994) Realization Probabilities: The Dynamics of Queueing Systems
- Cao, X.-R.¹

5
- 0009843739
- The Maclaurin series for performance functions of Markov chains
- _, "The Maclaurin series for performance functions of Markov chains," Adv. Appl. Probab., vol. 30, pp. 676-692, 1998.
- (1998) Adv. Appl. Probab. , vol.30 , pp. 676-692

6
- 0038258780
- The relation among potentials, perturbation analysis, Markov decision processes, and other topics
- _, "The relation among potentials, perturbation analysis, Markov decision processes, and other topics," J. Discrete Event Dyna. Syst., vol. 8, pp. 71-87, 1998.
- (1998) J. Discrete Event Dyna. Syst. , vol.8 , pp. 71-87

7
- 0033884215
- A unified approach to Markov decision problems and performance sensitivity analysis
- _, "A unified approach to Markov decision problems and performance sensitivity analysis," Automatica, vol. 36, pp. 771-774, 2000.
- (2000) Automatica , vol.36 , pp. 771-774

8
- 0037289322
- From perturbation analysis to Markov decision processes and reinforcement learning
- _, "From perturbation analysis to Markov decision processes and reinforcement learning," J. Discrete Event Dyna. Syst., vol. 13, pp. 9-39, 2003.
- (2003) J. Discrete Event Dyna. Syst. , vol.13 , pp. 9-39

9
- 0031258478
- Potentials, perturbation realization, and sensitivity analysis of Markov processes
- Oct.
- X.-R. Cao and H. F. Chen, "Potentials, perturbation realization, and sensitivity analysis of Markov processes," IEEE Trans. Automat. Contr., vol. 42, pp. 1382-1393, Oct. 1997.
- (1997) IEEE Trans. Automat. Contr. , vol.42 , pp. 1382-1393
- Cao, X.-R.¹ Chen, H.F.²

10
- 0036604532
- A time aggregation approach to Markov decision processes
- X.-R. Cao, Z. Y. Ren, S. Bhatnagar, M. Fu, and S. Marcus, "A time aggregation approach to Markov decision processes," Automatica, vol. 38, pp. 929-943, 2002.
- (2002) Automatica , vol.38 , pp. 929-943
- Cao, X.-R.¹ Ren, Z.Y.² Bhatnagar, S.³ Fu, M.⁴ Marcus, S.⁵

11
- 0003864139
- Norwell, MA: Kluwer
- C. Cassandras and S. Lafortune, Introduction to Discrete Event Dynamic Systems. Norwell, MA: Kluwer, 1999.
- (1999) Introduction to Discrete Event Dynamic Systems
- Cassandras, C.¹ Lafortune, S.²

12
- 0003745958
- Upper Saddle River, NJ: Prentice Hall
- E. Çinlar, Introduction to Stochastic Processes. Upper Saddle River, NJ: Prentice Hall, 1975.
- (1975) Introduction to Stochastic Processes
- Çinlar, E.¹

13
- 0038597488
- Single sample path based recursive algorithms for Markov decision processes
- to be published
- H.-T. Fang and X.-R. Cao, "Single sample path based recursive algorithms for Markov decision processes," IEEE Trans. Automat. Contr., 2003, to be published.
- (2003) IEEE Trans. Automat. Contr.
- Fang, H.-T.¹ Cao, X.-R.²

14
- 0030522182
- A Lyapunov bound for solutions of Poisson's equation
- P.W. Glynn and S. P. Meyn, "A Lyapunov bound for solutions of Poisson's equation," Ann. Probab., vol. 24, pp. 916-931, 1996.
- (1996) Ann. Probab. , vol.24 , pp. 916-931
- Glynn, P.W.¹ Meyn, S.P.²

15
- 0003585978
- Nonvell, MA: Kluwer
- Y. C. Ho and X.-R. Cao, Perturbation Analysis of Discrete-Event Dynamic Systems. Nonvell, MA: Kluwer, 1991.
- (1991) Perturbation Analysis of Discrete-Event Dynamic Systems
- Ho, Y.C.¹ Cao, X.-R.²

16
- 0003979966
- New York: Van Nostrand
- J. G. Kemeny and J. L. Snell, Finite Markov Chains. New York: Van Nostrand, 1960.
- (1960) Finite Markov Chains
- Kemeny, J.G.¹ Snell, J.L.²

17
- 0004210802
- Theory. New York: Wiley
- L. Kleinrock, Queueing Systems, Volume 1: Theory. New York: Wiley, 1975.
- (1975) Queueing Systems , vol.1
- Kleinrock, L.¹

18
- 0035249254
- Simulation-based optimization of Markov reward processes
- Feb.
- P. Marbach and T. N. Tsitsiklis, "Simulation-based optimization of Markov reward processes," IEEE Trans. Automat. Contr., vol. 46, pp. 191-209, Feb. 2001.
- (2001) IEEE Trans. Automat. Contr. , vol.46 , pp. 191-209
- Marbach, P.¹ Tsitsiklis, T.N.²

19
- 0003637131
- London, U.K.: Springer-Verlag
- S. P. Meyn and R. L. Tweedie, Markov Chains and Stochastic Stability. London, U.K.: Springer-Verlag, 1993.
- (1993) Markov Chains and Stochastic Stability
- Meyn, S.P.¹ Tweedie, R.L.²

20
- 85102627959
- New York: Wiley
- M. L. Puterman, Markov Decision Processes; Discrete Stochastic Dynamic Programming. New York: Wiley, 1994.
- (1994) Markov Decision Processes; Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

21
- 0004102479
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

22
- 0033170372
- Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R. S. Sutton, D. Precup, and S. Singh, "Between MDPs and Semi-MDPs: a framework for temporal abstraction in reinforcement learning," Artif. Intell., vol. 112, pp. 181-211, 1999.
- (1999) Artif. Intell. , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

23
- 0003636741
- Wiley
- H. C. Tijms, Stochastic Models - An Algorithmic Approach: Wiley, 1994.
- (1994) Stochastic Models - An Algorithmic Approach
- Tijms, H.C.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.