Volume 53, Issue 4, 2008, Pages 1076-1082

Event-based optimization of Markov systems

Author keywords

Markov decision processes (MDPs); Performance potentials; Perturbation analysis (PA); Policy gradients; Policy iteration

Indexed keywords

DECISION THEORY; OPTIMIZATION; PERTURBATION TECHNIQUES; SENSITIVITY ANALYSIS;

EID: 44849134414     PISSN: 00189286     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAC.2008.919557     Document Type: Article
Times cited : (45)

References (9)
  • 1
    • A. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning, special issue on reinforcement learning," Discrete Event Dyn. Syst.: Theory Appl., vol. 13, pp. 41-77, 2003.
  • 2
    • J. Baxter and P. L. Bartlett, "Infinite-horizon policy-gradient estimation," J. Artif. Intell. Res., vol. 15, pp. 319-350, 2001.
  • 3
    • X. R. Cao, "Basic ideas for event-based optimization of Markov systems," Discrete Event Dyn. Syst.: Theory Appl., vol. 15, pp. 169-197, 2005.
  • 4
    • X. R. Cao and H. F. Chen, "Perturbation realization, potentials and sensitivity analysis of Markov processes," IEEE Trans. Autom. Control, vol. 42, no. 10, pp. 1382-1393, Oct. 1997.
  • 5
    • X. R. Cao and J. Y. Zhang, "The nth-order bias optimality for multi-chain Markov decision processes," IEEE Trans. Autom. Control, vol. 53, no. 2, pp. 496-508, Mar. 2008.
  • 7
    • W. L. Cooper, S. G. Henderson, and M. E. Lewis, "Convergence of simulation-based policy iteration," Probab. Eng. Inf. Sci., vol. 17, pp. 213-234, 2003.
  • 8
    • H. T. Fang and X. R. Cao, "Potential-based on-line policy iteration algorithms for Markov decision processes," IEEE Trans. Autom. Control, vol. 49, no. 4, pp. 493-505, Apr. 2004.
  • 9
    • P. Marbach and J. N. Tsitsiklis, "Simulation-based optimization of Markov reward processes," IEEE Trans. Autom. Control, vol. 46, no. 2, pp. 191-209, Feb. 2001.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.