SCOPUS 정보 검색 플랫폼

Volumn 2006, Issue , 2006, Pages 881-888

PAC model-free reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATION COSTS; MARKOV DECISION PROCESS; PARALLEL SAMPLING; Q-LEARNING;

COMPUTATION THEORY; DATA STORAGE EQUIPMENT; DECISION THEORY; MARKOV PROCESSES; MATHEMATICAL MODELS;

LEARNING SYSTEMS;

EID: 33749255382 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (140)

References (10)

1
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

2
- 0041965975
- R-MAX - A general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I., & Tennenholtz, M. (2002). R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3, 213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

3
- 14344266002
- Learning rates for Q-learning
- Even-Dar, E., & Mansour, Y. (2003). Learning rates for Q-learning. Journal of Machine Learning Research, 5, 1-25.
- (2003) Journal of Machine Learning Research , vol.5 , pp. 1-25
- Even-Dar, E.¹ Mansour, Y.²

5
- 23244466805
- Doctoral dissertation, Gatsby Computational Neuroscience Unit, University College London
- Kakade, S. M. (2003). On the sample complexity of reinforcement learning. Doctoral dissertation, Gatsby Computational Neuroscience Unit, University College London.
- (2003) On the Sample Complexity of Reinforcement Learning
- Kakade, S.M.¹

7
- 0036832954
- Near-optimal reinforcement learning in polynomial time
- Kearns, M. J., & Singh, S. P. (2002). Near-optimal reinforcement learning in polynomial time. Machine Learning, 49, 209-232.
- (2002) Machine Learning , vol.49 , pp. 209-232
- Kearns, M.J.¹ Singh, S.P.²

8
- 31844432138
- A theoretical analysis of model-based interval estimation
- Strehl, A. L., & Littman, M. L. (2005). A theoretical analysis of model-based interval estimation. Proceedings of the Twenty-second International Conference on Machine Learning (ICML-05) (pp. 857-864).
- (2005) Proceedings of the Twenty-second International Conference on Machine Learning (ICML-05) , pp. 857-864
- Strehl, A.L.¹ Littman, M.L.²

9
- 0004102479
- The MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. The MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

10
- 34249833101
- Q-learning
- Watkins, C. J. C. H., & Dayan, P. (1992). Q-learning. Machine Learning, 8, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.