SCOPUS Information Retrieval Platform

Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence, UAI 2006

2006, Pages 485-493

Incremental model-based learners with formal learning-time guarantees

Strehl, Alexander L.; Li, Lihong; Littman, Michael L.

Author keywords

[No Author keywords available]

Indexed keywords

ACTION SPACES; COMPUTATIONAL COSTS; COMPUTATIONAL DEMANDS; EXPERIMENTAL EVALUATION; FINITE STATE; INTERNAL MODELS; INTERVAL ESTIMATION; LARGE-SCALE PROBLEM; MARKOV DECISION PROCESSES; MODEL-BASED ALGORITHMS; PROBABLY APPROXIMATELY CORRECT; THEORETICAL FRAMEWORK;

ARTIFICIAL INTELLIGENCE; DYNAMIC PROGRAMMING; MARKOV PROCESSES; MATHEMATICAL MODELS;

LEARNING ALGORITHMS;

EID: 34548745051
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 40

References (11)

1
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.G.; Bradtke, S.J.; Singh, S.P.

2
- 0003487482
- Neuro-dynamic programming
- Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-dynamic programming. Belmont, MA: Athena Scientific.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.; Tsitsiklis, J.N.

3
- 0041965975
- R-MAX - A general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I., & Tennenholtz, M. (2002). R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3, 213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.; Tennenholtz, M.

4
- 1942421149
- Action elimination and stopping conditions for reinforcement learning
- Even-Dar, E., Mannor, S., & Mansour, Y. (2003). Action elimination and stopping conditions for reinforcement learning. The Twentieth International Conference on Machine Learning (ICML 2003) (pp. 162-169).
- (2003) The Twentieth International Conference on Machine Learning (ICML 2003) , pp. 162-169
- Even-Dar, E.; Mannor, S.; Mansour, Y.

5
- 78650606637
- A quantitative study of hypothesis selection
- Fong, P. W. L. (1995). A quantitative study of hypothesis selection. Proceedings of the Twelfth International Conference on Machine Learning (ICML-95) (pp. 226-234).
- (1995) Proceedings of the Twelfth International Conference on Machine Learning (ICML-95) , pp. 226-234
- Fong, P.W.L.

6
- 0004280606
- Learning in embedded systems
- Kaelbling, L. P. (1993). Learning in embedded systems. Cambridge, MA: The MIT Press.
- (1993) Learning in Embedded Systems
- Kaelbling, L.P.

7
- 23244466805
- On the sample complexity of reinforcement learning
- Kakade, S. M. (2003). On the sample complexity of reinforcement learning. Doctoral dissertation, Gatsby Computational Neuroscience Unit, University College London.
- (2003) On the Sample Complexity of Reinforcement Learning
- Kakade, S.M.

8
- 0036832954
- Near-optimal reinforcement learning in polynomial time
- Kearns, M. J., & Singh, S. P. (2002). Near-optimal reinforcement learning in polynomial time. Machine Learning, 49, 209-232.
- (2002) Machine Learning , vol.49 , pp. 209-232
- Kearns, M.J.; Singh, S.P.

9
- 31844432138
- A theoretical analysis of model-based interval estimation
- ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning
- Strehl, A. L., & Littman, M. L. (2005). A theoretical analysis of model-based interval estimation. Proceedings of the Twenty-second International Conference on Machine Learning (ICML-05) (pp. 857-864). (Pubitemid 43183415)
- (2005) ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning , pp. 857-864
- Strehl, A.L.; Littman, M.L.

10
- 0004102479
- Reinforcement learning: An introduction
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. The MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.; Barto, A.G.

11
- 0021518106
- A theory of the learnable
- Valiant, L. G. (1984). A theory of the learnable. Communications of the ACM, 27, 1134-1142.
- (1984) Communications of the ACM , vol.27 , pp. 1134-1142
- Valiant, L.G.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.