Volume 1, Issue , 2007, Pages 645-650

Efficient structure learning in factored-state MDPs

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN NETWORKS; DATA STRUCTURES; LEARNING ALGORITHMS; PROBABILITY; PROBLEM SOLVING;

EID: 36348930987     PISSN: None     EISSN: None     Source Type: Conference Proceeding
DOI: None     Document Type: Conference Paper
Times cited: 84

References (11)
  • 1
    • Abbeel, P.; Koller, D.; and Ng, A. Y. 2006. Learning factor graphs in polynomial time and sample complexity. JMLR.
  • 2
    • Boutilier, C.; Dean, T.; and Hanks, S. 1999. Decision-theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research 11:1-94.
  • 3
    • Brafman, R. I., and Tennenholtz, M. 2002. R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
  • 5
    • Dietterich, T. G. 2000. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13:227-303.
  • 7
    • Kakade, S. M. 2003. On the Sample Complexity of Reinforcement Learning. Ph.D. Dissertation, Gatsby Computational Neuroscience Unit, University College London.
  • 9
    • Kearns, M. J., and Singh, S. P. 2002. Near-optimal reinforcement learning in polynomial time. Machine Learning 49(2-3):209-232.


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.