SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Proceedings of the National Conference on Artificial Intelligence

Volumn 1, Issue , 2010, Pages 612-617

Integrating sample-based planning and model-based reinforcement learning

(3) Walsh, Thomas J a Goschin, Sergiu a Littman, Michael L a

a RUTGERS UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTATIONAL EFFICIENCY; PLANNING; REINFORCEMENT LEARNING;

COMPUTATION TIME; MODEL-BASED REINFORCEMENT LEARNING; NEAR-OPTIMAL POLICIES; NUMBER OF STATE; SAMPLE COMPLEXITY;

LEARNING ALGORITHMS;

EID: 77958578580 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (58)

References (17)

1
- 71549133876
- UCT for tactical assault planning in real-time strategy games
- Balla, R.-K., and Fern, A. 2009. UCT for tactical assault planning in real-time strategy games. In IJCAI.
- (2009) IJCAI
- Balla, R.-K.¹ Fern, A.²

2
- 0034248853
- Stochastic dynamic programming with factored representations
- Boutilier, C; Dearden, R.; and Goldszmidt, M. 2000. Stochastic dynamic programming with factored representations. Artificial Intelligence 121(1):49-107.
- (2000) Artificial Intelligence , vol.121 , Issue.1 , pp. 49-107
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

3
- 70349275222
- Bandit algorithms for tree search
- Coquelin, P.-A., and Munos, R. 2007. Bandit algorithms for tree search. In UAI.
- (2007) UAI
- Coquelin, P.-A.¹ Munos, R.²

4
- 84880882489
- Online learning and exploiting relational models in reinforcement learning
- Croonenborghs, T.; Ramon, J.; Blocked, H.; and Bruynooghe, M. 2007. Online learning and exploiting relational models in reinforcement learning. In IJCAI.
- (2007) IJCAI
- Croonenborghs, T.¹ Ramon, J.² Blocked, H.³ Bruynooghe, M.⁴

5
- 33749242809
- Learning the structure of factored Markov decision processes in reinforcement learning problems
- Degris, T; Sigaud, O.; and Wuillemin, P.-H. 2006. Learning the structure of factored Markov decision processes in reinforcement learning problems. In ICML.
- (2006) ICML
- Degris, T.¹ Sigaud, O.² Wuillemin, P.-H.³

6
- 77958578450
- Combining online and offline knowledge in UCT
- Gelly, S., and Silver, D. 2007. Combining online and offline knowledge in UCT. In ICML.
- (2007) ICML
- Gelly, S.¹ Silver, D.²

7
- 0036832951
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- Kearns, M.; Mansour, Y.; and Ng, A. Y. 2002. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning 49:193-208.
- (2002) Machine Learning , vol.49 , pp. 193-208
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

8
- 34547975806
- Bandit based Monte-Carlo planning
- Kocsis, L., and Szepesvari, C. 2006. Bandit based Monte-Carlo planning. In ECML.
- (2006) ECML
- Kocsis, L.¹ Szepesvari, C.²

9
- 71149086468
- Approximate inference for planning in stochastic relational worlds
- Lang, T, and Toussaint, M. 2009. Approximate inference for planning in stochastic relational worlds. In ICML.
- (2009) ICML
- Lang, T.¹ Toussaint, M.²

10
- 56449122733
- Knows what it knows: A framework for self-aware learning
- Li, L.; Littman, M. L.; and Walsh, T. J. 2008. Knows what it knows: A framework for self-aware learning. In ICML.
- (2008) ICML
- Li, L.¹ Littman, M.L.² Walsh, T.J.³

11
- 70349428076
- Ph.D. Dissertation, Rutgers University, NJ, USA
- Li, L. 2009. A Unifying Framework for Computational Reinforcement Learning Theory. Ph.D. Dissertation, Rutgers University, NJ, USA.
- (2009) A Unifying Framework for Computational Reinforcement Learning Theory
- Li, L.¹

12
- 34748875246
- Learning symbolic models of stochastic domains
- Pasula, H. M.; Zettlemoyer, L. S.; and Kaelbling, L. P. 2007. Learning symbolic models of stochastic domains. Journal of Artificial Intelligence Research 29:309-352.
- (2007) Journal of Artificial Intelligence Research , vol.29 , pp. 309-352
- Pasula, H.M.¹ Zettlemoyer, L.S.² Kaelbling, L.P.³

13
- 85102627959
- New York: Wiley
- Puterman, M. L. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York: Wiley.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

14
- 56449110907
- Sample-based learning and search with permanent and transient memories
- Silver, D.; Sutton, R. S.; and Müller, M. 2008. Sample-based learning and search with permanent and transient memories. In ICML.
- (2008) ICML
- Silver, D.¹ Sutton, R.S.² Müller, M.³

15
- 73549084301
- Reinforcement learning in finite MDPs: PAC analysis
- Strehl, A. L.; Li, L.; and Littman, M. L. 2009. Reinforcement learning in finite MDPs: PAC analysis. Journal of Machine Learning Research 10(2) :413-444.
- (2009) Journal of Machine Learning Research , vol.10 , Issue.2 , pp. 413-444
- Strehl, A.L.¹ Li, L.² Littman, M.L.³

16
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., and Barto, A. G. 1998. Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

17
- 79958846996
- Exploring compact reinforcement-learning representations with linear regression
- Walsh, T. J.; Szita, I.; Diuk, C; and Littman, M. L. 2009. Exploring compact reinforcement-learning representations with linear regression. In UAI.
- (2009) UAI
- Walsh, T.J.¹ Szita, I.² Diuk, C.³ Littman, M.L.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.