[1] Balla, R.-K., and Fern, A. 2009. UCT for tactical assault planning in real-time strategy games. In IJCAI.
[2] Boutilier, C.; Dearden, R.; and Goldszmidt, M. 2000. Stochastic dynamic programming with factored representations. Artificial Intelligence 121(1):49-107.
[3] Coquelin, P.-A., and Munos, R. 2007. Bandit algorithms for tree search. In UAI.
[4] Croonenborghs, T.; Ramon, J.; Blockeel, H.; and Bruynooghe, M. 2007. Online learning and exploiting relational models in reinforcement learning. In IJCAI.
[5] Degris, T.; Sigaud, O.; and Wuillemin, P.-H. 2006. Learning the structure of factored Markov decision processes in reinforcement learning problems. In ICML.
[6] Gelly, S., and Silver, D. 2007. Combining online and offline knowledge in UCT. In ICML.
[7] Kearns, M.; Mansour, Y.; and Ng, A. Y. 2002. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning 49:193-208.
[8] Kocsis, L., and Szepesvari, C. 2006. Bandit based Monte-Carlo planning. In ECML.
[9] Lang, T., and Toussaint, M. 2009. Approximate inference for planning in stochastic relational worlds. In ICML.
[10] Li, L.; Littman, M. L.; and Walsh, T. J. 2008. Knows what it knows: A framework for self-aware learning. In ICML.
[14] Silver, D.; Sutton, R. S.; and Müller, M. 2008. Sample-based learning and search with permanent and transient memories. In ICML.
[17] Walsh, T. J.; Szita, I.; Diuk, C.; and Littman, M. L. 2009. Exploring compact reinforcement-learning representations with linear regression. In UAI.