-
1
-
-
33749245240
-
An environment model for nonstationary reinforcement learning
-
Choi, S. P. M., Yeung, D.-Y., & Zhang, N. L. (2000). An environment model for nonstationary reinforcement learning. Advances in Neural Information Processing Systems 12 (pp. 994-1000).
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 994-1000
-
-
Choi, S.P.M.1
Yeung, D.-Y.2
Zhang, N.L.3
-
2
-
-
33749246501
-
Hidden-mode markov decision processes for nonstationary sequential decision making
-
London, UK: Springer-Verlag
-
Choi, S. P. M., Yeung, D.-Y., & Zhang, N. L. (2001). Hidden-mode markov decision processes for nonstationary sequential decision making. Sequence Learning - Paradigms, Algorithms, and Applications (pp. 264-287). London, UK: Springer-Verlag.
-
(2001)
Sequence Learning - Paradigms, Algorithms, and Applications
, pp. 264-287
-
-
Choi, S.P.M.1
Yeung, D.-Y.2
Zhang, N.L.3
-
3
-
-
0036618011
-
Multiple model-based reinforcement learning
-
Doya, K., Samejima, K., Katagiri, K., & Kawato, M. (2002). Multiple model-based reinforcement learning. Neural Computation, 14, 1347-1369.
-
(2002)
Neural Computation
, vol.14
, pp. 1347-1369
-
-
Doya, K.1
Samejima, K.2
Katagiri, K.3
Kawato, M.4
-
4
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L. P., Littman, M., & Moore, A. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.2
Moore, A.3
-
5
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less time
-
Moore, A. W., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103-130.
-
(1993)
Machine Learning
, vol.13
, pp. 103-130
-
-
Moore, A.W.1
Atkeson, C.G.2
-
6
-
-
33749233935
-
Improving reinforcement learning with context detection
-
to appear Hakodate, Japan
-
Silva, B. C., Basso, E. W., Bazzan, A. L., Engel, P. M., & Perotto, F. S. (2006). Improving reinforcement learning with context detection. Fifth International Joint Conference on Autonomous Agents and Multi Agent Systems - (AAMAS 2006) - to appear. Hakodate, Japan.
-
(2006)
Fifth International Joint Conference on Autonomous Agents and Multi Agent Systems - (AAMAS 2006)
-
-
Silva, B.C.1
Basso, E.W.2
Bazzan, A.L.3
Engel, P.M.4
Perotto, F.S.5
-
7
-
-
31844457132
-
Predictive state representations: A new theory for modeling dynamical systems
-
Arlington, Virginia, United States: AUAI Press
-
Singh, S., James, M. R., & Rudary, M. R. (2004). Predictive state representations: a new theory for modeling dynamical systems. AUAI '04: Proceedings of the 20th conference on Uncertainty in Artificial Intelligence (pp. 512-519). Arlington, Virginia, United States: AUAI Press.
-
(2004)
AUAI '04: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence
, pp. 512-519
-
-
Singh, S.1
James, M.R.2
Rudary, M.R.3
-
9
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Sutton, R. S., Precup, D., & Singh, S. P. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.P.3
|