[3] Barto, A., & Mahadevan, S. (2003). Recent advances in hierarchical reinforcement learning. Discrete Event Systems Special Issue on Reinforcement Learning, 13, 41-77.
[6] Bowling, M., & Veloso, M. (2002). Multiagent learning using a variable learning rate. Artificial Intelligence, 136, 215-250.
[7] Bui, H., Venkatesh, S., & West, G. (2002). Policy recognition in the abstract hidden Markov model. Journal of Artificial Intelligence Research, 17, 451-499.
[8] Crites, R., & Barto, A. (1998). Elevator group control using multiple reinforcement learning agents. Machine Learning, 33, 235-262.
[9] Dietterich, T. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
[20] Lee, J. (1996). Composite dispatching rules for multiple-vehicle AGV systems. Simulation, 66, 121-130.
[25] Mataric, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4, 73-83.
[27] Owen, G. (1995). Game theory. Academic Press.
[29] Peshkin, L., Kim, K., Meuleau, N., & Kaelbling, L. (2000). Learning to cooperate via policy search. In Proceedings of the sixteenth international conference on uncertainty in artificial intelligence (pp. 489-496).
[31] Pynadath, D., & Tambe, M. (2002). The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of Artificial Intelligence Research, 16, 389-426.
[34] Schneider, J., Wong, W., Moore, A., & Riedmiller, M. (1999). Distributed value functions. In Proceedings of the sixteenth international conference on machine learning (pp. 371-378).
[37] Sugawara, T., & Lesser, V. (1998). Learning to improve coordinated actions in cooperative distributed problem-solving environments. Machine Learning, 33, 129-154.
[38] Sutton, R., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.