-
1
-
-
84902354924
-
Decentralized control of partially observable Markov decision processes
-
C. Amato, G. Chowdhary, A. Geramifard, N. K. Ure, and M. J. Kochenderfer. Decentralized control of partially observable Markov decision processes. In Proceedings of the Fifty-Second IEEE Conference on Decision and Control, 2013.
-
(2013)
Proceedings of the Fifty-Second IEEE Conference on Decision and Control
-
-
Amato, C.1
Chowdhary, G.2
Geramifard, A.3
Ure, N.K.4
Kochenderfer, M.J.5
-
4
-
-
0141988716
-
Recent advances in hierarchical reinforcement learning
-
A. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:41-77, 2003.
-
(2003)
Discrete Event Dynamic Systems
, vol.13
, pp. 41-77
-
-
Barto, A.1
Mahadevan, S.2
-
6
-
-
27344432831
-
Solving transition-independent decentralized Markov decision processes
-
R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman. Solving transition-independent decentralized Markov decision processes. Journal of Artificial Intelligence Research, 22:423-455, 2004.
-
(2004)
Journal of Artificial Intelligence Research
, vol.22
, pp. 423-455
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.V.4
-
7
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4):819-840, 2002.
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.S.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
12
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
T. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.1
-
13
-
-
33846942607
-
Hierarchical multi-agent reinforcement learning
-
M. Ghavamzadeh, S. Mahadevan, and R. Makar. Hierarchical multi-agent reinforcement learning. Journal of Autonomous Agents and Multi-Agent Systems, 13(2):197-229, 2006.
-
(2006)
Journal of Autonomous Agents and Multi-Agent Systems
, vol.13
, Issue.2
, pp. 197-229
-
-
Ghavamzadeh, M.1
Mahadevan, S.2
Makar, R.3
-
15
-
-
27844487453
-
A survey of multi-agent organizational paradigms
-
B. Horling and V. Lesser. A survey of multi-agent organizational paradigms. The Knowledge Engineering Review, 19(4):281-316, 2004.
-
(2004)
The Knowledge Engineering Review
, vol.19
, Issue.4
, pp. 281-316
-
-
Horling, B.1
Lesser, V.2
-
23
-
-
84878301624
-
Incremental clustering and expansion for faster optimal planning in dec-POMDPs
-
F. A. Oliehoek, M. T. J. Spaan, C. Amato, and S. Whiteson. Incremental clustering and expansion for faster optimal planning in Dec-POMDPs. Journal of Artificial Intelligence Research, 46:449-509, 2013.
-
(2013)
Journal of Artificial Intelligence Research
, vol.46
, pp. 449-509
-
-
Oliehoek, F.A.1
Spaan, M.T.J.2
Amato, C.3
Whiteson, S.4
-
28
-
-
85122663910
-
Navigation among movable obstacles: Real-time reasoning in complex environments
-
M. Stilman and J. Kuffner. Navigation among movable obstacles: Real-time reasoning in complex environments. International Journal on Humanoid Robotics, 2(4):479-504, 2005.
-
(2005)
International Journal on Humanoid Robotics
, vol.2
, Issue.4
, pp. 479-504
-
-
Stilman, M.1
Kuffner, J.2
-
29
-
-
27544506565
-
Reinforcement learning for robocup soccer keepaway
-
P. Stone, R. Sutton, and G. Kuhlmann. Reinforcement learning for robocup soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
-
(2005)
Adaptive Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.2
Kuhlmann, G.3
-
30
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
R. S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1):181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
|