-
1
-
-
77958561050
-
Incremental policy generation for finite-horizon Dec-POMDPs
-
Thessaloniki, Greece, September 19-23, AAAI
-
Christopher Amato, Jilles Steeve Dibangoye, Shlomo Zilberstein, Incremental policy generation for finite-horizon Dec-POMDPs, in: Proceedings of the 19th International Conference on Automated Planning and Scheduling (ICAPS), Thessaloniki, Greece, September 19-23, AAAI, 2009.
-
(2009)
Proceedings of the 19th International Conference on Automated Planning and Scheduling (ICAPS)
-
-
Christopher, A.1
Jilles, S.D.2
Shlomo, Z.3
-
2
-
-
77952736651
-
An investigation into mathematical programming for finite horizon decentralized POMDPs
-
Aras Raghav, Dutech Alain An investigation into mathematical programming for finite horizon decentralized POMDPs. J. Artif. Intell. Res. 2010, 37:329-396.
-
(2010)
J. Artif. Intell. Res.
, vol.37
, pp. 329-396
-
-
Aras, R.1
Dutech, A.2
-
3
-
-
0041966002
-
Using confidence bounds for exploitation-exploration trade-offs
-
Auer Peter Using confidence bounds for exploitation-exploration trade-offs. J. Mach. Learn. Res. 2002, 3:397-422.
-
(2002)
J. Mach. Learn. Res.
, vol.3
, pp. 397-422
-
-
Auer, P.1
-
4
-
-
84868275593
-
Sample bounded distributed reinforcement learning for decentralized POMDPs
-
Toronto, Canada, July
-
Bikramjit Banerjee, Jeremy Lyle, Landon Kraemer, Rajesh Yellamraju. Sample bounded distributed reinforcement learning for decentralized POMDPs, in: Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI-12), Toronto, Canada, July 2012, pp. 1256-1262.
-
(2012)
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI-12)
, pp. 1256-1262
-
-
Bikramjit, B.1
Jeremy, L.2
Landon, K.3
Rajesh, Y.4
-
5
-
-
84880904080
-
General game learning using knowledge transfer
-
Hyderabad, India
-
Bikramjit Banerjee, Peter Stone, General game learning using knowledge transfer, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07), Hyderabad, India, 2007, pp. 672-677.
-
(2007)
Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07)
, pp. 672-677
-
-
Bikramjit, B.1
Peter, S.2
-
6
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
Bernstein Daniel S., Givan Robert, Immerman Neil, Zilberstein Shlomo The complexity of decentralized control of Markov decision processes. Math. Oper. Res. 2002, 27:819-840.
-
(2002)
Math. Oper. Res.
, vol.27
, pp. 819-840
-
-
Bernstein, D.S.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
8
-
-
84962119252
-
-
The MARL Toolbox version 1.3
-
Lucian Busoniu, The MARL Toolbox version 1.3, 2010. http://busoniu.net/repository.php.
-
(2010)
-
-
Busoniu, L.1
-
10
-
-
84899853392
-
Point-based incremental pruning heuristic for solving finite-horizon Dec-POMDPs
-
Budapest, Hungary
-
Jilles S. Dibangoye, Abdel-Illah Mouaddib, Brahim Chai-draa, Point-based incremental pruning heuristic for solving finite-horizon Dec-POMDPs, in: Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-09), Budapest, Hungary, 2009, pp. 569-576.
-
(2009)
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-09)
, pp. 569-576
-
-
Jilles, S.1
Dibangoye2
Abdel-Illah, M.3
Brahim, C.-D.4
-
12
-
-
84962119248
-
Mobile robotics: Kilobots
-
K-Team, Mobile robotics: Kilobots, 〈〉. http://www.k-team.com/mobile-robotics-products/kilobot.
-
-
-
-
14
-
-
84876899100
-
Informed initial policies for learning in Dec-POMDPs
-
Valencia, Spain, June
-
Landon Kraemer, Bikramjit Banerjee, Informed initial policies for learning in Dec-POMDPs, in: Proceedings of the AAMAS-12 Workshop on Adaptive Learning Agents (ALA-12), Valencia, Spain, June 2012, pp. 135-143.
-
(2012)
Proceedings of the AAMAS-12 Workshop on Adaptive Learning Agents (ALA-12)
, pp. 135-143
-
-
Landon, K.1
Bikramjit, B.2
-
15
-
-
84899441707
-
Concurrent reinforcement learning as a rehearsal for decentralized planning under uncertainty (extended abstract)
-
St. Paul, MN, May
-
Landon Kraemer, Bikramjit Banerjee, Concurrent reinforcement learning as a rehearsal for decentralized planning under uncertainty (extended abstract), in: Proceedings of the 12th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS-13), St. Paul, MN, May 2013, pp. 1291-1292.
-
(2013)
Proceedings of the 12th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS-13)
, pp. 1291-1292
-
-
Landon, K.1
Bikramjit, B.2
-
17
-
-
0030647149
-
Reinforcement learning in the multi-robot domain
-
Mataric Maja J. Reinforcement learning in the multi-robot domain. Auton. Robots 1997, 4:73-83.
-
(1997)
Auton. Robots
, vol.4
, pp. 73-83
-
-
Mataric, M.J.1
-
18
-
-
0141596576
-
Policy invariance under reward transformations: theory and application to reward shaping
-
Morgan Kaufmann, Bled, Slovenia
-
Andrew Y. Ng, Daishi Harada, Stuart Russell, Policy invariance under reward transformations: theory and application to reward shaping, in: Proceedings of 16th International Conference on Machine Learning, Morgan Kaufmann, Bled, Slovenia, 1999, pp. 278-287.
-
(1999)
Proceedings of 16th International Conference on Machine Learning
, pp. 278-287
-
-
Andrew, Y.N.1
Daishi, H.2
Stuart, R.3
-
19
-
-
84868289680
-
Heuristic search for identical payoff Bayesian games
-
Toronto, Canada
-
Frans A. Oliehoek, Matthijs T.J. Spaan, Jilles S. Dibangoye, Christopher Amato, Heuristic search for identical payoff Bayesian games, in: Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10), Toronto, Canada, 2010, pp. 1115-1122.
-
(2010)
Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10)
, pp. 1115-1122
-
-
Frans, A.1
Oliehoek2
Matthijs, T.J.3
Spaan4
Jilles, S.5
Dibangoye6
Christopher, A.7
-
20
-
-
52249098423
-
Optimal and approximate Q-value functions for decentralized POMDPs
-
Oliehoek Frans A., Spaan Matthijs T.J., Vlassis Nikos Optimal and approximate Q-value functions for decentralized POMDPs. JAIR 2008, 32:289-353.
-
(2008)
JAIR
, vol.32
, pp. 289-353
-
-
Oliehoek, F.A.1
Spaan, M.T.J.2
Vlassis, N.3
-
22
-
-
84899811776
-
Spaan, Lossless clustering of histories in decentralized POMDPs
-
Budapest, Hungary
-
Frans A. Oliehoek, Shimon Whiteson, Matthijs T.J. Spaan, Lossless clustering of histories in decentralized POMDPs, in: Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-09), Budapest, Hungary, 2009, pp. 577-584.
-
(2009)
Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-09)
, pp. 577-584
-
-
Frans, A.1
Oliehoek2
Shimon, W.3
Matthijs, T.J.4
-
23
-
-
27344432348
-
Accelerating reinforcement learning through implicit imitation
-
Price Bob, Boutilier Craig Accelerating reinforcement learning through implicit imitation. J. Artif. Intell. Res. 2003, 19:569-629.
-
(2003)
J. Artif. Intell. Res.
, vol.19
, pp. 569-629
-
-
Price, B.1
Boutilier, C.2
-
24
-
-
84880856384
-
Memory-bounded dynamic programming for Dec-POMDPs
-
Hyderabad, India
-
Sven Seuken, Shlomo Zilberstein, Memory-bounded dynamic programming for Dec-POMDPs, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07), Hyderabad, India, 2007, pp. 2009-2015.
-
(2007)
Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07)
, pp. 2009-2015
-
-
Sven, S.1
Shlomo, Z.2
-
25
-
-
84962095723
-
Dec-POMDP problem domains and format
-
Matthijs Spaan, Dec-POMDP problem domains and format. 〈〉. http://masplan.org/.
-
-
-
Matthijs, S.1
-
26
-
-
84868299292
-
Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion
-
Barcelona, Spain
-
Matthijs T.J. Spaan, Frans A. Oliehoek, Christopher Amato, Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion, in: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI-11), Barcelona, Spain, 2011, pp. 2027-2032.
-
(2011)
Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI-11)
, pp. 2027-2032
-
-
Matthijs, T.J.S.1
Frans, A.2
Oliehoek3
Christopher, A.4
-
28
-
-
33750691009
-
Point-based dynamic programming for Dec-POMDPs
-
Boston, MA
-
Daniel Szer, François Charpillet, Point-based dynamic programming for Dec-POMDPs, in: Proceedings of the 21st National Conference on Artificial Intelligence, Boston, MA, 2006, pp. 1233-1238.
-
(2006)
Proceedings of the 21st National Conference on Artificial Intelligence
, pp. 1233-1238
-
-
Daniel, S.1
François, C.2
-
29
-
-
80053153738
-
Rollout sampling policy iteration for decentralized POMDPs
-
Feng Wu, Shlomo Zilberstein, Xiaoping Chen, Rollout sampling policy iteration for decentralized POMDPs, in: Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI-10), 2010, pp. 666-673.
-
(2010)
Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI-10)
, pp. 666-673
-
-
Feng, W.1
Shlomo, Z.2
Xiaoping, C.3
|