-
1
-
-
80053179816
-
Optimizing memory-bounded controllers for decentralized POMDPs
-
Amato, C.; Bernstein, D.; and Zilberstein, S. 2007. Optimizing memory-bounded controllers for decentralized POMDPs. In Proc. UAI.
-
(2007)
Proc. UAI
-
-
Amato, C.1
Bernstein, D.2
Zilberstein, S.3
-
2
-
-
0036874366
-
The complexity of decentralized control of markov decision processes
-
Bernstein, D. S.; Givan, R.; Immerman, N.; and Zilberstein, S. 2002. The complexity of decentralized control of markov decision processes. Mathematics of Operations Research 27:819-840.
-
(2002)
Mathematics of Operations Research
, vol.27
, pp. 819-840
-
-
Bernstein, D.S.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
3
-
-
0041965975
-
R-max - A general polynomial time algorithm for near-optimal reinforcement learning
-
Brafman, R. I., and Tennenholtz, M. 2002. R-max - A general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
-
(2002)
Journal of Machine Learning Research
, vol.3
, pp. 213-231
-
-
Brafman, R.I.1
Tennenholtz, M.2
-
4
-
-
0026998041
-
Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
-
San Jose, CA: AAAI Press
-
Chrisman, L. 1992. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the Tenth National Conference on Articial Intelligence, 183-188. San Jose, CA: AAAI Press.
-
(1992)
Proceedings of the Tenth National Conference on Articial Intelligence
, pp. 183-188
-
-
Chrisman, L.1
-
6
-
-
4544325183
-
Approximate solutions for partially observable stochastic games with common payoffs
-
Emery-Montemerlo, R.; Gordon, G.; Schneider, J.; and Thrun, S. 2004. Approximate solutions for partially observable stochastic games with common payoffs. Autonomous Agents and Multiagent Systems, International Joint Conference on 1:136-143.
-
(2004)
Autonomous Agents and Multiagent Systems, International Joint Conference on
, vol.1
, pp. 136-143
-
-
Emery-Montemerlo, R.1
Gordon, G.2
Schneider, J.3
Thrun, S.4
-
9
-
-
0002103968
-
Learning finite-state controllers for partially observable environments
-
Meuleau, N.; Peshkin, L.; Kim, K.; and Kaelbling, L. 1999. Learning finite-state controllers for partially observable environments. In Proc. UAI, 427-436.
-
(1999)
Proc. UAI
, pp. 427-436
-
-
Meuleau, N.1
Peshkin, L.2
Kim, K.3
Kaelbling, L.4
-
10
-
-
84880823326
-
Taming decentralized pomdps: Towards efficient policy computation for multiagent settings
-
Nair, R.; Tambe, M.; Yokoo, M.; Pynadath, D.; and Marsella, S. 2003. Taming decentralized pomdps: Towards efficient policy computation for multiagent settings. In Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI-03), 705-711.
-
(2003)
Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI-03)
, pp. 705-711
-
-
Nair, R.1
Tambe, M.2
Yokoo, M.3
Pynadath, D.4
Marsella, S.5
-
11
-
-
84868289680
-
Heuristic search for identical payoff bayesian games
-
Oliehoek, F. A.; Spaan, M. T. J.; Dibangoye, J. S.; and Amato, C. 2010. Heuristic search for identical payoff bayesian games. In Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10), 1115-1122.
-
(2010)
Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10)
, pp. 1115-1122
-
-
Oliehoek, F.A.1
Spaan, M.T.J.2
Dibangoye, J.S.3
Amato, C.4
-
14
-
-
33646435268
-
Model-based online learning of POMDPs
-
Proceedings of the European Conference on Machine Learning (ECML), volume Springer
-
Shani, G.; Brafman, R.; and Shimony, S. 2005. Model-based online learning of POMDPs. In Proceedings of the European Conference on Machine Learning (ECML), volume Lecture Notes in Computer Science 3720, 353-364. Springer.
-
(2005)
Lecture Notes in Computer Science
, vol.3720
, pp. 353-364
-
-
Shani, G.1
Brafman, R.2
Shimony, S.3
-
19
-
-
85140781301
-
Coordinated multi-agent reinforcement learning in networked distributed POMDPs
-
Zhang, C., and Lesser, V. 2011. Coordinated multi-agent reinforcement learning in networked distributed POMDPs. In Proc. AAAl-11.
-
(2011)
Proc. AAAl-11
-
-
Zhang, C.1
Lesser, V.2
|