-
1
-
-
1142293055
-
Transition-independent decentralized Markov decision processes
-
Becker, R.; Zilberstein, S.; Lesser, V.; and Goldman, C. V. 2003. Transition-independent decentralized Markov decision processes. In Proceedings of the 2nd International Conference on Autonomous Agents and Multi-agent Systems, 41-48.
-
(2003)
Proceedings of the 2nd International Conference on Autonomous Agents and Multi-agent Systems
, pp. 41-48
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.V.4
-
2
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
Bernstein, D.; Givan, R.; Immerman, N.; and Zilberstein, S. 2002. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research 27 (4):819-840.
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
4
-
-
0041965975
-
R-MAX-a general polynomial time algorithm for near-optimal reinforcement learning
-
Brafman, R., and Tennenholtz, M. 2002. R-MAX-a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
-
(2002)
Journal of Machine Learning Research
, vol.3
, pp. 213-231
-
-
Brafman, R.1
Tennenholtz, M.2
-
6
-
-
0020113091
-
Decentralized control of finite state Markov processes
-
Hsu, K., and Marcus, S. I. 1982. Decentralized control of finite state Markov processes. IEEE Transactions on Automatic Control AC-27(2):426-431.
-
(1982)
IEEE Transactions on Automatic Control
, vol.AC-27
, Issue.2
, pp. 426-431
-
-
Hsu, K.1
Marcus, S.I.2
-
7
-
-
4644369748
-
Nash Q-learaing for general-sum stochastic games
-
Hu, J., and Wellman, M. 2003. Nash Q-learaing for general-sum stochastic games. Journal of Machine Learning Research 4:1039-1069.
-
(2003)
Journal of Machine Learning Research
, vol.4
, pp. 1039-1069
-
-
Hu, J.1
Wellman, M.2
-
8
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
Kaelbling, L.; Littman, M.; and Cassandra, A. 1998. Planning and acting in partially observable stochastic domains. Artificial Intelligence 101:99-134.
-
(1998)
Artificial Intelligence
, vol.101
, pp. 99-134
-
-
Kaelbling, L.1
Littman, M.2
Cassandra, A.3
-
11
-
-
0000619048
-
Extensive games and the problem of information
-
Kuhn, H., and Tucker, A., eds., Princeton University Press
-
Kuhn, H. 1953. Extensive games and the problem of information. In Kuhn, H., and Tucker, A., eds., Contributions to the Theory of Games II. Princeton University Press. 193-216.
-
(1953)
Contributions to the Theory of Games
, vol.2
, pp. 193-216
-
-
Kuhn, H.1
-
13
-
-
84880823326
-
Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings
-
Nair, R.; Pynadath, D.; Yokoo, M.; Tambe, M.; and Marsella, S. 2003. Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In Proceedings of the 18th International Joint Conference on Artificial Intelligence, 705-711.
-
(2003)
Proceedings of the 18th International Joint Conference on Artificial Intelligence
, pp. 705-711
-
-
Nair, R.1
Pynadath, D.2
Yokoo, M.3
Tambe, M.4
Marsella, S.5
-
15
-
-
0012646255
-
Learning to cooperate via policy search
-
Peshkin, L.; Kim, K.-E.; Meuleau, N.; and Kaelbling, L. P. 2000. Learning to cooperate via policy search. In Proceedings of the 16th International Conference on Uncertainty in Artificial Intelligence, 489-496.
-
(2000)
Proceedings of the 16th International Conference on Uncertainty in Artificial Intelligence
, pp. 489-496
-
-
Peshkin, L.1
Kim, K.-E.2
Meuleau, N.3
Kaelbling, L.P.4
-
18
-
-
0015658957
-
The optimal control of partially observable Markov processes over a finite horizon
-
Smallwood, R., and Sondik, E. 1973. The optimal control of partially observable Markov processes over a finite horizon. Operations Research 21:1071-1088.
-
(1973)
Operations Research
, vol.21
, pp. 1071-1088
-
-
Smallwood, R.1
Sondik, E.2
|