SCOPUS 정보 검색 플랫폼

Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS

Volumn 1, Issue , 2008, Pages 518-525

Interaction-driven Markov games for decentralized multiagent planning under uncertainty

(2) Spaan, Matthijs T J a Melo, Francisco S a,b

a INSTITUTO SUPERIOR TÉCNICO (Portugal)

b Carnegie Mellon University (United States)

Author keywords

Cooperative multiagent systems; Planning under uncertainty; Team Markov games

Indexed keywords

AUTONOMOUS AGENTS;

APPROXIMATE SOLUTION; COOPERATIVE MULTIAGENT SYSTEMS; FUNDAMENTAL PROPERTIES; MARKOV GAMES; MULTI-AGENT DECISION MAKING; MULTI-AGENT PLANNING; MULTIAGENT DECISIONS; PLANNING UNDER UNCERTAINTY;

MULTI AGENT SYSTEMS;

EID: 84899992307 PISSN: 15488403 EISSN: 15582914 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (63)

References (21)

1
- 27344432831
- Solving transition independent decentralized Markov decision processes
- R. Becker, S. Zilberstein, V. Lesser, and C. Goldman. Solving transition independent decentralized Markov decision processes. J. Artificial Intelligence Research, 22:423-455, 2004.
- (2004) J. Artificial Intelligence Research , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.⁴

2
- 84880740944
- Bounded policy iteration for decentralized POMDPs
- D. Bernstein, E. Hansen, and S. Zilberstein. Bounded policy iteration for decentralized POMDPs. In Proc. Int. Joint Conf. Artificial Intelligence, pages 1287-1292, 2005.
- (2005) Proc. Int. Joint Conf. Artificial Intelligence , pp. 1287-1292
- Bernstein, D.¹ Hansen, E.² Zilberstein, S.³

3
- 0036874366
- The complexity of decentralized control of Markov decision processes
- D. Bernstein, S. Zilberstein, and N. Immerman. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4):819-840, 2002.
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.¹ Zilberstein, S.² Immerman, N.³

4
- 0003487482
- Athena Scientific
- D. Bertsekas and J. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

5
- 0002500351
- Planning learning and coordination in multiagent decision processes
- C. Boutilier. Planning, learning and coordination in multiagent decision processes. In Theoretical Aspects of Rationality and Knowledge, 1996.
- (1996) Theoretical Aspects of Rationality and Knowledge
- Boutilier, C.¹

6
- 0003989210
- PhD thesis, Brown University
- A. Cassandra. Exact and approximate algorithms for partially observable Markov decision processes. PhD thesis, Brown University, 1998.
- (1998) Exact and Approximate Algorithms for Partially Observable Markov Decision Processes
- Cassandra, A.¹

7
- 4544325183
- Approximate solutions for partially observable stochastic games with common payoffs
- R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Approximate solutions for partially observable stochastic games with common payoffs. In Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems, volume 1, pages 136-143, 2004.
- (2004) Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems , vol.1 , pp. 136-143
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

8
- 29344465971
- A framework for sequential planning in multiagent settings
- P. Gmytrasiewicz and P. Doshi. A framework for sequential planning in multiagent settings. J. Artificial Intelligence Research, 24:49-79, 2005.
- (2005) J. Artificial Intelligence Research , vol.24 , pp. 49-79
- Gmytrasiewicz, P.¹ Doshi, P.²

9
- 4644369748
- Nash Q-learning for general sum stochastic games
- J. Hu and M. Wellman. Nash Q-learning for general sum stochastic games. J. Machine Learning Research, 4:1039-1069, 2003.
- (2003) J. Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.²

10
- 40949099898
- Utile coordination: Learning interdependencies among cooperative agents
- J. Kok, P. Hoen, B. Bakker, and N. Vlassis. Utile coordination: Learning interdependencies among cooperative agents. In Proc. Symp. on Computational Intelligence and Games, pages 29-36, 2005.
- (2005) Proc. Symp. on Computational Intelligence and Games , pp. 29-36
- Kok, J.¹ Hoen, P.² Bakker, B.³ Vlassis, N.⁴

11
- 0000619048
- Extensive games and the problem of information
- H. Kuhn. Extensive games and the problem of information. Annals of Mathematics Studies, 28:193-216, 1953.
- (1953) Annals of Mathematics Studies , vol.28 , pp. 193-216
- Kuhn, H.¹

12
- 0031632806
- Solving very large weakly coupled Markov decision processes
- N. Meuleau, M. Hauskrecht, K. Kim, L. Peshkin, L. Kaelbling, T. Dean, and C. Boutilier. Solving very large weakly coupled Markov decision processes. In Proc. Nat. Conf. Artificial Intelligence, pages 165-172, 1998.
- (1998) Proc. Nat. Conf. Artificial Intelligence , pp. 165-172
- Meuleau, N.¹ Hauskrecht, M.² Kim, K.³ Peshkin, L.⁴ Kaelbling, L.⁵ Dean, T.⁶ Boutilier, C.⁷

13
- 29344437834
- Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. in Proc
- R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In Proc. Nat. Conf. Artificial Intelligence, pages 133-139, 2005.
- (2005) Nat. Conf. Artificial Intelligence , pp. 133-139
- Nair, R.¹ Varakantham, P.² Tambe, M.³ Yokoo, M.⁴

14
- 52249098423
- Optimal and approximate Q-value functions for decentralized POMDPs
- To appear
- F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. J. Artificial Intelligence Research, 2008. To appear.
- (2008) J. Artificial Intelligence Research
- Oliehoek, F.A.¹ Spaan, M.T.J.² Vlassis, N.³

15
- 84899909133
- Exploiting locality of interaction in factored Dec-POMDPs
- F. A. Oliehoek, M. T. J. Spaan, S. Whiteson, and N. Vlassis. Exploiting locality of interaction in factored Dec-POMDPs. In Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems, 2008.
- (2008) Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems
- Oliehoek, F.A.¹ Spaan, M.T.J.² Whiteson, S.³ Vlassis, N.⁴

16
- 84873476252
- Simple search methods for finding a Nash equilibrium
- in press
- R. Porter, E. Nudelman, and Y. Shoham. Simple search methods for finding a Nash equilibrium. Games and Economic Behavior, 2006 (in press).
- (2006) Games and Economic Behavior
- Porter, R.¹ Nudelman, E.² Shoham, Y.³

17
- 0010276944
- Implicit imitation in multiagent reinforcement learning
- B. Price and C. Boutilier. Implicit imitation in multiagent reinforcement learning. In Proc. Int. Conf. Machine Learning, pages 325-334, 1999.
- (1999) Proc. Int. Conf. Machine Learning , pp. 325-334
- Price, B.¹ Boutilier, C.²

18
- 1142292938
- The communicative multiagent team decision problem: Analyzing teamwork theories and models
- D. Pynadath and M. Tambe. The communicative multiagent team decision problem: Analyzing teamwork theories and models. J. Artificial Intelligence Research, 16:389-423, 2002.
- (2002) J. Artificial Intelligence Research , vol.16 , pp. 389-423
- Pynadath, D.¹ Tambe, M.²

19
- 84903438513
- Decentralized communication strategies for coordinated multi-agent policies
- M. Roth, R. Simmons, and M. Veloso. Decentralized communication strategies for coordinated multi-agent policies. In Multi-Robot Systems: From Swarms to Intelligent Automata, volume III, pages 93-106, 2005.
- (2005) Multi-Robot Systems: From Swarms to Intelligent Automata , vol.3 , pp. 93-106
- Roth, M.¹ Simmons, R.² Veloso, M.³

20
- 60349107649
- Exploiting factored representations for decentralized execution in multi-agent teams
- M. Roth, R. Simmons, and M. Veloso. Exploiting factored representations for decentralized execution in multi-agent teams. In Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems, 2007.
- (2007) Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems
- Roth, M.¹ Simmons, R.² Veloso, M.³

21
- 84899022377
- How to dynamically merge Markov decision processes
- M. Jordan, M. Kearns, and S. Solla, editors
- S. Singh and D. Cohn. How to dynamically merge Markov decision processes. In M. Jordan, M. Kearns, and S. Solla, editors, Adv. Neural Information Processing Systems 10, pages 1057-1063, 1998.
- (1998) Adv. Neural Information Processing Systems , vol.10 , pp. 1057-1063
- Singh, S.¹ Cohn, D.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.