SCOPUS 정보 검색 플랫폼

Artificial Intelligence

Volumn 175, Issue 11, 2011, Pages 1757-1789

Decentralized MDPs with sparse interactions

(2) Melo, Francisco S a Veloso, Manuela b

a Edifício IST ^* (Portugal)

b Carnegie Mellon University ^* (United States)

Author keywords

Decentralized Markov decision processes; Multiagent coordination; Sparse interaction

Indexed keywords

DECISION-THEORETIC; INDEPENDENT AGENTS; LOCAL INTERACTIONS; MARKOV DECISION PROCESSES; MULTI-AGENT; MULTI-AGENT COORDINATIONS; NEW MODEL; SOLUTION METHODS; SPARSE INTERACTION; STATE SPACE; THEORETICAL ERRORS;

ERROR ANALYSIS; LEARNING ALGORITHMS; MARKOV PROCESSES;

MULTI AGENT SYSTEMS;

EID: 79955976414 PISSN: 00043702 EISSN: None Source Type: Journal
DOI: 10.1016/j.artint.2011.05.001 Document Type: Article

Times cited : (85)

References (51)

1
- 0031632806
- Solving very large weakly coupled Markov decision processes
- N. Meuleau, M. Hauskrecht, K. Kim, L. Peshkin, L. Kaelbling, T. Dean, C. Boutilier, Solving very large weakly coupled Markov decision processes, in: Proc. 15th AAAI Conf. Artificial Intelligence, 1998, pp. 165-172.
- (1998) Proc. 15th AAAI Conf. Artificial Intelligence , pp. 165-172
- Meuleau, N.¹ Hauskrecht, M.² Kim, K.³ Peshkin, L.⁴ Kaelbling, L.⁵ Dean, T.⁶ Boutilier, C.⁷

2
- 84899022377
- How to dynamically merge Markov decision processes
- S. Singh, and D. Cohn How to dynamically merge Markov decision processes Advances in Neural Information Processing Systems 10 1998 1057 1063
- (1998) Advances in Neural Information Processing Systems , vol.10 , pp. 1057-1063
- Singh, S.¹ Cohn, D.²

3
- 27344450974
- Hybrid BDI-POMDP framework for multiagent teaming
- R. Nair, and M. Tambe Hybrid BDI-POMDP framework for multiagent teaming Journal of Artificial Intelligence Research 23 2005 367 420 (Pubitemid 41525873)
- (2005) Journal of Artificial Intelligence Research , vol.23 , pp. 367-420
- Nair, R.¹ Tambe, M.²

4
- 0032645144
- Team-partitioned, opaque-transition reinforcement learning
- P. Stone, M. Veloso, Team-partitioned, opaque-transition reinforcement learning, in: Proc. RoboCup-98, 1998, pp. 206-212.
- (1998) Proc. RoboCup-98 , pp. 206-212
- Stone, P.¹ Veloso, M.²

5
- 33846942607
- Hierarchical multiagent reinforcement learning
- M. Ghavamzadeh, S. Mahadevan, and R. Makar Hierarchical multiagent reinforcement learning Journal of Autonomous Agents and Multiagent Systems 13 2 2006 197 229
- (2006) Journal of Autonomous Agents and Multiagent Systems , vol.13 , Issue.2 , pp. 197-229
- Ghavamzadeh, M.¹ Mahadevan, S.² Makar, R.³

6
- 0012296128
- Multiagent planning with factored MDPs
- C. Guestrin, D. Koller, and R. Parr Multiagent planning with factored MDPs Advances in Neural Information Processing Systems 14 2001 1523 1530
- (2001) Advances in Neural Information Processing Systems , vol.14 , pp. 1523-1530
- Guestrin, C.¹ Koller, D.² Parr, R.³

7
- 40949099898
- Utile coordination: Learning interdependencies among cooperative agents
- J. Kok, P. Hoen, B. Bakker, N. Vlassis, Utile coordination: learning interdependencies among cooperative agents, in: IEEE Symp. Computational Intelligence and Games, 2005, pp. 61-68.
- (2005) IEEE Symp. Computational Intelligence and Games , pp. 61-68
- Kok, J.¹ Hoen, P.² Bakker, B.³ Vlassis, N.⁴

8
- 60349107649
- Exploiting factored representations for decentralized execution in multiagent teams
- M. Roth, R. Simmons, M. Veloso, Exploiting factored representations for decentralized execution in multiagent teams, in: Proc. 6th Int. Conf. Autonomous Agents and Multiagent Systems, 2007, pp. 469-475.
- (2007) Proc. 6th Int. Conf. Autonomous Agents and Multiagent Systems , pp. 469-475
- Roth, M.¹ Simmons, R.² Veloso, M.³

9
- 0141591857
- Graphical models for game theory
- M. Kearns, M. Littman, S. Singh, Graphical models for game theory, in: Proc. 17th Conf. Uncertainty in Artificial Intelligence, 2001, pp. 253-260.
- (2001) Proc. 17th Conf. Uncertainty in Artificial Intelligence , pp. 253-260
- Kearns, M.¹ Littman, M.² Singh, S.³

10
- 79958100489
- Action-graph games
- TR-2008-13, Univ. British Columbia
- A. Xin Jiang, K. Leyton-Brown, N. Bhat, Action-graph games, Tech. rep. TR-2008-13, Univ. British Columbia, 2008.
- (2008) Tech. Rep
- Xin Jiang, A.¹ Leyton-Brown, K.² Bhat, N.³

11
- 79958114832
- Complexity of decentralized control: Special cases
- M. Allen, and S. Zilberstein Complexity of decentralized control: special cases Advances in Neural Information Processing Systems 22 2009 19 27
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 19-27
- Allen, M.¹ Zilberstein, S.²

12
- 27344449757
- Decentralized control of cooperative systems: Categorization and complexity analysis
- C. Goldman, and S. Zilberstein Decentralized control of cooperative systems: categorization and complexity analysis Journal of Artificial Intelligence Research 22 2004 143 174 (Pubitemid 41525885)
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 143-174
- Goldman, C.V.¹ Zilberstein, S.²

13
- 78650588227
- Exploiting coordination locales in distributed POMDPs via social model shaping
- P. Varakantham, J. Kwak, M. Taylor, J. Marecki, P. Scerri, M. Tambe, Exploiting coordination locales in distributed POMDPs via social model shaping, in: Proc. 19th Int. Conf. Automated Planning and Scheduling, 2009, pp. 313-320.
- (2009) Proc. 19th Int. Conf. Automated Planning and Scheduling , pp. 313-320
- Varakantham, P.¹ Kwak, J.² Taylor, M.³ Marecki, J.⁴ Scerri, P.⁵ Tambe, M.⁶

14
- 18144382304
- John Wiley & Sons, Inc.
- M. Puterman, and Markov Decision Processes Discrete Stochastic Dynamic Programming 1994 John Wiley & Sons, Inc.
- (1994) Discrete Stochastic Dynamic Programming
- Puterman, M.¹ Decision Processes, M.²

15
- 0032073263
- Planning and acting in partially observable stochastic domains
- PII S000437029800023X
- L. Kaelbling, M. Littman, and A. Cassandra Planning and acting in partially observable stochastic domains Artificial Intelligence 101 1998 99 134 (Pubitemid 128387390)
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

16
- 0036874366
- The complexity of decentralized control of Markov decision processes
- D. Bernstein, R. Givan, N. Immerman, and S. Zilberstein The complexity of decentralized control of Markov decision processes Mathematics of Operations Research 27 4 2002 819 840
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

17
- 0004049893
- Ph.D. thesis, King s College, Cambridge Univ.
- C. Watkins, Learning from delayed rewards, Ph.D. thesis, King s College, Cambridge Univ., 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

18
- 0003487482
- Athena Scientific
- D. Bertsekas, and J. Tsitsiklis Neuro-Dynamic Programming 1996 Athena Scientific
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

19
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- R. Smallwood, and E. Sondik The optimal control of partially observable Markov processes over a finite horizon Operations Research 21 5 1973 1071 1088
- (1973) Operations Research , vol.21 , Issue.5 , pp. 1071-1088
- Smallwood, R.¹ Sondik, E.²

20
- 0032596468
- On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems
- O. Madani, S. Hanks, A. Condon, On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems, in: Proc. 16th AAAI Conf. Artificial Intelligence, 1999, pp. 541-548.
- (1999) Proc. 16th AAAI Conf. Artificial Intelligence , pp. 541-548
- Madani, O.¹ Hanks, S.² Condon, A.³

21
- 0003989210
- Ph.D. thesis, Dept. Computer Sciences, Brown Univ.
- A. Cassandra, Exact and approximate algorithms for partially observable Markov decision processes, Ph.D. thesis, Dept. Computer Sciences, Brown Univ., 1998.
- (1998) Exact and Approximate Algorithms for Partially Observable Markov Decision Processes
- Cassandra, A.¹

22
- 33646244605
- A (revised) survey of approximate methods for solving partially observable Markov decision processes
- National ICT Australia
- D. Aberdeen, A (revised) survey of approximate methods for solving partially observable Markov decision processes, Tech. rep., National ICT Australia, 2003.
- (2003) Tech. Rep.
- Aberdeen, D.¹

23
- 85138579181
- Learning policies for partially observable environments: Scaling up
- M. Littman, A. Cassandra, L. Kaelbling, Learning policies for partially observable environments: scaling up, in: Proc. 12th Int. Conf. Machine Learning, 1995, pp. 362-370.
- (1995) Proc. 12th Int. Conf. Machine Learning , pp. 362-370
- Littman, M.¹ Cassandra, A.² Kaelbling, L.³

24
- 79958154117
- Transition entropy in partially observable Markov decision processes
- F. Melo, M. Ribeiro, Transition entropy in partially observable Markov decision processes, in: Proc. 9th Int. Conf. Intelligent Autonomous Systems, 2006, pp. 282-289.
- (2006) Proc. 9th Int. Conf. Intelligent Autonomous Systems , pp. 282-289
- Melo, F.¹ Ribeiro, M.²

25
- 0000977910
- The complexity of Markov decision processes
- C. Papadimitriou, and J. Tsitsiklis The complexity of Markov decision processes Mathematics of Operations Research 12 3 1987 441 450
- (1987) Mathematics of Operations Research , vol.12 , Issue.3 , pp. 441-450
- Papadimitriou, C.¹ Tsitsiklis, J.²

26
- 51649127552
- Formal models and algorithms for decentralized decision-making under uncertainty
- S. Seuken, and S. Zilberstein Formal models and algorithms for decentralized decision-making under uncertainty Journal of Autonomous Agents and Multiagent Systems 17 2 2008 190 250
- (2008) Journal of Autonomous Agents and Multiagent Systems , vol.17 , Issue.2 , pp. 190-250
- Seuken, S.¹ Zilberstein, S.²

27
- 27344432831
- Solving transition independent decentralized Markov decision processes
- R. Becker, S. Zilberstein, V. Lesser, and C. Goldman Solving transition independent decentralized Markov decision processes Journal of Artificial Intelligence Research 22 2004 423 455 (Pubitemid 41525892)
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

28
- 79958125431
- Ph.D. thesis, Univ. Massachusetts, Amherst
- M. Allen, Interactions in decentralized environments, Ph.D. thesis, Univ. Massachusetts, Amherst, 2009.
- (2009) Interactions in Decentralized Environments
- Allen, M.¹

29
- 0008084202
- Optimal policies for partially observable Markov decision processes
- Dept. Computer Sciences, Brown Univ.
- A. Cassandra, Optimal policies for partially observable Markov decision processes, Tech. rep. CS-94-14, Dept. Computer Sciences, Brown Univ., 1994.
- (1994) Tech. Rep. CS-94-14
- Cassandra, A.¹

30
- 84899992307
- Interaction-driven Markov games for decentralized multiagent planning under uncertainty
- M. Spaan, F. Melo, Interaction-driven Markov games for decentralized multiagent planning under uncertainty, in: Proc. 7th Int. Conf. Autonomous Agents and Multiagent Systems, 2008, pp. 525-532.
- (2008) Proc. 7th Int. Conf. Autonomous Agents and Multiagent Systems , pp. 525-532
- Spaan, M.¹ Melo, F.²

31
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- C. Claus, C. Boutilier, The dynamics of reinforcement learning in cooperative multiagent systems, in: Proc. 15th AAAI Conf. Artificial Intelligence, 1998, pp. 746-752.
- (1998) Proc. 15th AAAI Conf. Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

32
- 0038637209
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- Morgan Kaufman
- M. Tan Multi-agent reinforcement learning: independent vs. cooperative agents Readings in Agents 1997 Morgan Kaufman 487 494
- (1997) Readings in Agents , pp. 487-494
- Tan, M.¹

33
- 33744514808
- Generalised weakened fictitious play
- DOI 10.1016/j.geb.2005.08.005, PII S089982560500103X
- D. Leslie, and E. Collins Generalised weakened fictitious play Games and Economic Behavior 56 2006 285 298 (Pubitemid 43812068)
- (2006) Games and Economic Behavior , vol.56 , Issue.2 , pp. 285-298
- Leslie, D.S.¹ Collins, E.J.²

34
- 0004102479
- MIT Press
- R. Sutton, and A. Barto Reinforcement Learning: An Introduction 1998 MIT Press
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

35
- 0000392613
- Stochastic games
- L. Shapley Stochastic games Proceedings of the National Academy of Sciences 39 1953 1095 1100
- (1953) Proceedings of the National Academy of Sciences , vol.39 , pp. 1095-1100
- Shapley, L.¹

36
- 0010220982
- Planning, learning and coordination in multiagent decision processes
- C. Boutilier, Planning, learning and coordination in multiagent decision processes, in: Theoretical Aspects of Rationality and Knowledge, 1996, pp. 195-210.
- (1996) Theoretical Aspects of Rationality and Knowledge , pp. 195-210
- Boutilier, C.¹

37
- 1142292938
- The communicative multiagent team decision problem: Analyzing teamwork theories and models
- D. Pynadath, and M. Tambe The communicative multiagent team decision problem: analyzing teamworktheories and models Journal of Artificial Intelligence Research 16 2002 389 423 (Pubitemid 43057178)
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 389-423
- Pynadath, D.V.¹ Tambe, M.²

38
- 29344465971
- A framework for sequential planning in multi-agent settings
- P. Gmytrasiewicz, and P. Doshi A framework for sequential planning in multiagent settings Journal of Artificial Intelligence Research 24 2005 49 79 (Pubitemid 43130932)
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 49-79
- Gmytrasiewicz, P.J.¹ Doshi, P.²

39
- 4544325183
- Approximate solutions for partially observable stochastic games with common payoffs
- R. Emery-Montemerlo, G. Gordon, J. Schneider, S. Thrun, Approximate solutions for partially observable stochastic games with common payoffs, in: Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems, 2004, pp. 136-143.
- (2004) Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems , pp. 136-143
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

40
- 60349124018
- Ph.D. thesis, Carnegie Mellon University, August 2007
- M. Roth, Execution-time communication decisions for coordination of multiagent teams, Ph.D. thesis, Carnegie Mellon University, August 2007.
- Execution-time Communication Decisions for Coordination of Multiagent Teams
- Roth, M.¹

41
- 1142293055
- Transition-independent decentralized Markov decision processes
- R. Becker, S. Zilberstein, V. Lesser, C. Goldman, Transition-independent decentralized Markov decision processes, in: Proc. 2nd Int. Conf. Autonomous Agents and Multiagent Systems, 2003, pp. 41-48.
- (2003) Proc. 2nd Int. Conf. Autonomous Agents and Multiagent Systems , pp. 41-48
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.⁴

42
- 0034819292
- Hierarchical multiagent reinforcement learning
- R. Makar, S. Mahadevan, Hierarchical multiagent reinforcement learning, in: Proc. 5th Int. Conf. Autonomous Agents, 2001, pp. 246-253.
- (2001) Proc. 5th Int. Conf. Autonomous Agents , pp. 246-253
- Makar, R.¹ Mahadevan, S.²

43
- 0036923118
- Context-specific multiagent coordination and planning with factored MDPs
- C. Guestrin, S. Venkataraman, D. Koller, Context-specific multiagent coordination and planning with factored MDPs, in: Proc. 18th AAAI Conf. Artificial Intelligence, 2002, pp. 253-259.
- (2002) Proc. 18th AAAI Conf. Artificial Intelligence , pp. 253-259
- Guestrin, C.¹ Venkataraman, S.² Koller, D.³

44
- 47149086135
- Sparse cooperative Q-learning
- J. Kok, N. Vlassis, Sparse cooperative Q-learning, in: Proc. 21st Int. Conf. Machine Learning, 2004, pp. 61-68.
- (2004) Proc. 21st Int. Conf. Machine Learning , pp. 61-68
- Kok, J.¹ Vlassis, N.²

45
- 33748694986
- Computing Nash equilibria of action-graph games
- N. Bhat, K. Leyton-Brown, Computing Nash equilibria of action-graph games, in: Proc. 20th Conf. Uncertainty in Artificial Intelligence, 2004, pp. 35-42.
- (2004) Proc. 20th Conf. Uncertainty in Artificial Intelligence , pp. 35-42
- Bhat, N.¹ Leyton-Brown, K.²

46
- 33750710723
- A polynomial-time algorithm for action-graph games
- Proceedings of the 21st National Conference on Artificial Intelligence and the 18th Innovative Applications of Artificial Intelligence Conference, AAAI-06/IAAI-06
- A. Xin Jiang, K. Leyton-Brown, A polynomial-time algorithm for action-graph games, in: Proc. 21st AAAI Conf. Artificial Intelligence, 2006, pp. 679-684. (Pubitemid 44705361)
- (2006) Proceedings of the National Conference on Artificial Intelligence , vol.1 , pp. 679-684
- Jiang, A.X.¹ Leyton-Brown, K.²

47
- 36349034015
- Computing pure nash equilibria in symmetric action graph games
- AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
- A. Xin Jiang, K. Leyton-Brown, Computing pure Nash equilibria in symmetric action-graph games, in: Proc. 22nd AAAI Conf. Artificial Intelligence, 2007, pp. 79-85. (Pubitemid 350149556)
- (2007) Proceedings of the National Conference on Artificial Intelligence , vol.1 , pp. 79-85
- Jiang, A.X.¹ Leyton-Brown, K.²

48
- 4544301377
- Decentralized Markov decision processes with event-driven interactions
- R. Becker, V. Lesser, S. Zilberstein, Decentralized Markov decision processes with event-driven interactions, in: Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems, 2004, pp. 302-309.
- (2004) Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems , pp. 302-309
- Becker, R.¹ Lesser, V.² Zilberstein, S.³

49
- 33846298515
- Analyzing myopic approaches for multi-agent communication
- DOI 10.1109/IAT.2005.44, 1565602, Proceedings - 2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'05
- R. Becker, V. Lesser, S. Zilberstein, Analyzing myopic approaches for multiagent communication, in: Proc. IEEE Int. Conf. Intelligent Agent Technology, 2005, pp. 550-557. (Pubitemid 46116612)
- (2005) Proceedings - 2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'05 , vol.2005 , pp. 550-557
- Becker, R.¹ Lesser, V.² Zilberstein, S.³

50
- 59449098657
- Analyzing myopic approaches for multiagent communications
- R. Becker, A. Carlin, V. Lesser, and S. Zilberstein Analyzing myopic approaches for multiagent communications Computational Intelligence 25 1 2009 31 50
- (2009) Computational Intelligence , vol.25 , Issue.1 , pp. 31-50
- Becker, R.¹ Carlin, A.² Lesser, V.³ Zilberstein, S.⁴

51
- 41549144251
- On a global upper bound for Jensen s inequality
- S. Simic On a global upper bound for Jensen s inequality Journal of Mathematical Analysis and Applications 343 2008 414 419
- (2008) Journal of Mathematical Analysis and Applications , vol.343 , pp. 414-419
- Simic, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.