Volume 175, Issue 11, 2011, Pages 1757-1789

Decentralized MDPs with sparse interactions

Author keywords

Decentralized Markov decision processes; Multiagent coordination; Sparse interaction

Indexed keywords

DECISION-THEORETIC; INDEPENDENT AGENTS; LOCAL INTERACTIONS; MARKOV DECISION PROCESSES; MULTI-AGENT; MULTI-AGENT COORDINATIONS; NEW MODEL; SOLUTION METHODS; SPARSE INTERACTION; STATE SPACE; THEORETICAL ERRORS

EID: 79955976414     PISSN: 00043702     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.artint.2011.05.001     Document Type: Article
Times cited: 85

References (51)
  • 3
    • 27344450974
    • Hybrid BDI-POMDP framework for multiagent teaming
    • R. Nair, M. Tambe, Hybrid BDI-POMDP framework for multiagent teaming, Journal of Artificial Intelligence Research 23 (2005) 367-420. (Pubitemid 41525873)
    • (2005) Journal of Artificial Intelligence Research , vol.23 , pp. 367-420
    • Nair, R.1    Tambe, M.2
  • 4
    • 0032645144
    • Team-partitioned, opaque-transition reinforcement learning
    • P. Stone, M. Veloso, Team-partitioned, opaque-transition reinforcement learning, in: Proc. RoboCup-98, 1998, pp. 206-212.
    • (1998) Proc. RoboCup-98 , pp. 206-212
    • Stone, P.1    Veloso, M.2
  • 10
    • 79958100489
    • Action-graph games
    • TR-2008-13, Univ. British Columbia
    • A. Xin Jiang, K. Leyton-Brown, N. Bhat, Action-graph games, Tech. rep. TR-2008-13, Univ. British Columbia, 2008.
    • (2008) Tech. Rep
    • Xin Jiang, A.1    Leyton-Brown, K.2    Bhat, N.3
  • 12
    • 27344449757
    • Decentralized control of cooperative systems: Categorization and complexity analysis
    • C. Goldman, S. Zilberstein, Decentralized control of cooperative systems: categorization and complexity analysis, Journal of Artificial Intelligence Research 22 (2004) 143-174. (Pubitemid 41525885)
    • (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 143-174
    • Goldman, C.V.1    Zilberstein, S.2
  • 15
    • 0032073263
    • Planning and acting in partially observable stochastic domains
    • PII S000437029800023X
    • L. Kaelbling, M. Littman, A. Cassandra, Planning and acting in partially observable stochastic domains, Artificial Intelligence 101 (1998) 99-134. (Pubitemid 128387390)
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 19
    • 0015658957
    • The optimal control of partially observable Markov processes over a finite horizon
    • R. Smallwood, E. Sondik, The optimal control of partially observable Markov processes over a finite horizon, Operations Research 21 (5) (1973) 1071-1088.
    • (1973) Operations Research , vol.21 , Issue.5 , pp. 1071-1088
    • Smallwood, R.1    Sondik, E.2
  • 20
    • 0032596468
    • On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems
    • O. Madani, S. Hanks, A. Condon, On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems, in: Proc. 16th AAAI Conf. Artificial Intelligence, 1999, pp. 541-548.
    • (1999) Proc. 16th AAAI Conf. Artificial Intelligence , pp. 541-548
    • Madani, O.1    Hanks, S.2    Condon, A.3
  • 22
    • 33646244605
    • A (revised) survey of approximate methods for solving partially observable Markov decision processes
    • National ICT Australia
    • D. Aberdeen, A (revised) survey of approximate methods for solving partially observable Markov decision processes, Tech. rep., National ICT Australia, 2003.
    • (2003) Tech. Rep.
    • Aberdeen, D.1
  • 26
    • 51649127552
    • Formal models and algorithms for decentralized decision-making under uncertainty
    • S. Seuken, S. Zilberstein, Formal models and algorithms for decentralized decision-making under uncertainty, Journal of Autonomous Agents and Multiagent Systems 17 (2) (2008) 190-250.
    • (2008) Journal of Autonomous Agents and Multiagent Systems , vol.17 , Issue.2 , pp. 190-250
    • Seuken, S.1    Zilberstein, S.2
  • 29
    • 0008084202
    • Optimal policies for partially observable Markov decision processes
    • Dept. Computer Sciences, Brown Univ.
    • A. Cassandra, Optimal policies for partially observable Markov decision processes, Tech. rep. CS-94-14, Dept. Computer Sciences, Brown Univ., 1994.
    • (1994) Tech. Rep. CS-94-14
    • Cassandra, A.1
  • 30
    • 84899992307
    • Interaction-driven Markov games for decentralized multiagent planning under uncertainty
    • M. Spaan, F. Melo, Interaction-driven Markov games for decentralized multiagent planning under uncertainty, in: Proc. 7th Int. Conf. Autonomous Agents and Multiagent Systems, 2008, pp. 525-532.
    • (2008) Proc. 7th Int. Conf. Autonomous Agents and Multiagent Systems , pp. 525-532
    • Spaan, M.1    Melo, F.2
  • 31
    • 0031630561
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • C. Claus, C. Boutilier, The dynamics of reinforcement learning in cooperative multiagent systems, in: Proc. 15th AAAI Conf. Artificial Intelligence, 1998, pp. 746-752.
    • (1998) Proc. 15th AAAI Conf. Artificial Intelligence , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 32
    • 0038637209
    • Multi-agent reinforcement learning: Independent vs. cooperative agents
    • Morgan Kaufmann
    • M. Tan, Multi-agent reinforcement learning: independent vs. cooperative agents, in: Readings in Agents, Morgan Kaufmann, 1997, pp. 487-494.
    • (1997) Readings in Agents , pp. 487-494
    • Tan, M.1
  • 33
    • 33744514808
    • Generalised weakened fictitious play
    • DOI 10.1016/j.geb.2005.08.005, PII S089982560500103X
    • D. Leslie, E. Collins, Generalised weakened fictitious play, Games and Economic Behavior 56 (2006) 285-298. (Pubitemid 43812068)
    • (2006) Games and Economic Behavior , vol.56 , Issue.2 , pp. 285-298
    • Leslie, D.S.1    Collins, E.J.2
  • 36
    • 0010220982
    • Planning, learning and coordination in multiagent decision processes
    • C. Boutilier, Planning, learning and coordination in multiagent decision processes, in: Theoretical Aspects of Rationality and Knowledge, 1996, pp. 195-210.
    • (1996) Theoretical Aspects of Rationality and Knowledge , pp. 195-210
    • Boutilier, C.1
  • 37
    • 1142292938
    • The communicative multiagent team decision problem: Analyzing teamwork theories and models
    • D. Pynadath, M. Tambe, The communicative multiagent team decision problem: analyzing teamwork theories and models, Journal of Artificial Intelligence Research 16 (2002) 389-423. (Pubitemid 43057178)
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 389-423
    • Pynadath, D.V.1    Tambe, M.2
  • 46
    • 33750710723
    • A polynomial-time algorithm for action-graph games
    • Proceedings of the 21st National Conference on Artificial Intelligence and the 18th Innovative Applications of Artificial Intelligence Conference, AAAI-06/IAAI-06
    • A. Xin Jiang, K. Leyton-Brown, A polynomial-time algorithm for action-graph games, in: Proc. 21st AAAI Conf. Artificial Intelligence, 2006, pp. 679-684. (Pubitemid 44705361)
    • (2006) Proceedings of the National Conference on Artificial Intelligence , vol.1 , pp. 679-684
    • Jiang, A.X.1    Leyton-Brown, K.2
  • 47
    • 36349034015
    • Computing pure Nash equilibria in symmetric action-graph games
    • AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
    • A. Xin Jiang, K. Leyton-Brown, Computing pure Nash equilibria in symmetric action-graph games, in: Proc. 22nd AAAI Conf. Artificial Intelligence, 2007, pp. 79-85. (Pubitemid 350149556)
    • (2007) Proceedings of the National Conference on Artificial Intelligence , vol.1 , pp. 79-85
    • Jiang, A.X.1    Leyton-Brown, K.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.