메뉴 건너뛰기




Volumn 1, Issue , 2013, Pages 563-570

Approximate solutions for factored Dec-POMDPs with many agents

Author keywords

Factored decentralized partially observable Markov decision processes; Multi agent planning

Indexed keywords

AUTONOMOUS AGENTS; MULTI AGENT SYSTEMS; SCALABILITY;

EID: 84899420781     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (50)

References (34)
  • 1
    • 4544344444 scopus 로고    scopus 로고
    • Transition-independent decentralized Markov decision processes
    • R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman. Transition-independent decentralized Markov decision processes. In AAMAS, 2003.
    • (2003) AAMAS
    • Becker, R.1    Zilberstein, S.2    Lesser, V.3    Goldman, C.V.4
  • 2
    • 0036874366 scopus 로고    scopus 로고
    • The complexity of decentralized control of Markov decision processes
    • D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. Math, of OR, 27(4): 819-840, 2002.
    • (2002) Math, of OR , vol.27 , Issue.4 , pp. 819-840
    • Bernstein, D.S.1    Givan, R.2    Immerman, N.3    Zilberstein, S.4
  • 3
    • 0346942368 scopus 로고    scopus 로고
    • Decision-theoretic planning: Structural assumptions and computational leverage
    • C. Boutilier, T. Dean, and S. Hanks. Decision-theoretic planning: Structural assumptions and computational leverage. J AIR, 11: 1-94, 1999.
    • (1999) J AIR , vol.11 , pp. 1-94
    • Boutilier, C.1    Dean, T.2    Hanks, S.3
  • 4
    • 0002436850 scopus 로고    scopus 로고
    • Tractable inference for complex stochastic processes
    • X. Boyen and D. Koller. Tractable inference for complex stochastic processes. In UAI, 1998.
    • (1998) UAI
    • Boyen, X.1    Koller, D.2
  • 5
    • 4544325183 scopus 로고    scopus 로고
    • Approximate solutions for partially observable stochastic games with common payoffs
    • R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Approximate solutions for partially observable stochastic games with common payoffs. In AAMAS, 2004.
    • (2004) AAMAS
    • Emery-Montemerlo, R.1    Gordon, G.2    Schneider, J.3    Thrun, S.4
  • 7
    • 84899946303 scopus 로고    scopus 로고
    • Decentralised coordination of low-power embedded devices using the max-sum algorithm
    • A. Farinelli, A. Rogers, A. Petcu, and N. R. Jennings. Decentralised coordination of low-power embedded devices using the max-sum algorithm. In AAMAS, 2008.
    • (2008) AAMAS
    • Farinelli, A.1    Rogers, A.2    Petcu, A.3    Jennings, N.R.4
  • 8
    • 4544318426 scopus 로고    scopus 로고
    • Efficient solution algorithms for factored MDPs
    • C. Guestrin, D. Koller, R. Parr, and S. Venkataraman. Efficient solution algorithms for factored MDPs. JAIR, 19: 399-468, 2003.
    • (2003) JAIR , vol.19 , pp. 399-468
    • Guestrin, C.1    Koller, D.2    Parr, R.3    Venkataraman, S.4
  • 10
    • 33748543203 scopus 로고    scopus 로고
    • Collaborative multiagent reinforcement learning by payoff propagation
    • J. R. Kok and N. Vlassis. Collaborative multiagent reinforcement learning by payoff propagation. JMLR, 7: 1789-1828, 2006.
    • (2006) JMLR , vol.7 , pp. 1789-1828
    • Kok, J.R.1    Vlassis, N.2
  • 11
    • 84880688552 scopus 로고    scopus 로고
    • Computing factored value functions for policies in structured MDPs
    • D. Koller and R. Parr. Computing factored value functions for policies in structured MDPs. In IJCAI, 1999.
    • (1999) IJCAI
    • Koller, D.1    Parr, R.2
  • 12
    • 84899828955 scopus 로고    scopus 로고
    • Constraint-based dynamic programming for decentralized POMDPs with structured interactions
    • A. Kumar and S. Zilberstein. Constraint-based dynamic programming for decentralized POMDPs with structured interactions. In AAMAS, 2009.
    • (2009) AAMAS
    • Kumar, A.1    Zilberstein, S.2
  • 13
    • 84868288428 scopus 로고    scopus 로고
    • Scalable multiagent planning using probabilistic inference
    • A. Kumar, S. Zilberstein, and M. Toussaint. Scalable multiagent planning using probabilistic inference. In IJCAI, 2011.
    • (2011) IJCAI
    • Kumar, A.1    Zilberstein, S.2    Toussaint, M.3
  • 14
    • 84899969517 scopus 로고    scopus 로고
    • Not all agents are equal: Scaling up distributed POMDPs for agent networks
    • J. Marecki, T. Gupta, P. Varakantham, M. Tambe, and M. Yokoo. Not all agents are equal: scaling up distributed POMDPs for agent networks. In AAMAS, 2008.
    • (2008) AAMAS
    • Marecki, J.1    Gupta, T.2    Varakantham, P.3    Tambe, M.4    Yokoo, M.5
  • 15
    • 0007788905 scopus 로고    scopus 로고
    • The factored frontier algorithm for approximate inference in DBNs
    • K. P. Murphy and Y. Weiss. The factored frontier algorithm for approximate inference in DBNs. In UAI, 2001.
    • (2001) UAI
    • Murphy, K.P.1    Weiss, Y.2
  • 16
    • 34247214638 scopus 로고    scopus 로고
    • Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
    • R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In AAAI, 2005.
    • (2005) AAAI
    • Nair, R.1    Varakantham, P.2    Tambe, M.3    Yokoo, M.4
  • 18
    • 84868288325 scopus 로고    scopus 로고
    • Decentralized POMDPs
    • M. Wiering and M. van Otterlo, editors Springer Berlin Heidelberg
    • F. A. Oliehoek. Decentralized POMDPs. In M. Wiering and M. van Otterlo, editors, Reinforcement Learning: State of the Art. Springer Berlin Heidelberg, 2012.
    • (2012) Reinforcement Learning: State of the Art
    • Oliehoek, F.A.1
  • 19
    • 57349184659 scopus 로고    scopus 로고
    • The cross-entropy method for policy search in decentralized POMDPs
    • F. A. Oliehoek, J. F. Kooi, and N. Vlassis. The cross-entropy method for policy search in decentralized POMDPs. Informatica, 32: 341-357, 2008.
    • (2008) Informatica , vol.32 , pp. 341-357
    • Oliehoek, F.A.1    Kooi, J.F.2    Vlassis, N.3
  • 20
    • 52249098423 scopus 로고    scopus 로고
    • Optimal and approximate Q-value functions for decentralized POMDPs
    • F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32: 289-353, 2008.
    • (2008) JAIR , vol.32 , pp. 289-353
    • Oliehoek, F.A.1    Spaan, M.T.J.2    Vlassis, N.3
  • 22
    • 84885985853 scopus 로고    scopus 로고
    • Exploiting structure in cooperative Bayesian games
    • F. A. Oliehoek, S. Whiteson, and M. T. J. Spaan. Exploiting structure in cooperative Bayesian games. In UAI, 2012.
    • (2012) UAI
    • Oliehoek, F.A.1    Whiteson, S.2    Spaan, M.T.J.3
  • 23
    • 84860644195 scopus 로고    scopus 로고
    • Efficient planning for factored infinite-horizon DEC-POMDPs
    • J. Pajarinen and J. Peltonen. Efficient planning for factored infinite-horizon DEC-POMDPs. In IJCAI, 2011.
    • (2011) IJCAI
    • Pajarinen, J.1    Peltonen, J.2
  • 24
    • 84881044687 scopus 로고    scopus 로고
    • The complexity of multiagent systems: The price of silence
    • Z. Rabinovich, C. V. Goldman, and J. S. Rosenschein. The complexity of multiagent systems: the price of silence. In AAMAS, 2003.
    • (2003) AAMAS
    • Rabinovich, Z.1    Goldman, C.V.2    Rosenschein, J.S.3
  • 25
    • 78650949545 scopus 로고    scopus 로고
    • Bounded approximate decentralised coordination via the max-sum algorithm
    • A. Rogers, A. Farinelli, R. Stranders, and N. Jennings. Bounded approximate decentralised coordination via the max-sum algorithm. Artif Intel., 175(2): 730-759, 2011.
    • (2011) Artif Intel. , vol.175 , Issue.2 , pp. 730-759
    • Rogers, A.1    Farinelli, A.2    Stranders, R.3    Jennings, N.4
  • 26
    • 51649127552 scopus 로고    scopus 로고
    • Formal models and algorithms for decentralized decision making under uncertainty
    • S. Seuken and S. Zilberstein. Formal models and algorithms for decentralized decision making under uncertainty. Autonomous Agents and Multi-Agent Systems, 17(2): 190-250, 2008.
    • (2008) Autonomous Agents and Multi-Agent Systems , vol.17 , Issue.2 , pp. 190-250
    • Seuken, S.1    Zilberstein, S.2
  • 27
    • 84868299292 scopus 로고    scopus 로고
    • Scaling up optimal heuristic search in dec-POMDPs via incremental expansion
    • M. T. J. Spaan, F. A. Oliehoek, and C. Amato. Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion. In IJCAI, 2011.
    • (2011) IJCAI
    • Spaan, M.T.J.1    Oliehoek, F.A.2    Amato, C.3
  • 28
    • 80053226937 scopus 로고    scopus 로고
    • *: A heuristic search algorithm for solving decentralized POMDPs
    • *: A heuristic search algorithm for solving decentralized POMDPs. In UAI, 2005.
    • (2005) UAI
    • Szer, D.1    Charpillet, F.2    Zilberstein, S.3
  • 29
    • 68949157375 scopus 로고    scopus 로고
    • Transfer learning for reinforcement learning domains: A survey
    • M. E. Taylor and P. Stone. Transfer learning for reinforcement learning domains: A survey. JMLR, 10: 1633-1685, 2009.
    • (2009) JMLR , vol.10 , pp. 1633-1685
    • Taylor, M.E.1    Stone, P.2
  • 31
    • 78650622568 scopus 로고    scopus 로고
    • Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies
    • P. Varakantham, J. Marecki, Y. Yabu, M. Tambe, and M. Yokoo. Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies. In AAMAS, 2007.
    • (2007) AAMAS
    • Varakantham, P.1    Marecki, J.2    Yabu, Y.3    Tambe, M.4    Yokoo, M.5
  • 32
    • 84899454751 scopus 로고    scopus 로고
    • Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents
    • P. Velagapudi, P. Varakantham, P. Scerri, and K. Sycara. Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents. In AAMAS, 2011.
    • (2011) AAMAS
    • Velagapudi, P.1    Varakantham, P.2    Scerri, P.3    Sycara, K.4
  • 33
    • 78650593547 scopus 로고    scopus 로고
    • Influence-based policy abstraction for weakly-coupled dec-POMDPs
    • S. J. Witwicki and E. H. Durfee. Influence-based policy abstraction for weakly-coupled Dec-POMDPs. In ICAPS, 2010.
    • (2010) ICAPS
    • Witwicki, S.J.1    Durfee, E.H.2
  • 34
    • 80053153738 scopus 로고    scopus 로고
    • Rollout sampling policy iteration for decentralized POMDPs
    • F. Wu, S. Zilberstein, and X. Chen. Rollout sampling policy iteration for decentralized POMDPs. In UAI, 2010.
    • (2010) UAI
    • Wu, F.1    Zilberstein, S.2    Chen, X.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.